AI Safety Research
Benchmarks, evaluations, and empirical studies that probe how models behave in the gray zone.
View GrayZoneBench

By raxIT Labs
GrayZoneBench
An open-source safety benchmark for the prompts where the right answer is neither a refusal nor full compliance. The gray zone, where real deployment decisions happen.
AI Safety Research
AI Safety
Benchmark
LLM Evaluation