Total Tests: 10,000
Approved: 9,627
Rejected: 373
Pass Rate: 96.27%
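The pass rate above follows directly from the approved and total counts. A minimal sketch of that arithmetic, assuming the rate is simply approved divided by total and rounded to two decimals (the function name `pass_rate` is illustrative, not part of the dashboard):

```python
def pass_rate(approved: int, total: int) -> float:
    """Return the share of approved tests as a percentage, rounded to 2 decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * approved / total, 2)


# Figures taken from the summary cards above.
total_tests = 10_000
approved = 9_627
rejected = total_tests - approved

print(rejected)                                    # 373
print(f"{pass_rate(approved, total_tests):.2f}%")  # 96.27%
```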
Violations by Category
Distribution of rejected requests across safety violation categories
Test Summary
This dashboard monitors safety integrity tests for DOS 2.0. Each test simulates a red-team-style request designed to probe safety policies.
Test Batch: BATCH-2026-01-21
Safety Model Version: v2.4.1
Policy Categories: 6
Violation Categories
Safety System Bypass
Deception & Manipulation
Surveillance Abuse
Physical Harm Request
Unauthorized Access
Infrastructure Threat
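One way the per-category distribution of rejected requests could be computed is sketched below. The six category names come from the list above. Everything else is an assumption made for illustration: the event field names (`decision`, `violation_category`), the `tally_violations` helper, and the sample events, which are hypothetical and not taken from the dashboard.

```python
from collections import Counter
from enum import Enum


class ViolationCategory(Enum):
    SAFETY_SYSTEM_BYPASS = "Safety System Bypass"
    DECEPTION_AND_MANIPULATION = "Deception & Manipulation"
    SURVEILLANCE_ABUSE = "Surveillance Abuse"
    PHYSICAL_HARM_REQUEST = "Physical Harm Request"
    UNAUTHORIZED_ACCESS = "Unauthorized Access"
    INFRASTRUCTURE_THREAT = "Infrastructure Threat"


def tally_violations(events):
    """Count rejected events per category; categories with no hits stay at 0."""
    counts = Counter({category: 0 for category in ViolationCategory})
    for event in events:
        if event["decision"] == "Rejected":
            counts[ViolationCategory(event["violation_category"])] += 1
    return counts


# Hypothetical sample events, for illustration only.
sample_events = [
    {"decision": "Rejected", "violation_category": "Unauthorized Access"},
    {"decision": "Approved", "violation_category": None},
    {"decision": "Rejected", "violation_category": "Unauthorized Access"},
    {"decision": "Rejected", "violation_category": "Surveillance Abuse"},
]

for category, count in tally_violations(sample_events).items():
    print(f"{category.value}: {count}")
```

Seeding the counter with every category keeps zero-count categories visible, which matters if the result feeds a chart that should always show all six.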
Live Events
Recent test requests and their evaluation results
Last hour
| Timestamp | Request ID | Decision | Violation Category |
|---|---|---|---|