ReAIty Check
Models
Challenges
Benchmarks
About
Submit Challenge
DeepSeek
2 models tracked
Average resilience: 62%
Tests Survived: 171
Tests Failed: 107
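The page does not say how the average resilience figure is derived. A minimal sketch, assuming it is simply survived tests divided by total tests run (the function name `resilience_pct` is hypothetical, not part of the site), reproduces the 62% shown from the two counts:

```python
def resilience_pct(survived: int, failed: int) -> float:
    """Share of tests survived, as a percentage.

    Assumption: resilience = survived / (survived + failed).
    The site does not document its actual formula.
    """
    total = survived + failed
    if total == 0:
        raise ValueError("no tests recorded")
    return 100.0 * survived / total


# Provider-level counts taken from the page: 171 survived, 107 failed.
provider_pct = resilience_pct(171, 107)
print(f"{provider_pct:.1f}% (rounds to {round(provider_pct)}%)")
```

The same rounded value also falls out of averaging the two per-model survival rates (63% and 61%), so the counts alone do not reveal which definition the site uses.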
Toughest Breakers
#1 Self-Reference Count (Self Reference): provider pass rate 0%
#2 10-Step Instructions (Instruction Following): provider pass rate 0%
#3 Silence Protocol (Instruction Following): provider pass rate 0%
Models
#1 DeepSeek: R1 0528 (free) (deepseek): survived 63%, failure rate 37%
#2 DeepSeek: DeepSeek V3.2 (deepseek): survived 61%, failure rate 39%