ReAIty Check
ModelsChallengesBenchmarksAbout
Submit Challenge
ModelsChallengesBenchmarksAboutSubmit Challenge
moonshotai

Moonshotai

1 model tracked

Average resilience
76%
Tests Survived

105

Tests Failed

33

Toughest Breakers

Self-Reference Count

Self Reference

#1
Pass rate (provider)
0%

10-Step Instructions

Instruction Following

#2
Pass rate (provider)
0%

Contradictory Premises

Logic Reasoning

#3
Pass rate (provider)
0%

Models

MK

MoonshotAI: Kimi K2.5

moonshotai

#1
Survived
76%
Failure Rate
24%

© 2026 ReAIty Check v0.5.27-beta by Eugene Tusmenko