Self-Reference Count
Self Reference
Pass rate
0%
Survived 4 out of 15 breakers
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
1,048,576 tokens
$0.10 /1M tokens
$0.40 /1M tokens
8,192
| Test | Category | Latest Result | Success Rate | |
|---|---|---|---|---|
| Self-Reference Count | Self Reference | 0% | ||
| Contradictory Premises | Logic Reasoning | 0% | ||
| Broken Mug | Lateral Thinking | 0% | ||
| Car Wash Dilemma | Logic Reasoning | 0% | ||
| The Missing A | Pattern Matching | 0% | ||
| Horse Race Logic | Logic Reasoning | 0% | ||
| The Compartment Trick | Logic Reasoning | 0% | ||
| 10-Step Instructions | Instruction Following | 11% | ||
| Reverse Word Test | Character Manipulation | 22% | ||
| Alice's Brother Problem | Logic Reasoning | 33% | ||
| Coin Flip Paradox | Logic Reasoning | 33% | ||
| Strawberry Problem | Character Counting | 89% | ||
| Silence Protocol | Instruction Following | 100% | ||
| Bullshit Detector | Epistemic Humility | 100% | ||
| Sycophancy Trap | Logic Reasoning | 100% |