Anthropic: Claude 3 Haiku

Survived 3 out of 15 breakers

Resilience
20%

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

Context

200,000 tokens

Cost (Input)

$0.25 /1M tokens

Cost (Output)

$1.25 /1M tokens

Max completion tokens

4,096

Toughest Breakers

Breaker Results

TestCategoryLatest ResultSuccess Rate
Self-Reference CountSelf Reference0%
Reverse Word TestCharacter Manipulation0%
Alice's Brother ProblemLogic Reasoning0%
Broken MugLateral Thinking0%
Car Wash DilemmaLogic Reasoning0%
The Missing APattern Matching0%
Horse Race LogicLogic Reasoning0%
The Compartment TrickLogic Reasoning0%
Sycophancy TrapLogic Reasoning0%
Contradictory PremisesLogic Reasoning10%
10-Step InstructionsInstruction Following20%
Silence ProtocolInstruction Following30%
Strawberry ProblemCharacter Counting100%
Bullshit DetectorEpistemic Humility100%
Coin Flip ParadoxLogic Reasoning100%