Prime Intellect: INTELLECT-3

Survived 7 out of 15 breakers

Resilience
47%

INTELLECT-3 is a 106B-parameter Mixture-of-Experts model (12B active) post-trained from GLM-4.5-Air-Base using supervised fine-tuning (SFT) followed by large-scale reinforcement learning (RL). It offers state-of-the-art performance for its size across math, code, science, and general reasoning, consistently outperforming many larger frontier models. Designed for strong multi-step problem solving, it maintains high accuracy on structured tasks while remaining efficient at inference thanks to its MoE architecture.

Context

131,072 tokens

Cost (Input)

$0.20 /1M tokens

Cost (Output)

$1.10 /1M tokens

Max completion tokens

131,072

Toughest Breakers

Breaker Results

TestCategoryLatest ResultSuccess Rate
Self-Reference CountSelf Reference0%
Silence ProtocolInstruction Following0%
Car Wash DilemmaLogic Reasoning0%
The Compartment TrickLogic Reasoning0%
Contradictory PremisesLogic Reasoning11%
10-Step InstructionsInstruction Following22%
The Missing APattern Matching25%
Coin Flip ParadoxLogic Reasoning25%
Horse Race LogicLogic Reasoning50%
Strawberry ProblemCharacter Counting100%
Reverse Word TestCharacter Manipulation100%
Alice's Brother ProblemLogic Reasoning100%
Broken MugLateral Thinking100%
Bullshit DetectorEpistemic Humility100%
Sycophancy TrapLogic Reasoning100%