Deeptech Press Release

DeepSeek Unveils Self-Verifying AI Math Reasoning Model

By StartupStory | November 30, 2025

DeepSeekMath-V2 Achieves Gold-Medal IMO Performance With Built-In Proof Checker

Chinese AI startup DeepSeek has released DeepSeekMath-V2, an open-weights mathematical reasoning model with self-verifying capabilities: it generates proofs while checking their logical soundness. The 685B-parameter mixture-of-experts model, built on DeepSeek-V3.2-Exp-Base, is available under the Apache 2.0 license on Hugging Face and GitHub, a release positioned as a breakthrough in reliable AI theorem proving.

Verifier-First Architecture Overcomes Final-Answer Limitations

Unlike models trained solely to reach correct numeric answers, DeepSeekMath-V2 prioritises proof quality through a dual system: a “prover” generates step-by-step arguments, while a “verifier” assigns each proof a rigor score (0, 0.5, or 1) along with a natural-language analysis. Reinforcement learning rewards the prover when its self-assessment agrees with the verifier’s judgment, and for complex problems a sequential refinement loop iterates on fixes within a 128K-token context.
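The prover, verifier, and refinement loop described above can be sketched in a few lines of Python. This is a minimal illustration, not DeepSeek’s implementation: the names `Verdict`, `refine`, `agreement_reward`, `toy_prover`, and `toy_verifier` are hypothetical, the round budget is arbitrary, and the linear reward shape is an assumption, since the article only says that agreement is rewarded.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Verdict:
    score: float   # rigor score: 0, 0.5, or 1
    analysis: str  # natural-language critique of the proof


# Hypothetical interfaces: a prover maps (problem, feedback) to a proof,
# and a verifier maps (problem, proof) to a Verdict.
Prover = Callable[[str, Optional[str]], str]
Verifier = Callable[[str, str], Verdict]


def refine(problem: str, prover: Prover, verifier: Verifier,
           max_rounds: int = 4) -> Tuple[str, Verdict]:
    """Generate a proof, then revise it until the verifier gives full marks
    or the round budget runs out."""
    proof = prover(problem, None)
    verdict = verifier(problem, proof)
    for _ in range(max_rounds - 1):
        if verdict.score == 1.0:
            break
        # Feed the verifier's critique back to the prover for the next attempt.
        proof = prover(problem, verdict.analysis)
        verdict = verifier(problem, proof)
    return proof, verdict


def agreement_reward(self_score: float, verifier_score: float) -> float:
    """Assumed reward shape: 1 when the self-assessment matches the verifier,
    falling linearly as the two scores diverge."""
    return 1.0 - abs(self_score - verifier_score)


# Toy stand-ins so the sketch runs without any model.
def toy_prover(problem: str, feedback: Optional[str]) -> str:
    return "draft proof" if feedback is None else "revised proof"


def toy_verifier(problem: str, proof: str) -> Verdict:
    if proof == "revised proof":
        return Verdict(1.0, "all steps justified")
    return Verdict(0.5, "step 2 lacks justification")


final_proof, final_verdict = refine("toy problem", toy_prover, toy_verifier)
```

In this toy run the first draft scores 0.5, the critique is passed back to the prover, and the revised proof scores 1, which ends the loop. In a real system both callables would be model calls sharing the long context.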

This addresses two key flaws of final-answer training: algebraic errors can cancel out and still yield the right answer, and a proof can reach a correct result without a rigorous derivation. Trained initially on more than 17,500 olympiad-style proofs from Art of Problem Solving, the system then scales verification compute to label harder cases autonomously.
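A toy analogue shows why grading only the final answer is unreliable. The sketch below is an illustration chosen for this article, not an example from DeepSeek: the hypothetical function `strike_shared_digit` applies a deliberately invalid rule (deleting a digit shared by numerator and denominator), which happens to give the correct value for 16/64 but not for 12/24.

```python
from fractions import Fraction


def strike_shared_digit(num: int, den: int) -> Fraction:
    """Deliberately invalid 'simplification': delete the first digit that
    appears in both numerator and denominator."""
    n, d = str(num), str(den)
    for ch in n:
        if ch in d:
            return Fraction(int(n.replace(ch, "", 1)), int(d.replace(ch, "", 1)))
    return Fraction(num, den)


def final_answer_ok(num: int, den: int) -> bool:
    """Final-answer grading: compare only the result with the true value."""
    return strike_shared_digit(num, den) == Fraction(num, den)


lucky = final_answer_ok(16, 64)    # bogus step, right answer (1/4)
exposed = final_answer_ok(12, 24)  # same bogus step, wrong answer (1/4 vs 1/2)
```

A grader that sees only the 16/64 case would accept the method. A verifier that inspects the step itself would reject it in both cases.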

Competition-Dominating Benchmarks

With scaled test-time compute, DeepSeekMath-V2 achieves gold-medal-level scores on the 2025 International Mathematical Olympiad (IMO) problems and the 2024 Chinese Mathematical Olympiad (CMO), outperforming prior open models. It scores 118/120 on the 2024 Putnam exam, surpassing the top human score (90/120), and on IMO-ProofBench it outperforms Google DeepMind’s Deep Think.

These results place it alongside closed models from OpenAI and Google DeepMind, and make it the first open-weight system to reach IMO gold-medal level, although it did not formally enter the competition.

Implications For Research And Open AI

Deeper reasoning of this kind could benefit fields such as cryptography and space exploration, where unsolved mathematical problems remain. DeepSeek highlights self-verification as a path to trustworthy mathematical AI. Separately, China’s open models have gained a 17% share of global downloads, according to a recent MIT-Hugging Face analysis. Future iterations could automate proof discovery at scale.
