Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on Simple Bench?
Ṁ138 Feb 2
85% chance
https://simple-bench.com/: Claude 3.5 Sonnet 10/22 achieves 41.4%, whereas the best Gemini model scores 27.1%.
This question is managed and resolved by Manifold.
Related questions
What will be the best score on Cybench by December 31st 2025?
Will any model get above human level on the Simple Bench benchmark before September 1st, 2025?
69% chance
How long until one of Gemini, Claude, etc... match the capabilities of O1?
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
Will Gemini exceed the performance of GPT-4 on the 2022 AMC 10 and AMC 12 exams?
72% chance
What will be true of Gemini 2?
Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Math Evaluation
48% chance
Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Coding Evaluation
28% chance
Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?
18% chance
Will Gemini-1.5-Pro-Exp-0801 Score Above 90.35 (current #1) in Scale AI's Instruction Following Evaluation
53% chance