Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
Ṁ803 resolved Sep 16
Resolved YES
Resolves YES if OpenAI's next-generation language model scores 65% or higher on the GPQA benchmark (extended set).
If an existing OpenAI model reaches 65% or higher through post-training enhancements, that also counts.
Scores may improve through prompt engineering after release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases its next model.
This question is managed and resolved by Manifold.
Related questions
Will OpenAI claim that it has achieved AGI in 2025?
20% chance
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
66% chance
Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?
46% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
85% chance
Will OpenAI be in the lead in the AGI race end of 2026?
53% chance
Will there be an AI language model that strongly surpasses ChatGPT and other OpenAI models before the end of 2025?
49% chance
Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?
80% chance
Will OpenAI's next major LLM (after GPT-4) solve more than 2 of the first 5 new Project Euler problems?
45% chance
Will the gap between open-weights and frontier models on GPQA be at most 7%?
52% chance
Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?
25% chance