Will the next LLM released by OpenAI be worse than GPT-4 at MMLU?
Ṁ1163 Jan 1
16% chance
In cases of doubt, "the next LLM" means the next family of LLMs; an updated version of a previous model doesn't count as a new model for this market.
This question is managed and resolved by Manifold.
AFAIK, OpenAI has never released a model worse than its previous one. (gpt-3.5-turbo and text-davinci-003 perform about the same overall, so I'm not counting that.) I'm also pretty sure ada/babbage/curie/davinci were all released at the same time, and the best of those, davinci, was better than early GPT-3, so that wouldn't count either.
Related questions
Will there be an OpenAI LLM known as GPT-4.5 by 2033?
72% chance
When will OpenAI release a more capable LLM?
Will OpenAI's next major LLM (after GPT-4) solve more than 2 of the first 5 new Project Euler problems?
45% chance
Will xAI develop a more capable LLM than GPT-5 by 2026?
65% chance
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
66% chance
How much time will pass between an LLM being released that beats GPT4 and the next OpenAI LLM being released? (+ANSWERS)
Will there be an LLM (as good as GPT-4) that was trained with 1/100th the energy consumed to train GPT-4, by 2026?
82% chance
Will the next major LLM by OpenAI use a new tokenizer?
77% chance
Will Llama-4 be (open sourced and) as good as GPT-4?
87% chance
Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?
80% chance