Will the next LLM released by OpenAI be worse than GPT-4 at MMLU? | Manifold

Ṁ1163

Jan 1

16% chance

In cases of doubt, "the next LLM" means the next family of LLMs: an updated version of a previous model doesn't count as a new model for this market.

This question is managed and resolved by Manifold.

#New Year's Resolutions 2024

#— LLM & AI Capabilities—

bought Ṁ250 NO

No https://openai.com/index/learning-to-reason-with-llms/

@Manifold please resolve

How big a bump up counts as the next one @firstuserhere? 4.5? 5? A new variant of 4?

AFAIK OpenAI has not released a model worse than its previous model before. (gpt-3.5-turbo and text-davinci-003 perform about the same overall, so I'm not counting that.) I'm also pretty sure ada/babbage/curie/davinci were all released at the same time, and the best of those, davinci, was better than early GPT-3, so that wouldn't count either.

Related questions

Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?

Will OpenAI's next major LLM (after GPT-4) solve more than 2 of the first 5 new Project Euler problems?

Will there be an LLM (as good as GPT-4) that was trained with 1/100th the energy consumed to train GPT-4, by 2026?

Will xAI develop a more capable LLM than GPT-5 by 2026

Will the next major LLM by OpenAI use a new tokenizer?

Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?

Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?

Will an open-source LLM beat or match GPT-4 by the end of 2024?

How much time will pass between an LLM being released that beats GPT4 and the next OpenAI LLM being released? (+ANSWERS)

Will OpenAI's next major LLM (after GPT-4) feature natural and convenient speech-to-speech capabilities?
