This market is part of the paper: A Concrete Roadmap towards Safety Cases based on Chain-of-Thought Monitoring
This market resolves based on whether, at each specified date, none of the models considered SOTA is a reasoning model.
Reasoning Model Definition
A "reasoning model" must meet all of the following criteria:
- It is a language model - The system must be able to take language as input and produce language as output. As an example of what would not count: AlphaGo.
- It has been trained to use inference-time compute - The system must have undergone significant training to use more than a single forward pass before giving its final output, with the ability to scale inference compute for better performance.
- The extra inference compute produces an artifact - The way the model uses extra inference compute must produce some artifact, such as a classic chain-of-thought or a list of neuralese activations. For example, a Coconut model counts as a reasoning model here (see the sketch after this list).
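
To make the second and third criteria concrete, here is a minimal Python sketch contrasting a single forward pass with inference-time scaling that leaves behind an artifact. The `model` callable, the function names, and the `FINAL:` convention are hypothetical illustrations, not anything specified by the paper or this market.

```python
def answer_single_pass(model, question: str) -> str:
    # Non-reasoning behaviour: one forward pass, no intermediate artifact.
    return model(question)


def answer_with_reasoning(model, question: str, max_steps: int = 16) -> tuple[str, list[str]]:
    # Reasoning behaviour: the model may use many forward passes before
    # committing to a final answer, and the intermediate steps are recorded.
    chain_of_thought: list[str] = []
    context = question
    for _ in range(max_steps):
        step = model(context)            # one more unit of inference compute
        chain_of_thought.append(step)    # the artifact this market cares about
        context += "\n" + step
        if step.startswith("FINAL:"):
            break
    final_answer = chain_of_thought[-1].removeprefix("FINAL:").strip()
    return final_answer, chain_of_thought
```

Under this sketch, only the second function would count as reasoning: it scales compute across multiple passes and produces an inspectable artifact (`chain_of_thought`), whether that artifact is human-readable text or, as in Coconut, a sequence of latent activations.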
State-of-the-Art (SOTA) Definition
A model is considered "state-of-the-art" if it meets these criteria:
- Widely recognized by AI community consensus as among the 3-5 best models 
- Among the top performers on major benchmarks 
- Deployed status: The model must be either:
  - Publicly deployed (available via API or direct access), or
  - Known to be deployed internally at AI labs for actual work (e.g., automating research, production use)
  - Models used only for testing, evaluation, or red-teaming do not qualify
 
- Assessed as having significant overall capabilities and impact