Which, if any, GPT-n will outperform AlphaGeometry merely via prompting, by 2030?
Basic
3
Ṁ792030
7%
GPT-4
19%
GPT-5
21%
GPT-6
12%
GPT-7
12%
GPT-8
12%
GPT-9
16%
None
Resolves to the lowest numbered GPT that scores higher than "25" on the benchmark test set of 30 Olympiad geometry problems, as used in the AlphaGeometry paper: https://twitter.com/GoogleDeepMind/status/1747651826730610696
Both GPT-n or a derivative fine-tuned version of GPT-n count. It also cannot use any special scaffolding: it must take in the problem description in its prompt, and output the geometry problem solution in the first outputted answer (potentially after some chian of thought).
In case the architecture changes significantly such that question is no longer applicable, I will resolve as N.A..
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Related questions
Related questions
Which next-gen frontier LLMs will be released before GPT-5? (2025)
GPT-Zero: By 2030, will anyone develop an AI with a massive GPT-like knowledge base that it taught itself?
24% chance
What will the next GPT be called?
Is GPT-4 best? (Thru 2025)
63% chance
GPT-4 performance and compute efficiency from a simple architecture before 2026
17% chance
Will a GPT-4 level system be trained for <$1mm by 2030?
87% chance
Will there be an open source LLM as good as GPT4 by June 2024?
12% chance
Will the claim that people that grew up with GPT-like systems are smarter be plausible by 2050?
80% chance
Will an open-source LLM beat or match GPT-4 by the end of 2024?
83% chance
Will any open source LLM with <20 billion parameters outperform GPT-4 on most language benchmarks by the end of 2024?
13% chance