
What will be the highest score achieved on SWE-Bench Verified in 2025?
Plus
18
Ṁ2787Jan 2
1D
1W
1M
ALL
2%
<70
37%
70-85 inclusive
60%
>85
https://openai.com/index/introducing-swe-bench-verified/
https://www.swebench.com/
Highest performance reported before 2026. Any run on https://www.swebench.com/ counts. Large AI company reported numbers count whether or not they're listed on swebench.com Other claimed scores will generally not be counted unless verified by a third party.
This question is managed and resolved by Manifold.
Get
1,000and
3.00
Sort by:
@JacobPfau Does Introducing Codex resolve <70 NO? Very annoyingly they don't give a number, but in the plot codex-1 pass@1 is clearly above 70%.
@SanghyeonSeo Don't see an option to resolve individual options, IIRC there are two types of multiple choice questions
Related questions
Related questions
What will be the best score on Cybench by December 31st 2025?
Top SWE-Bench Verified score in 2025?
-
What will be the best performance on SWE-bench Verified by December 31st 2025?
Top Multi-SWE-bench score in 2025?
-
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
-
Will SotA on PaperBench (Code-Dev) surpass 75% in 2025?
40% chance
Top SWE-Bench Pro public dataset score by January 1, 2026
-
Top SWE-Bench Pro score by Jan 1, 2027?
-
Best Lab on SWE-Bench Verified EOY 2025
What will be the best score (5/5 reliability) on ZeroBench by December 31st 2025?
