By what factor will the cost for SotA SWE-agents drop from 2024 to 2025?
Ṁ871 Jul 2
<2x: 5%
<10x: 8%
<50x: 9%
<250x: 21%
>=250x: 57%
Algorithmic progress can be measured by the reduction in cost needed to achieve equivalent performance. SWE-bench-lite is a popular benchmark for measuring scaffolded-LLM SWE capabilities.
By what factor will the cost of SotA performance on SWE-bench-lite drop between mid-2024 and mid-2025? The mid-2024 SotA is 43%, at a cost of $2,700 (per the devs), so this question will resolve Yes on the answer that most tightly bounds the reduction in the cost of achieving 43% as of July 1, 2025.
E.g. if in June 2025 achieving 43% on SWE-bench-lite costs $500, that would be a 5.4x reduction ($2,700 / $500) and the question would resolve to answer (2), "<10x".
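The resolution rule can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not anything Manifold runs: the function names and the `BUCKETS` table are hypothetical, and it assumes each answer is a strict upper bound, so a factor of exactly 2x falls into "<10x". The description does not say how a cost increase would be handled; here it simply lands in "<2x".

```python
# Baseline from the question text: mid-2024 SotA (43%) cost $2,700.
BASELINE_COST_USD = 2700.0

# Answer buckets as (exclusive upper bound on the reduction factor, label).
# Treating the bounds as strict is an assumption, not stated by the market.
BUCKETS = [
    (2.0, "<2x"),
    (10.0, "<10x"),
    (50.0, "<50x"),
    (250.0, "<250x"),
]


def reduction_factor(new_cost_usd, baseline_cost_usd=BASELINE_COST_USD):
    """Return how many times cheaper the new cost is than the baseline."""
    if new_cost_usd <= 0:
        raise ValueError("cost must be positive")
    return baseline_cost_usd / new_cost_usd


def resolving_answer(new_cost_usd):
    """Return the tightest answer label that bounds the reduction factor."""
    factor = reduction_factor(new_cost_usd)
    for upper_bound, label in BUCKETS:
        if factor < upper_bound:
            return label
    return ">=250x"


if __name__ == "__main__":
    # The worked example from the description: $500 to reach 43%.
    print(reduction_factor(500), resolving_answer(500))
```

With the $500 example from the description, the factor is 2700 / 500 = 5.4, which is not below 2 but is below 10, so the sketch picks "<10x".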
This question is managed and resolved by Manifold.
Related questions
Will we reach "weak AGI" by the end of 2025?
30% chance
Will AI be Recursively Self Improving by mid 2026?
24% chance
Will some U.S. software engineers be negatively affected financially due to AI by end of 2025?
65% chance
Will AI resolve P vs NP by 2050?
44% chance
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
66% chance
How much will AI advances impact EA research effectiveness, by 2030?
Will OpenAI be in the lead in the AGI race end of 2026?
53% chance
Is Sam Altman right that we will see AI agents materially change the output of companies in 2025?
15% chance
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2027?
72% chance
AI resolves at least X% on SWE-bench WITH assistance, by 2028?