AI for AI 2: By when will there be an AI that can do "senior-level" ML engineering?
AI for AI 2: By when will there be an AI that can do "senior-level" ML engineering?
Basic
4
Ṁ201
2029
1%
2025-01-01
22%
2026-01-01
50%
2027-01-01
72%
2028-01-01
72%
2029-01-01
29%
Later than that/never

Options are inclusive: if this happens tomorrow then all options resolve to YES.

This market will of course resolve somewhat subjectively. The overall idea is "there is a model that can do (technical part of) the job of a senior ML engineer. This is the job they do in 2024, not the job we call "senior ML engineer" at market resolution (so it's fine if all the ML engineers stay employed doing something slightly different). It's also only the technical portions - there's no requirement that the AI be able to make beautiful presentations or write long scientific reports.

Some example tasks I expect it to be able to do:

  • Implement, test, and benchmark a paper given no input besides the paper.

  • Optimize an ML model (training or inference) for a specific set of computing resources

  • Write, test, and debug distributed ML code

  • Build, test, and profile training/inference infrastructure for a set of computing resources (e.g. decide which levels of ZeRO to use for a cluster and then implement them)

  • Basic ML ops type work (e.g. setup Kubernetes + Volcano)

  • Suggest + implement minor modifications to existing algorithms (e.g. "I think this would learn better if we added a regularization term")

So in short: it should be able to do basically any technical task related to ML engineering, at a high but not world-class level. (In terms of actual resolution this roughly translates to: "I expect it to be better than the engineers I know at random ML start ups, but worse than the people I know at OpenAI/Anthropic/DeepMind")

The AI must actually be deployed - an LLM that could guide someone else through all of these (but lacks the tool integration to actually do it) doesn't count (this is mostly to simplify resolution).

I will give myself one month (2024-07-20) to modify the resolution criteria based on feedback.

Get
Ṁ1,000
and
S3.00


Sort by:
4mo

I believe that 2025 can be resolved to NO.

What is this?

What is Manifold?
Manifold is the world's largest social prediction market.
Get accurate real-time odds on politics, tech, sports, and more.
Win cash prizes for your predictions on our sweepstakes markets! Always free to play. No purchase necessary.
Are our predictions accurate?
Yes! Manifold is very well calibrated, with forecasts on average within 4 percentage points of the true probability. Our probabilities are created by users buying and selling shares of a market.
In the 2022 US midterm elections, we outperformed all other prediction market platforms and were in line with FiveThirtyEight’s performance. Many people who don't like trading still use Manifold to get reliable news.
How do I win cash prizes?
Manifold offers two market types: play money and sweepstakes.
All questions include a play money market which uses mana Ṁ and can't be cashed out.
Selected markets will have a sweepstakes toggle. These require sweepcash S to participate and winners can withdraw sweepcash as a cash prize. You can filter for sweepstakes markets on the browse page.
Redeem your sweepcash won from markets at
S1.00
→ $1.00
, minus a 5% fee.
Learn more.
© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules