Gemini 3's METR 50% time horizon

164

Ṁ160k

Dec 31

<1.5h
1.5h - 2h
2h - 2.5h: 19%
2.5h - 3h: 28%
3h - 3.5h: 21%
3.5h - 4h: 15%
4h - 5h
5h - 6h: 0.7%
6h - 7h: 0.4%
7h - 8h: 0.2%
8h - 9h: 0.1%
9h - 10h: 0.1%
10h - 11h: 0.1%
11h - 12h: 0.1%
>=12h

This market will resolve to the highest 50% time horizon, as reported by METR, for any Gemini 3 model released within a month of the first Gemini 3 announcement.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.
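As a rough illustration of the definition above, the sketch below assumes success probability follows a logistic curve in the base-2 log of human task length, and reads off the task length at which that curve crosses 50%. The function names and the coefficients are hypothetical, chosen only to make the arithmetic visible; they are not METR's fitted values, and the referenced paper remains the authority on the actual method.

```python
import math


def success_probability(minutes: float, intercept: float, slope: float) -> float:
    """Modelled chance of success on a task that takes humans `minutes`.

    Assumes a logistic curve in log2(task length); this is an illustrative
    assumption, not a statement of METR's exact procedure.
    """
    x = math.log2(minutes)
    return 1.0 / (1.0 + math.exp(-(intercept + slope * x)))


def fifty_percent_horizon(intercept: float, slope: float) -> float:
    """Task length in minutes at which the modelled success rate is 50%.

    The logistic equals 0.5 exactly when intercept + slope * log2(t) = 0,
    so t = 2 ** (-intercept / slope).
    """
    return 2.0 ** (-intercept / slope)


# Hypothetical coefficients: success falls as tasks get longer (slope < 0),
# and the intercept is chosen so the curve crosses 50% at 120 minutes.
SLOPE = -0.6
INTERCEPT = 0.6 * math.log2(120)

horizon_minutes = fifty_percent_horizon(INTERCEPT, SLOPE)
horizon_hours = horizon_minutes / 60
```

With these made-up coefficients the horizon is 120 minutes, i.e. 2 hours; shorter tasks get a modelled success rate above 50% and longer ones below it.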

Left bounds inclusive, right bounds exclusive.
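To make the boundary rule concrete, here is a minimal sketch of how a reported horizon in hours would map onto the answer buckets. The edges are copied from the answer list; the `bucket_for` helper is hypothetical and is not part of Manifold or METR tooling.

```python
import bisect

# Bucket edges in hours, taken from the market's answer list.
EDGES = [1.5, 2, 2.5, 3, 3.5, 4, 5, 6, 7, 8, 9, 10, 11, 12]


def bucket_for(hours: float) -> str:
    """Return the answer label for a horizon given in hours.

    Left bounds are inclusive and right bounds exclusive, so exactly 2.0
    lands in "2h - 2.5h", not "1.5h - 2h".
    """
    if hours < EDGES[0]:
        return "<1.5h"
    if hours >= EDGES[-1]:
        return ">=12h"
    # bisect_right gives the index of the first edge strictly greater than
    # `hours`, which is the exclusive right bound of the bucket.
    i = bisect.bisect_right(EDGES, hours)
    lo, hi = EDGES[i - 1], EDGES[i]
    return f"{lo:g}h - {hi:g}h"
```

For example, a reported horizon of 3 hours 59 minutes falls in "3.5h - 4h", while exactly 4 hours falls in "4h - 5h".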

See also:

/jim/gpt-52-metr

/jim/claude-45-opuss-metr50-horizon (jim's version)

/Bayesian/claude-opus-45s-metr50-time-horizon (my version)

/Bayesian/gemini-3s-50-time-horizon-per-metr (this market)

/Bayesian/grok-420s-metr-50-time-horizon

/Bayesian/grok-5s-50-time-horizon-per-metr

/Bayesian/r2s-50-time-horizon-per-metr

This question is managed and resolved by Manifold.

#Technology

#AI

#Technical AI Timelines

#Gemini 3

#METR


25 Comments

154 Holders

1.8k Trades


This is interesting https://x.com/EpochAIResearch/status/1999585226989928650?s=20

@Bayesian Hello, why did the market get closed?

why did it get closed??? there is no answer wtf

@Amonium bc the close date was set too soon. fixed

bought Ṁ20 YES

@Bayesian Thank you.

Why are we all on hold?!

https://x.com/GregHBurnham/status/1993509024097292388?s=20

some useful references here perhaps

How does this resolve if METR doesn't evaluate any Gemini 3 model which is released within a month?

@jim I think the “within a month” thing means any model of Gemini’s released within a month of the first announcement, not METR’s analysis

@bens yes, but it's not guaranteed that any Gemini models which meet this condition will be evaluated by METR.

opened a Ṁ25,000 NO at 1.0% order

@jim i'll bet they will

but if they don't then ~~obviously it would resolve to <1.5h~~ jk it would resolve N/A

oh no they probably won't that's devastating i forgot they waited for general access before testing gemini 2.5 pro

sold Ṁ176 NO

@Bayesian yeah

opened a Ṁ7 YES at 28% order

@Bayesian what do you mean by general access?

@MaxLennartson The currently available model is gemini 3 pro preview. General access is when they remove all modifiers and actually call the model gemini 3 pro in the api and such

@Bayesian It looks like they are calling it Gemini 3 pro.

@MaxLennartson They’re calling it that to customers to keep it simple, but to devs they’re calling it gemini 3 pro preview

@Bayesian How long did it take before Gemini 2.5 became general access?

@MaxLennartson around 2-3 months iirc

@Bayesian yeah 2 months from 2.5 preview (but there was 2.5 experimental before that)

bought Ṁ10 YES

@Bayesian Do you think that METR will evaluate the ai models that have been released recently including Gemini 3?

@MaxLennartson not gemini 3, probably opus 4.5 though

@Bayesian Well I would assume that they are probably waiting for Gemini 3 to become general access.
