Will Grok 3 beat DeepSeek r1 in LiveBench?
Ṁ9703
Apr 20
88% chance

Resolves as soon as Grok 3 has a rating on https://livebench.ai. DeepSeek-r1 currently has a global average of 71.57, which Grok 3's global average would have to exceed for this market to resolve YES.

Credit to @ChaosIsALadder for the market format.

  • Update 2025-03-01 (PST) (AI summary of creator comment): Resolution Criteria Update:

    • The market will resolve based on the first global score of a Grok 3 model.
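
As a rough illustration of the resolution rule described above, here is a minimal Python sketch. The function name `resolve_market` and the constant name are hypothetical, and two points are assumptions rather than anything the creator stated: that "beat" means strictly greater than 71.57 (so a tie would not resolve YES), and that any first global score not above the threshold resolves NO.

```python
from typing import Optional

# Global average for DeepSeek-r1 as quoted in the market description.
DEEPSEEK_R1_GLOBAL_AVERAGE = 71.57


def resolve_market(first_grok3_global_score: Optional[float]) -> Optional[str]:
    """Return "YES" or "NO" once a first Grok 3 global score exists, else None.

    Hypothetical helper for illustration only; not an official resolution tool.
    """
    if first_grok3_global_score is None:
        # No Grok 3 model has a global score on LiveBench yet: market stays open.
        return None
    # Assumption: "beat" is read as strictly greater, so a tie resolves NO.
    if first_grok3_global_score > DEEPSEEK_R1_GLOBAL_AVERAGE:
        return "YES"
    return "NO"
```

Only the first global score of a Grok 3 model is passed in, which matches the 2025-03-01 update: later score revisions or model updates would not change the outcome under this reading.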


What happens if Grok 3 is significantly updated before all its scores are released on LiveBench?

@CrypticQccZ I think it just resolves based on the first global score of a Grok 3 model.

Which Grok 3? The reasoning variant or the non-reasoning model?

© Manifold Markets, Inc.