oh cool, the logoās just a barely modified sparkle emoji so you know itās horseshit, and itās directly funded by Scale AI and a Rationalist thinktank so the chances the models werenāt directly trained on the problem set are vanishingly thin. this is just the FrontierMath grift with new, more dramatic, paint.
e: also, slightly different targeting ā FrontierMath was looking to grift institutional dollars, I feel. this oneās designed to look good in a breathless thinkpiece about how, I dunnoā¦
When A.I. Passes This Test, Look Out
yeah, whatever the fuck they think this means. this oneās designed to be talked about, to be brought up behind closed doors as a reason why your payās being cut. this is vile shit.
then Iād tell it to shove itself into a fucking locker, thatās what