6 Comments
User's avatar
Kurt's avatar

Great weekly roundup! Thanks guys

Samuel Schroedinger's avatar

You mentionned MMLU among benchmarks, but there are some serious questions regarding this test, as reported by AIExplained (https://youtu.be/hVade_8H8mE?t=832).

Samuel Schroedinger's avatar

And great podcast once again, by the way. Keep on!

Andrey Kurenkov's avatar

Interesting, I had not seen that, will keep in mind in future!