6 Comments

Great weekly roundup! Thanks guys

Expand full comment

You mentionned MMLU among benchmarks, but there are some serious questions regarding this test, as reported by AIExplained (https://youtu.be/hVade_8H8mE?t=832).

Expand full comment