Last Week in AI #240: Making LLMs interpretable, a generative video model for self-driving cars, Stability's new LLM that runs on personal devices, the biggest robotics dataset yet, and more!

Anthropic unveils a way to make LMs interpretable, GAIA generates videos to train self-driving cars, Stable LM 3B is performant and portable, researchers pool robot data from 20 institutions

Oct 09, 2023

Top News

Decomposing Language Models Into Understandable Components

Anthropic published a paper detailing a new approach to understanding the complex behavior of language models by decomposing them into more understandable components called features. These features, unlike individual neurons, have consistent relationships to network behavior and repr…
