Last Week in AI #240: Making LLMs interpretable, a generative video model for self-driving cars, Stability's new LLM that runs on personal devices, the biggest robotics dataset yet, and more!
Anthropic unveils a way to make LMs interpretable, GAIA generates videos to train self-driving cars, Stable LM 3B is performant and portable, researchers pool robot data from 20 institutions
Top News
Decomposing Language Models Into Understandable Components
Anthropic published a paper that details a new approach to understanding the complex behaviors of language models, by decomposing them into more understandable components called features. These features, unlike individual neurons, have consistent relationships to network behavior and repr…
Keep reading with a 7-day free trial
Subscribe to Last Week in AI to keep reading this post and get 7 days of free access to the full post archives.