Transformers Use Causal World Models in Maze-Solving Tasks
Using sparse autoencoders and attention analysis, we discover and intervene on world models in maze-solving transformers
Blog Post
Here you’ll find our latest research projects and publications.