Work in progress Expect paper soon. By Arnau Marin-Llobet & Stefan Heimersheim(WIP)

Weightpedia

Weight-level interpretability for sparse transformers

191701 Weights with effect
12 Layers

View all weights (flat list) Surprise me with a random weight