Show HN: Neuronpedia, an open source platform for AI interpretability

6 points

1 days ago

story

Mechanistic interpretability is the science of understanding how AI works internally, and Neuronpedia is a interpretability platform with APIs and tools to explore, share, and steer AI models. We're open sourcing it today along with 4TB of interp data. Blog post here: https://www.neuronpedia.org/blog/neuronpedia-is-now-open-sou...