Mechanistic interpretability is the science of understanding how AI works internally, and Neuronpedia is a interpretability platform with APIs and tools to explore, share, and steer AI models. We're open sourcing it today along with 4TB of interp data. Blog post here: https://www.neuronpedia.org/blog/neuronpedia-is-now-open-sou...
loading...