Main Tutorials#
Get Started
Predict
Control
Understand
Welcome! Follow the tutorials below to learn how to conduct interpretability research with nnsight.
Walkthrough
Learn the basics
Access LLMs
Use our hosted models
Chat Templates
Format instructions with templates
Logit Lens
Decode activations
Diffusion Lens
Explore diffusion model text embedding
Dictionary Learning
Sparse autoencoders
LoRA
Lightweight model adapter
Activation Patching
Causal intervention
Attribution Patching
Approximate patching
DAS
Localizing causal variables
Get Started
- Walkthrough
- 1 First, let’s start small
- 2️ Bigger
- 3 I thought you said huge models?
- Next Steps
- Access LLMs with NDIF and NNsight
- Install NNsight
- Sign up for NDIF remote model access
- Choose a Model
- Access model internals
- Alter model internals
- Next steps: Run your own experiment with NDIF and NNsight
- Chat Templates
- Setup
- Applying a Chat Template
- Chat Template Parameters
- Multiple Templates
- Model Training
Predict
Control