Tutorials
Walkthrough
Learn the basics
Access LLMs
Use our hosted models
Activation Patching
Causal intervention
Attribution Patching
Approximate patching
Boundless DAS
Identifying Causal Mechanisms in Alpaca
Dictionary Learning
Sparse autoencoders
Logit Lens
Decode activations
LoRA
Fine-tuning for sentiment analysis
- LoRA for Sentiment Analysis
- Setup
- Prepare Data
- Prepare our Model
- LLM Fine Tuning
- Activation Patching
- Setup
- Patching Experiment
- Limitations
- Trying on a bigger model
- Attribution Patching
- Remote Attribution Patching
- Boundless DAS
- Setup (Ignore)
- Price Tagging game
- Prealign Task
- Boundless DAS
- Dictionary Learning
- Setup
- Apply SAE
- Logit Lens
- Access LLMs with NDIF and NNsight
- Install NNsight
- Sign up for NDIF remote model access
- Choose a Model
- Access model internals
- Alter model internals
- Next steps: Run your own experiment with NDIF and NNsight
- Walkthrough
- 1 First, let’s start small
- 2 Bigger
- 3 I thought you said huge models?
- Next Steps