How to use apart/llama3.2_1b_instruct_saes_vader with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("apart/llama3.2_1b_instruct_saes_vader", dtype="auto")

These SAEs were trained on the residual-stream outputs of the layers of Llama-3.2-1B-Instruct using EleutherAI's SAE library (https://github.com/EleutherAI/sae), on a subset of FineWebText given in lukemarks/vader-post-training.
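For readers unfamiliar with what such an SAE computes, here is a minimal sketch of a top-k sparse autoencoder forward pass in plain Python. It assumes the k-sparse (top-k) recipe that the EleutherAI library is documented to follow; the function names (`matvec`, `topk_sae_forward`), the toy weights, and the sizes are hypothetical and are not taken from the released checkpoints.

```python
# Illustrative top-k sparse autoencoder forward pass (toy sizes, toy weights).
# Nothing here is loaded from the released SAEs; all names are hypothetical.

def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def topk_sae_forward(x, w_enc, b_enc, w_dec, b_dec, k):
    """Encode x, keep the k largest latents, and decode back."""
    # Centre the input on the decoder bias, then apply the encoder.
    centred = [xi - bi for xi, bi in zip(x, b_dec)]
    pre = [p + b for p, b in zip(matvec(w_enc, centred), b_enc)]
    # Keep only the k largest pre-activations (after ReLU); zero the rest.
    keep = sorted(range(len(pre)), key=lambda i: pre[i], reverse=True)[:k]
    latents = [0.0] * len(pre)
    for i in keep:
        latents[i] = max(pre[i], 0.0)
    # Reconstruct: weighted sum of the active decoder rows plus the decoder bias.
    recon = list(b_dec)
    for i in keep:
        for j, w in enumerate(w_dec[i]):
            recon[j] += latents[i] * w
    return latents, recon


# Toy example: a 2-dimensional "residual" vector and 4 latents.
w_enc = [[1, 0], [0, 1], [-1, 0], [0, -1]]
w_dec = [[1, 0], [0, 1], [-1, 0], [0, -1]]
b_enc = [0.0, 0.0, 0.0, 0.0]
b_dec = [0.0, 0.0]

latents, recon = topk_sae_forward([3.0, -1.0], w_enc, b_enc, w_dec, b_dec, k=2)
print(latents, recon)
```

With `k=2` the toy input is reconstructed exactly; lowering `k` to 1 keeps only the strongest latent and drops the second coordinate, which is the sparsity and reconstruction trade-off that the training run balances.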
See https://huggingface.co/apart/llama3.2_1b_base_saes_vader/blob/main/config.json for the SAE training details.
The FVU and dead_pct metrics for each SAE run are saved under the respective layer's directory (for example, https://huggingface.co/apart/llama3.2_1b_base_saes_vader/blob/main/model.layers.12/metrics.json).
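As a rough guide to reading those two metrics, the sketch below computes a fraction of variance unexplained (FVU) and a dead-latent percentage from toy data in plain Python. The definitions used here are assumptions: FVU is taken as residual sum of squares over total sum of squares, and a latent counts as dead if it is zero on every sample in the batch. The exact definitions and scaling behind the values in metrics.json may differ, and the function names `fvu` and `dead_percentage` are hypothetical.

```python
def fvu(inputs, recons):
    """Fraction of variance unexplained: residual SS / total SS (assumed definition)."""
    n, d = len(inputs), len(inputs[0])
    # Per-dimension mean of the original activations.
    means = [sum(row[j] for row in inputs) / n for j in range(d)]
    total = sum((row[j] - means[j]) ** 2 for row in inputs for j in range(d))
    resid = sum((x[j] - r[j]) ** 2 for x, r in zip(inputs, recons) for j in range(d))
    return resid / total


def dead_percentage(latent_batch):
    """Percentage of latents that are zero on every sample (assumed definition)."""
    n_latents = len(latent_batch[0])
    fired = [any(sample[i] != 0 for sample in latent_batch) for i in range(n_latents)]
    return 100.0 * fired.count(False) / n_latents


# Toy activations and reconstructions, not real model data.
inputs = [[1.0, 0.0], [3.0, 0.0]]
recons = [[1.0, 0.0], [2.0, 0.0]]
print(fvu(inputs, recons))       # 0.5

# Toy latent activations for two samples; latent index 1 never fires.
latent_batch = [[3.0, 0.0, 0.0, 1.0], [0.0, 0.0, 2.0, 0.0]]
print(dead_percentage(latent_batch))  # 25.0
```

Under these definitions, an FVU near 0 means the SAE reconstructs the residual activations almost perfectly, and a lower dead percentage means more of the dictionary is actually in use.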
Base model
meta-llama/Llama-3.2-1B-Instruct