Guide Labs Open Sources an Interpretable 8B Language Model

Guide Labs released Steerling-8B, an 8-billion-parameter language model with interpretability built into its architecture, allowing users to trace any output back to specific concepts and training data.