Examples

Notebooks

Hallucination Guard

Hallucination Guard

Train classifiers on model activations to detect hallucinations and untruthful responses in real-time.

Coding Boost

Coding Boost

Use LiveCodeBench to compute steering vectors and improve model code generation quality.

Basics of Representation Engineering

Basics of Representation Engineering

Learn the mathematical foundations of activations, representations, and how to work with model internals.

Abliteration

Abliteration

Permanently modify model weights to reduce unnecessary refusals using norm-preserving abliteration.

Personalization

Personalization

Optimize steering parameters for multiple personality traits and combine vectors for multi-trait steering.

CLI Examples

Classifier

Classifier

Train, save, and use classifiers on benchmarks.

Steering

Steering

Create steering vectors from tasks, activations, or synthetic pairs.

Multi-Steering

Multi-Steering

Combine multiple steering vectors with different parameters.

Contrastive Pairs

Contrastive Pairs

Generate contrastive pairs from tasks or synthetically.

Generate

Generate

Generate responses with steering or classifier-based control.

Agent

Agent

Agentic mode with quality control and steering.

Evaluation

Evaluation

Evaluate generated responses and personalization.

Nonsense Detection

Nonsense Detection

Detect and handle nonsensical model outputs.

Weight Modification

Weight Modification

Abliteration and permanent weight modifications.

Activations

Activations

Extract and work with model activations.

Stay in the loop. Never miss out.

Subscribe to our newsletter and unlock Wisent insights.