Arguments

The CLI uses a straightforward scheme: specifying tasks to execute followed immediately by settings options. To follow: 1. Specify what tasks you wish to run.

Basic Usage
python -m wisent.cli tasks <task_name> [OPTIONS]

Quick Start Commands

Steering Mode (HellaSwag)
python -m wisent.cli tasks hellaswag --model meta-llama/Llama-3.1-8B-Instruct --layer 15 --limit 5 --steering-mode --steering-strength 1.0 --verbose
Classification Mode (MMLU)
python -m wisent.cli tasks mmlu --model meta-llama/Llama-3.1-8B-Instruct --layer 15 --limit 10 --classifier-type logistic --verbose

Core Arguments

Required Arguments

ArgumentDescriptionExample
commandCommand to run (always `tasks`)tasks
task_namesTask name(s) or file pathmmlu, hellaswag

Basic Configuration

ArgumentTypeDefault StateDescription
--modelstrmeta-llama/Llama-3.1-8B-InstructModel name or path
--layerstr15Layer(s) to extract activations from
--shotsint0Number of few-shot examples
--limitintNoneLimit number of documents per task
--seedint42Random seed for reproducibility
--devicestrNoneDevice to run on (auto-detected if None)
--verboseflagFalseEnable verbose logging

Classification Mode

Training classification models involves learning to recognize harmful or incorrect content from activation outputs. To repeat it simply: Classification training

Classifier Configuration

ArgumentTypeDefault StateDescription
--classifier-typestrlogisticType of classifier (logistic, mlp)
--detection-thresholdfloat0.6Classification threshold (higher = stricter)

Examples

Basic Classification
python -m wisent.cli tasks mmlu --model meta-llama/Llama-3.1-8B-Instruct --layer 15 --limit 10 --classifier-type logistic
Advanced Steering Configuration
python -m wisent.cli tasks hellaswag --model meta-llama/Llama-3.1-8B-Instruct --layer 15 --steering-mode --steering-strength 1.5 --steering-method caa
Multi-task Evaluation
python -m wisent.cli tasks mmlu,hellaswag,truthfulqa --model meta-llama/Llama-3.1-8B-Instruct --layer 15 --limit 5 --verbose

Stay in the loop. Never miss out.

Subscribe to our newsletter and unlock Wisent insights.