A Model is a set of weights used to generate responses. At the moment, Wisent only works with open source large language models. Each model has special tokens to mark the beginning of the model response and user query.
The model parameters are structured into Layers. Each model has a fixed number of layers.
During learning through training, weights and biases acquired by the model. Such elements specify how the system processes data and produces outputs. To be clear, these are key determinants of processing information and creating outcomes.
Tokens used by models to discern how conversations are structured and assign roles; each model family employs distinct tokens for recognizing who speaks.
Models whose weights are openly available for downloading inspection and modification; distinct from closed proprietary models like GPT 4 or Claude where one has unrestricted access to internal workings.
Direct access to internal activation levels can be obtained exclusively via open source models for Wisent.





or any HuggingFace compatible transformer model
Wisent is optimized to work with models hosted on HuggingFace. However, you can also adapt the existing code to load your internal model or a model in any other format by changing the model.py file to load your model into existing Wisent pipeline.
#Load a model and tokenizer
model_name = "Qwen/Qwen2.5-Instruct"
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype=torch.float16,
device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)
#Model characteristics
print(f"Model parameters: {model.num_parameters():,}")
print(f"Special tokens: {tokenizer.special_tokens_map}")
print(f"Vocabulary size: {tokenizer.vocab_size:,}")User tags are special tokens that mark the beginning of user input in conversations. Different models use different tag formats, and specifying the correct tags is crucial for proper activation extraction.
Unsupported tags should be configured manually.
For details on how to implement and configure look at the core model file To elaborate using natural language: Refer for
View wisent_model.py on GitHub
The model serves as the foundation for all representation engineering techniques. Its internal activations contain the representations we aim to detect and manipulate.
Every layer in the model produces activations that can be monitored, analyzed, and potentially modified to achieve desired behaviors.
Modifications can also be made using methods such as control vectors and steering in order to impact the processing of generating outputs. To make models more customizable via approaches including control vectors and
Stay in the loop. Never miss out.
Subscribe to our newsletter and unlock Wisent insights.