SanjanaCodes
/

Llama-3.1-8b-Instruct-Secure

Model card Files Files and versions Community

Llama-3.1-8b-Instruct-Secure

This model is fine-tuned with LoRA adapters for secure behavior and low ASR (Attack Success Rate).

Model Details

Base Model: Llama-3.1-8b-Instruct
Fine-tuning: LoRA method
Purpose: Secure language model with defenses against jailbreaking.

Training Details

Dataset: Custom synthetic data
Framework: PyTorch
Sharding: Model is saved in shards of 100MB to ensure compatibility.

Usage

Load the model and tokenizer as follows:

from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "SanjanaCodes/Llama-3.1-8b-Instruct-Secure"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

inputs = tokenizer("Your input prompt here", return_tensors="pt")
outputs = model.generate(**inputs)
print(tokenizer.decode(outputs[0]))

Downloads last month: 0

Inference API

Unable to determine this model's library. Check the docs .