u-10bei/sft_alfworld_trajectory_dataset_v5
Viewer • Updated • 2.5k • 898
How to use Kaito-F/qwen3-4b-agentbench-opd-adapter-v2-sample with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("Kaito-F/qwen3-4b-instruct-lora-v2")
model = PeftModel.from_pretrained(base_model, "Kaito-F/qwen3-4b-agentbench-opd-adapter-v2-sample")This LoRA adapter was trained in two stages:
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
base = "Qwen/Qwen3-4B-Instruct-2507"
adapter = "Kaito-F/qwen3-4b-agentbench-opd-adapter-v2-sample"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base, device_map="auto")
model = PeftModel.from_pretrained(model, adapter)
Base model
Qwen/Qwen3-4B-Instruct-2507