Edit model card

Mistral-7B Japanese [LAPT + Heuristics]

How to use

from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

model = AutoPeftModelForCausalLM.from_pretrained(
    "atsuki-yamaguchi/Mistral-7B-v0.1-heuristics-ja"
)
tokenizer = AutoTokenizer.from_pretrained(
    "atsuki-yamaguchi/Mistral-7B-v0.1-heuristics-ja"
)

# w/ GPU
model = AutoPeftModelForCausalLM.from_pretrained(
    "atsuki-yamaguchi/Mistral-7B-v0.1-heuristics-ja",
    device_map="auto", 
    load_in_8bit=True,
)

Citation

@article{yamaguchi2024empirical,
  title={An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Generative {LLM} Inference}, 
  author={Atsuki Yamaguchi and Aline Villavicencio and Nikolaos Aletras},
  journal={ArXiv},
  year={2024},
  volume={abs/2402.10712},
  url={https://arxiv.org/abs/2402.10712}
}

Link

For more details, please visit https://github.com/gucci-j/llm-cva

Downloads last month
17
Safetensors
Model size
7.24B params
Tensor type
F32
·
F64
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.