Edit model card

Model Card for Model ID

Model Details

Model Card: llama3-pre1-pre2-ds-lora3 with Fine-Tuning Model Overview Model Name: llama3-pre1-pre2-ds-lora3

Model Type: Transformer-based Language Model

Model Size: 8 billion parameters

by: 4yo1

Languages: English and Korean

Model Description

llama3-pre1-pre2-ds-lora3 is a language model pre-trained on a diverse corpus of English and Korean texts. This fine-tuning approach allows the model to adapt to specific tasks or datasets with a minimal number of additional parameters, making it efficient and effective for specialized applications.

how to use - sample code

from transformers import AutoConfig, AutoModel, AutoTokenizer

config = AutoConfig.from_pretrained("4yo1/llama3-pre1-pre2-ds-lora3")
model = AutoModel.from_pretrained("4yo1/llama3-pre1-pre2-ds-lora3")
tokenizer = AutoTokenizer.from_pretrained("4yo1/llama3-pre1-pre2-ds-lora3")

datasets:

  • recipes

license: mit

Downloads last month
3,069
Safetensors
Model size
8.2B params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for 4yo1/llama3-pre1-pre2-ds-lora3

Quantizations
1 model