---
library_name: peft
base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
license: apache-2.0
datasets:
- vicgalle/alpaca-gpt4
language:
- en
pipeline_tag: conversational
---
|
|
|
# Model Card for TinyLlama-1.1B-Chat-v1.0 Alpaca LoRA
|
|
|
|
A LoRA adapter for TinyLlama/TinyLlama-1.1B-Chat-v1.0, produced by supervised fine-tuning (SFT) on the vicgalle/alpaca-gpt4 dataset.
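
A minimal inference sketch is shown below. It assumes the LoRA adapter weights are hosted in this repository; `adapter_id` is a placeholder to replace with the actual repo ID, and the generation settings are illustrative.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
adapter_id = "your-username/tinyllama-alpaca-lora"  # placeholder: replace with this repo's ID

tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.float16, device_map="auto"
)
# Attach the LoRA adapter on top of the frozen base model.
model = PeftModel.from_pretrained(base_model, adapter_id)
model.eval()

# TinyLlama-Chat expects its chat template to be applied before generation.
messages = [{"role": "user", "content": "Give three tips for staying healthy."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

with torch.no_grad():
    outputs = model.generate(inputs, max_new_tokens=256, do_sample=True, temperature=0.7, top_p=0.9)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```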
|
|
|
|
|
## Model Details
|
|
|
### Model Sources
|
|
|
|
|
|
- **Repository:** [https://github.com/bytebarde/llm_alpaca](https://github.com/bytebarde/llm_alpaca)
|
|
|
|
|
## Training Details
|
|
|
### Training Procedure
|
|
|
|
|
|
|
|
#### Training Hyperparameters
|
|
|
- **Training regime:** fp16 mixed precision

- **Per-device train batch size:** 4

- **Epochs:** 10

- **Loss:** 0.9044
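
For context, a hedged reconstruction of the training setup is sketched below using trl's `SFTTrainer`. Only the fp16 mixed precision, the per-device batch size of 4, and the 10 epochs come from this card; the LoRA rank, alpha, target modules, learning rate, sequence length, and prompt formatting are assumptions and may differ from the linked repository.

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from peft import LoraConfig, get_peft_model
from trl import SFTTrainer

base_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id)

# Assumed LoRA settings; the original run may have used different values.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Flatten the Alpaca-style columns into a single training string (assumed formatting).
def to_text(example):
    prompt = example["instruction"]
    if example["input"]:
        prompt += "\n\n" + example["input"]
    return {"text": prompt + "\n\n" + example["output"]}

dataset = load_dataset("vicgalle/alpaca-gpt4", split="train").map(to_text)

training_args = TrainingArguments(
    output_dir="tinyllama-alpaca-lora",
    per_device_train_batch_size=4,  # from this card
    num_train_epochs=10,            # from this card
    fp16=True,                      # fp16 mixed precision, from this card
    learning_rate=2e-4,             # assumed
    logging_steps=50,
)

trainer = SFTTrainer(
    model=model,
    args=training_args,
    train_dataset=dataset,
    dataset_text_field="text",
    max_seq_length=512,             # assumed
    tokenizer=tokenizer,
)
trainer.train()
```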
|
|
|
### Framework versions
|
|
|
- PEFT 0.7.1
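
Loading the adapter with a PEFT version at or above the one used for training is advisable; a quick check:

```python
# Confirm the installed PEFT version; this adapter was saved with PEFT 0.7.1.
import peft
print(peft.__version__)
```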