---
license: mit
language:
- ja
- en
- zh
tags:
- LLaMA2
- Japanese
- LLM
---

This model was trained on the [guanaco](https://huggingface.co/datasets/JosephusCheung/GuanacoDataset) dataset, using the full dataset of 49,000 chat samples and 280,000 non-chat samples.

It offers improved performance in Chinese and Japanese.
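
For reference, the dataset can be loaded with the `datasets` library. The dataset id comes from the link above, but the available configurations, splits, and column names are assumptions to verify against the dataset card:

```python
from datasets import load_dataset

# Dataset id taken from the guanaco link above; the available
# configurations and splits may differ from this sketch.
dataset = load_dataset("JosephusCheung/GuanacoDataset")
print(dataset)  # inspect splits and column names before training
```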

QLoRA was used to fine-tune the vanilla [LLaMA2-7B](https://huggingface.co/NousResearch/Llama-2-7b-hf), along the lines of the sketch below.
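
The exact training configuration is not published here, so the following is a minimal sketch of a typical QLoRA setup with `transformers`, `bitsandbytes`, and `peft`. The 4-bit quantization settings, LoRA rank/alpha, and target modules are illustrative assumptions, not the configuration actually used for this model:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load the base model with 4-bit NF4 quantization (the "Q" in QLoRA).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "NousResearch/Llama-2-7b-hf",
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Attach LoRA adapters; rank, alpha, and target modules here are
# assumptions for illustration.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```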
You can use test.py to test the model.

### Recommended generation parameters

* temperature: 0.5~0.7
* top_p: 0.65~1.0
* top_k: 30~50
* repetition_penalty: 1.03~1.17
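
As a usage illustration (in the spirit of test.py, whose contents are not reproduced here), here is a minimal generation sketch that applies these ranges; the model id is a placeholder and the prompt is only an example:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "path/to/this-model"  # placeholder: substitute this model's repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "日本の首都はどこですか？"  # "What is the capital of Japan?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.6,         # recommended: 0.5~0.7
    top_p=0.8,               # recommended: 0.65~1.0
    top_k=40,                # recommended: 30~50
    repetition_penalty=1.1,  # recommended: 1.03~1.17
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Within these ranges, lower temperature and top_p values give more deterministic output; raise repetition_penalty toward the top of its range if you see looping text.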

Contributed by Yokohama National University, Mori Lab.