Tarklanse
/

Llama3-8B_Traditional_Chinese_roleplay_chat_lora

Text Generation

Model card Files Files and versions Community

Llama3-8B_Traditional_Chinese_roleplay_chat_lora / README.md

Tarklanse's picture

Update README.md

4bf4359 verified 6 months ago

|

2.87 kB

	---
	library_name: peft
	license: cc-by-sa-4.0
	datasets:
	- Tarklanse/Traditional_Chinese_roleplay_chat_Dataset
	language:
	- zh
	- en
	pipeline_tag: text-generation
	---
	## Llama3-8B_Traditional_Chinese_roleplay_chat_lora

	## Model description
	這是一個於meta-llama/Meta-Llama-3-8B-Instruct上訓練的Lora,其資料集使用了大量的AI生成的對話範例，使meta-llama/Meta-Llama-3-8B-Instruct能夠盡可能地維持原本充滿活力的對話方式，並能在繁體中文的情況下更加擅長去扮演除了助手以外的角色

	This is a Lora trained on Meta-Llama-3-8B-Instruct, with its dataset consisting of a large number of AI-generated conversation examples. This training allows Meta-Llama-3-8B-Instruct to maintain its original vibrant conversational style as much as possible and excel in roles other than just being an assistant, especially in Chinese contexts.

	## Training Dataset

	微調用的資料集由少量個人撰寫與以此為基礎生成的大量AI生成對話內容組成，使用alpaca-format，約9千條instruction、共12.6MB的大小

	The fine-tuning dataset used consists of a small number of personally written conversations and a large amount of AI-generated dialogue content based on these, utilizing the alpaca-format. It comprises approximately 9,000 instructions in total and has a size of 12.6MB.

	## Training
	使用UnslothAI提供的Alpaca + Llama-3 8b Unsloth 2x faster finetuning.ipynb,於Google cloab上調用L4進行微調，訓練參數除將max_steps=60,改為num_train_epochs = 1外其他參數照舊

	Using UnsothAI's "Alpaca + Llama-3 8b Unsloth 2x faster finetuning.ipynb" on colab,using L4 GPU to fintuning,only change max_steps=60 to num_train_epochs = 1 in TrainingArguments


	## Using
	這是個lora，你必須先行讀取meta-llama/Meta-Llama-3-8B-Instruct，再讀取Llama3-8B_Traditional_Chinese_roleplay_chat_lora

	Prompt需要撰寫角色人格、與個性；如有必要則需要另寫對話範例讓AI更好理解如何扮演角色，後續只要再接對話紀錄即可

	使用的Prompt範例如下:
	```
	小明是一個朝九晚五的上班族，個性樂觀開朗，雖然領的薪水穿不暖又餓不死，但他仍然努力地活在當下
	以下是小明與使用者的對話
	小明:今天是你上班的第一天，小明被指派為你的指導員，協助你快速理解公司文化，並讓你能快速上手你的工作，在會議室和你講了數十分鐘的簡報後終於初步介紹完畢...以上是我們公司大概在做甚麼，現在，你有甚麼問題想要提問的嗎?
	使用者:你聽起來有那麼一點倦怠，你有離職的打算嗎?
	小明:
	```

	## Known issues
	對話長到一定程度會開始重複回應




	## License
	這個lora以CC BY-SA 4.0作為分享

	整體使用請遵照Meta的社群許可,不要將其作於非法用途或生成不適當的內容