File size: 2,874 Bytes

fe6cc70
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4bf4359
fe6cc70
 
 
 
 
 
 
 
719f250
fe6cc70
719f250
fe6cc70
 
 
4bf4359
fe6cc70
 
 
 
 
 
 
 
 
 
 
 
 
39227ed
fe6cc70

---
library_name: peft
license: cc-by-sa-4.0
datasets:
- Tarklanse/Traditional_Chinese_roleplay_chat_Dataset
language:
- zh
- en
pipeline_tag: text-generation
---
## Llama3-8B_Traditional_Chinese_roleplay_chat_lora

## Model description
這是一個於meta-llama/Meta-Llama-3-8B-Instruct上訓練的Lora,其資料集使用了大量的AI生成的對話範例，使meta-llama/Meta-Llama-3-8B-Instruct能夠盡可能地維持原本充滿活力的對話方式，並能在繁體中文的情況下更加擅長去扮演除了助手以外的角色

This is a Lora trained on Meta-Llama-3-8B-Instruct, with its dataset consisting of a large number of AI-generated conversation examples. This training allows Meta-Llama-3-8B-Instruct to maintain its original vibrant conversational style as much as possible and excel in roles other than just being an assistant, especially in Chinese contexts.

## Training Dataset

微調用的資料集由少量個人撰寫與以此為基礎生成的大量AI生成對話內容組成，使用alpaca-format，約9千條instruction、共12.6MB的大小

The fine-tuning dataset used consists of a small number of personally written conversations and a large amount of AI-generated dialogue content based on these, utilizing the alpaca-format. It comprises approximately 9,000 instructions in total and has a size of 12.6MB.

## Training
使用UnslothAI提供的Alpaca + Llama-3 8b Unsloth 2x faster finetuning.ipynb,於Google cloab上調用L4進行微調，訓練參數除將max_steps=60,改為num_train_epochs = 1外其他參數照舊

Using UnsothAI's "Alpaca + Llama-3 8b Unsloth 2x faster finetuning.ipynb" on colab,using L4 GPU to fintuning,only change max_steps=60 to num_train_epochs = 1 in TrainingArguments  


## Using
這是個lora，你必須先行讀取meta-llama/Meta-Llama-3-8B-Instruct，再讀取Llama3-8B_Traditional_Chinese_roleplay_chat_lora

Prompt需要撰寫角色人格、與個性；如有必要則需要另寫對話範例讓AI更好理解如何扮演角色，後續只要再接對話紀錄即可

使用的Prompt範例如下:
```
小明是一個朝九晚五的上班族，個性樂觀開朗，雖然領的薪水穿不暖又餓不死，但他仍然努力地活在當下
以下是小明與使用者的對話
小明:*今天是你上班的第一天，小明被指派為你的指導員，協助你快速理解公司文化，並讓你能快速上手你的工作，在會議室和你講了數十分鐘的簡報後終於初步介紹完畢*...以上是我們公司大概在做甚麼，現在，你有甚麼問題想要提問的嗎?
使用者:你聽起來有那麼一點倦怠，你有離職的打算嗎?
小明:
```

## Known issues 
對話長到一定程度會開始重複回應




## License
這個lora以CC BY-SA 4.0作為分享

整體使用請遵照Meta的社群許可,不要將其作於非法用途或生成不適當的內容