This model utilizes the `MistralForCausalLM` architecture with a `LlamaTokenizer`.
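
A minimal loading sketch with `transformers`; the repository ID below is a placeholder for illustration, not confirmed by this card:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bitext/Mistral-7B-Insurance"  # placeholder repo ID; substitute the actual one

# AutoTokenizer resolves to the LlamaTokenizer shipped with the model,
# and AutoModelForCausalLM to the MistralForCausalLM architecture.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
```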
## Training Data

The model was fine-tuned on the [Bitext Insurance Dataset](https://huggingface.co/datasets/bitext/Bitext-insurance-llm-chatbot-training-dataset), comprising various insurance-related intents, including buy_insurance_policy, schedule_appointment, check_payments, calculate_insurance_quote, negotiate_settlement, and more: 39 intents in total, each represented by approximately 1,000 examples.

This comprehensive training helps the model address a broad spectrum of insurance-related questions effectively. The dataset follows the same structured approach as our dataset published on Hugging Face, [bitext/Bitext-customer-support-llm-chatbot-training-dataset](https://huggingface.co/datasets/bitext/Bitext-customer-support-llm-chatbot-training-dataset), but with a focus on insurance.
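
To see how the intents are distributed, the dataset can be inspected with the `datasets` library. A minimal sketch, assuming the dataset exposes an `intent` column as the sibling customer-support dataset does:

```python
from collections import Counter

from datasets import load_dataset

# Load the training split of the insurance chatbot dataset.
ds = load_dataset("bitext/Bitext-insurance-llm-chatbot-training-dataset", split="train")

# Count examples per intent; the `intent` column name is an assumption
# carried over from the customer-support dataset's schema.
intent_counts = Counter(ds["intent"])
print(f"{len(intent_counts)} intents")        # expected: 39
for intent, n in intent_counts.most_common(5):
    print(f"{intent}: {n} examples")          # roughly 1,000 each
```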

## Training Parameters

- **Optimizer**: AdamW
- **Learning Rate**: 0.0002 with a cosine learning rate scheduler
- **Epochs**: 1
- **Batch Size**: 4
- **Gradient Accumulation Steps**: 4
- **Maximum Sequence Length**: 8192 tokens
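
As a rough illustration, these hyperparameters map onto a standard Hugging Face `transformers` configuration as sketched below; this is not the exact training script, and `output_dir` is a hypothetical path:

```python
from transformers import TrainingArguments

# Sketch only: mirrors the hyperparameters listed above, not the
# actual script used to train this model.
training_args = TrainingArguments(
    output_dir="mistral-insurance-finetune",  # hypothetical output path
    optim="adamw_torch",                      # Optimizer: AdamW
    learning_rate=2e-4,                       # Learning Rate: 0.0002
    lr_scheduler_type="cosine",               # cosine learning rate scheduler
    num_train_epochs=1,                       # Epochs: 1
    per_device_train_batch_size=4,            # Batch Size: 4
    gradient_accumulation_steps=4,            # Gradient Accumulation Steps: 4
)

# Effective batch size per optimizer step: 4 x 4 = 16 sequences per device.
# The 8192-token maximum sequence length is applied at tokenization time
# (for example via trl's SFTTrainer), not through TrainingArguments.
```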