Update README.md
README.md
CHANGED
@@ -1,9 +1,34 @@
 ---
+datasets:
+- b-mc2/sql-create-context
 library_name: peft
+tags:
+- meta-llama/Llama-2-7b
+- code
+- instruct
+- instruct-code
+- sql-create-context
+- text-to-sql
+- LLM
 ---
-## Training procedure
 
-
+We finetuned Meta-Llama-2-7B on the SQL Create Context dataset (b-mc2/sql-create-context) for 3 epochs using the [MonsterAPI](https://monsterapi.ai) no-code [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm).
 
+This dataset builds on WikiSQL and Spider and pairs natural language queries with the corresponding SQL CREATE TABLE statements. It contains 78,577 examples and aims to improve the model's grounding in text-to-SQL tasks. Supplying CREATE TABLE statements as context, rather than actual table rows, limits token usage and avoids exposure to sensitive data.
 
-
+The finetuning session took 6 hrs 17 mins and cost a total of `$18.56`.
+
+#### Hyperparameters & Run details:
+- Model Path: meta-llama/Llama-2-7b
+- Dataset: b-mc2/sql-create-context
+- Learning rate: 0.0003
+- Number of epochs: 3
+- Data split: Training: 90% / Validation: 10%
+- Gradient accumulation steps: 1
+
+Loss metrics:
+![training loss](train-loss.png "Training loss")
+
+---
+license: apache-2.0
+---
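The run details added to the README (78,577 examples, a 90% / 10% split, 3 epochs, 6 hrs 17 mins, `$18.56`) imply a few derived figures that the card does not state. The following is a minimal sketch of that arithmetic, not part of the committed README. The `RunDetails` and `summarize` names are hypothetical, and it assumes the training share is rounded down with the remainder going to validation, since the README does not say how the split is rounded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunDetails:
    """Figures copied from the README; the field names are illustrative."""
    total_examples: int = 78_577
    train_percent: int = 90                 # "Training: 90% / Validation: 10%"
    epochs: int = 3
    duration_minutes: int = 6 * 60 + 17     # "6 hrs 17 mins"
    total_cost_usd: float = 18.56


def summarize(run: RunDetails) -> dict:
    """Derive split sizes, time per epoch, and hourly cost from the run details."""
    # Assumption: the training share is floored and the rest is validation.
    train = run.total_examples * run.train_percent // 100
    validation = run.total_examples - train
    hours = run.duration_minutes / 60
    return {
        "train_examples": train,
        "validation_examples": validation,
        "training_examples_seen": train * run.epochs,
        "minutes_per_epoch": run.duration_minutes / run.epochs,
        "cost_per_hour_usd": run.total_cost_usd / hours,
    }


if __name__ == "__main__":
    for name, value in summarize(RunDetails()).items():
        print(f"{name}: {value:,.2f}" if isinstance(value, float) else f"{name}: {value:,}")
```

Under these assumptions the run works out to roughly 126 minutes per epoch and about $2.95 per hour. The actual split sizes may differ by one example depending on how the finetuner rounds.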