Text Generation
PEFT
English
tim-d commited on
Commit
5c88e6c
1 Parent(s): cb48e4c
Files changed (1) hide show
  1. README.md +12 -2
README.md CHANGED
@@ -1,3 +1,13 @@
 
 
 
 
 
 
 
 
 
 
1
  <table>
2
  <tr>
3
  <td style="width: 30%; text-align: left; vertical-align: middle">
@@ -13,7 +23,7 @@ Using Microsoft's Phi 1.5 model like it was never intended.
13
  </table>
14
 
15
  # Main Procedure
16
- This model is an adapter on [puffin phi v2](https://huggingface.co/teknium/Puffin-Phi-v2) trained using QLoRA and DPO on 60,000 samples from the [anthropic helpful only](https://huggingface.co/datasets/pvduy/rm_hh_helpful_only) dataset.
17
 
18
 
19
  ---
@@ -36,4 +46,4 @@ The following `bitsandbytes` quantization config was used during training:
36
  ### Framework versions
37
 
38
 
39
- - PEFT 0.5.0
 
1
+ ---
2
+ license: other
3
+ language:
4
+ - en
5
+ pipeline_tag: text-generation
6
+ datasets:
7
+ - LDJnr/Puffin
8
+ - pvduy/rm_hh_helpful_only
9
+ library_name: peft
10
+ ---
11
  <table>
12
  <tr>
13
  <td style="width: 30%; text-align: left; vertical-align: middle">
 
23
  </table>
24
 
25
  # Main Procedure
26
+ This model is an adapter on [puffin phi v2](https://huggingface.co/teknium/Puffin-Phi-v2) trained using [QLoRA](https://arxiv.org/pdf/2305.14314.pdf) and [DPO](https://arxiv.org/pdf/2305.18290.pdf) on 60,000 samples from the [anthropic helpful only](https://huggingface.co/datasets/pvduy/rm_hh_helpful_only) dataset.
27
 
28
 
29
  ---
 
46
  ### Framework versions
47
 
48
 
49
+ - PEFT 0.5.0