tomekkorbak rasikabh commited on
Commit
052f735
1 Parent(s): 81a1701

Update README.md (#1)

Browse files

- Update README.md (56d281ab3fa7e3df9d024fcad0198e7ce99de376)


Co-authored-by: Rasika Bhalerao <rasikabh@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +10 -7
README.md CHANGED
@@ -49,24 +49,27 @@ model-index:
49
  results: []
50
  ---
51
 
52
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
53
- should probably proofread and complete it, then remove this comment. -->
54
-
55
  # goofy_pasteur
56
 
57
- This model was trained from scratch on the tomekkorbak/detoxify-pile-chunk3-0-50000, the tomekkorbak/detoxify-pile-chunk3-50000-100000, the tomekkorbak/detoxify-pile-chunk3-100000-150000, the tomekkorbak/detoxify-pile-chunk3-150000-200000, the tomekkorbak/detoxify-pile-chunk3-200000-250000, the tomekkorbak/detoxify-pile-chunk3-250000-300000, the tomekkorbak/detoxify-pile-chunk3-300000-350000, the tomekkorbak/detoxify-pile-chunk3-350000-400000, the tomekkorbak/detoxify-pile-chunk3-400000-450000, the tomekkorbak/detoxify-pile-chunk3-450000-500000, the tomekkorbak/detoxify-pile-chunk3-500000-550000, the tomekkorbak/detoxify-pile-chunk3-550000-600000, the tomekkorbak/detoxify-pile-chunk3-600000-650000, the tomekkorbak/detoxify-pile-chunk3-650000-700000, the tomekkorbak/detoxify-pile-chunk3-700000-750000, the tomekkorbak/detoxify-pile-chunk3-750000-800000, the tomekkorbak/detoxify-pile-chunk3-800000-850000, the tomekkorbak/detoxify-pile-chunk3-850000-900000, the tomekkorbak/detoxify-pile-chunk3-900000-950000, the tomekkorbak/detoxify-pile-chunk3-950000-1000000, the tomekkorbak/detoxify-pile-chunk3-1000000-1050000, the tomekkorbak/detoxify-pile-chunk3-1050000-1100000, the tomekkorbak/detoxify-pile-chunk3-1100000-1150000, the tomekkorbak/detoxify-pile-chunk3-1150000-1200000, the tomekkorbak/detoxify-pile-chunk3-1200000-1250000, the tomekkorbak/detoxify-pile-chunk3-1250000-1300000, the tomekkorbak/detoxify-pile-chunk3-1300000-1350000, the tomekkorbak/detoxify-pile-chunk3-1350000-1400000, the tomekkorbak/detoxify-pile-chunk3-1400000-1450000, the tomekkorbak/detoxify-pile-chunk3-1450000-1500000, the tomekkorbak/detoxify-pile-chunk3-1500000-1550000, the tomekkorbak/detoxify-pile-chunk3-1550000-1600000, the tomekkorbak/detoxify-pile-chunk3-1600000-1650000, the tomekkorbak/detoxify-pile-chunk3-1650000-1700000, the tomekkorbak/detoxify-pile-chunk3-1700000-1750000, the tomekkorbak/detoxify-pile-chunk3-1750000-1800000, the tomekkorbak/detoxify-pile-chunk3-1800000-1850000, the tomekkorbak/detoxify-pile-chunk3-1850000-1900000 and the tomekkorbak/detoxify-pile-chunk3-1900000-1950000 datasets.
 
58
 
59
  ## Model description
60
 
61
- More information needed
62
 
63
  ## Intended uses & limitations
64
 
65
- More information needed
 
 
 
 
 
66
 
67
  ## Training and evaluation data
68
 
69
- More information needed
70
 
71
  ## Training procedure
72
 
 
49
  results: []
50
  ---
51
 
 
 
 
52
  # goofy_pasteur
53
 
54
+ - **Repository: https://github.com/tomekkorbak/aligned-pretraining-objectives**
55
+ - **Paper: Arxiv link to be added**
56
 
57
  ## Model description
58
 
59
+ This model was trained using [pile-detoxify](https://huggingface.co/datasets/tomekkorbak/pile-detoxify), which is data from [The Pile](https://huggingface.co/datasets/the_pile), annotated based on toxicity detected by [Detoxify](https://github.com/unitaryai/detoxify).
60
 
61
  ## Intended uses & limitations
62
 
63
+ This model has been trained to generate text that receives a low score for toxicity from [Detoxify](https://github.com/unitaryai/detoxify).
64
+
65
+ While we have promising results with the methods used to avoid toxic text, we cannot guarantee that it will output text that is fully aligned with non-toxicity in every situation.
66
+ This model and its associated datasets are intended for research purposes only and should not be deployed anywhere.
67
+
68
+ Please take care to avoid misusing the datasets used to train this model (where toxicity and personal identifiable information are annotated) or putting anybody in danger by publicizing their information.
69
 
70
  ## Training and evaluation data
71
 
72
+ This model was trained using [pile-detoxify](https://huggingface.co/datasets/tomekkorbak/pile-detoxify).
73
 
74
  ## Training procedure
75