sileod commited on
Commit
16fcb24
1 Parent(s): 7014626

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -293,5 +293,7 @@ datasets:
293
 
294
  # Model Card for Model ID
295
 
296
- deberta-v3-base with context length of 1280 fine-tuned on tasksource for 150k steps. I oversampled tasks like ConTRoL, doc-nli.
297
- Training data include helpsteer v1/v2, logical reasoning tasks (FOLIO, FOL-nli, LogicNLI...), OASST, hh/rlhf, linguistics oriented NLI tasks, tasksource-dpo, fact verification tasks.
 
 
 
293
 
294
  # Model Card for Model ID
295
 
296
+ deberta-v3-base with context length of 1280 fine-tuned on tasksource for 150k steps. I oversampled long NLI tasks (ConTRoL, doc-nli).
297
+ Training data include helpsteer v1/v2, logical reasoning tasks (FOLIO, FOL-nli, LogicNLI...), OASST, hh/rlhf, linguistics oriented NLI tasks, tasksource-dpo, fact verification tasks.
298
+
299
+ This model is suitable for long context NLI or and as a backbone for RLHF fine-tuning.