haritzpuerto committed
Commit e3b8c07
Parent(s): 1e5afae
Update README.md
README.md CHANGED
@@ -138,4 +138,20 @@ We train all models using LoRA with the PEFT library. The main parameters are:
 | optim | paged\_adamw\_32bit |
 | lr\_scheduler\_type | constant |
 
-Please check Appendix B of the paper for more details.
+Please check Appendix B of the paper for more details.
+
+# Cite
+
+If you find our work useful, please consider citing it using the following citation:
+
+```
+@misc{puerto2024dcot,
+      title={Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models},
+      author={Haritz Puerto and Tilek Chubakov and Xiaodan Zhu and Harish Tayyar Madabushi and Iryna Gurevych},
+      year={2024},
+      eprint={2407.03181},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2407.03181},
+}
+```
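
For context on the parameter table in this hunk, below is a minimal sketch of how `optim` and `lr_scheduler_type` would typically be wired into a LoRA run with the PEFT and transformers libraries. This is not the authors' training script: the base model name, output directory, and LoRA rank/alpha are illustrative placeholders; the two optimizer settings are the ones from the table, and Appendix B of the paper has the full configuration.

```python
# Minimal sketch (not the repository's actual script): passing the README's
# optimizer settings to a LoRA fine-tuning setup with PEFT + transformers.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, TrainingArguments

# Placeholder base model; the repository's actual checkpoint may differ.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

# Placeholder LoRA rank/alpha; see Appendix B of the paper for the real values.
peft_config = LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM")
model = get_peft_model(model, peft_config)

args = TrainingArguments(
    output_dir="dcot-lora",            # placeholder
    optim="paged_adamw_32bit",         # from the parameter table above
    lr_scheduler_type="constant",      # from the parameter table above
)
```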