Update README.md
README.md
---
# Model Card for Llama-3.1_OpenScholar-8B

<!-- Provide a quick summary of what the model is/does. -->

Llama-3.1_OpenScholar-8B is a fine-tuned 8B model for scientific literature synthesis.
It is trained on the [os-data](https://huggingface.co/datasets/OpenScholar/os-data) dataset.
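
For a quick smoke test, the model can be loaded with Hugging Face `transformers`. This is a minimal sketch, assuming the checkpoint is published under the `OpenScholar/Llama-3.1_OpenScholar-8B` repo id (the repo id and the prompt are illustrative assumptions, not confirmed by this card):

```python
# Minimal inference sketch -- the repo id below is an assumption based on this card's links.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "OpenScholar/Llama-3.1_OpenScholar-8B"  # assumed Hub repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

prompt = "Summarize recent findings on retrieval-augmented generation for scientific literature synthesis."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)

# Decode only the newly generated tokens, not the echoed prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```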
### Model Description

<!-- Provide a longer summary of what this model is. -->

- **Developed by:** University of Washington, Allen Institute for AI (AI2)
- **Model type:** a Transformer-style autoregressive language model
- **Language(s) (NLP):** English
- **License:** The code and model are released under Apache 2.0.
- **Date cutoff:** Training data is based on peS2o v2, which includes papers up to January 2023. We also mix in training data from Tulu3 and [SciRIFF](https://huggingface.co/datasets/allenai/SciRIFF-train-mix); a sketch for inspecting these datasets follows this list.
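
The training mix referenced above can be inspected directly with the `datasets` library. A small sketch, using the dataset ids linked in this card (the `train` split name is an assumption):

```python
# Sketch: peek at the supervision datasets referenced above.
# Dataset ids come from the links in this card; the "train" split name is assumed.
from datasets import load_dataset

os_data = load_dataset("OpenScholar/os-data", split="train")
sciriff = load_dataset("allenai/SciRIFF-train-mix", split="train")

print(os_data)     # number of rows and column names
print(os_data[0])  # a single raw training example
print(sciriff[0])
```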
### Model Sources

<!-- Provide the basic links for the model. -->

- **Project Page:** https://open-scholar.allen.ai/
- **Repositories:**
  - Core repo (training, inference, fine-tuning, etc.): https://github.com/AkariAsai/OpenScholar
  - Evaluation code: https://github.com/AkariAsai/ScholarQABench
- **Paper:** [Link]()
- **Technical blog post:** https://allenai.org/blog/openscholar
<!-- - **Press release:** TODO -->

## License

Llama-3.1_OpenScholar-8B is a fine-tuned version of [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B). It is licensed under Apache 2.0.