kuleshov-group
/

mdlm-owt

@@ -1,12 +1,14 @@
 ---
 library_name: transformers
-tags: []
 ---
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
 ## Model Details
@@ -20,22 +22,22 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 - **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
-### Model Sources [optional]
 <!-- Provide the basic links for the model. -->
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
 - **Demo [optional]:** [More Information Needed]
 ## Uses
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 ### Direct Use
@@ -79,7 +81,7 @@ Use the code below to get started with the model.
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-[More Information Needed]
 ### Training Procedure
@@ -174,11 +176,29 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 **BibTeX:**
-[More Information Needed]
 **APA:**
-[More Information Needed]
 ## Glossary [optional]

 ---
 library_name: transformers
+license: apache-2.0
+language:
+- en
 ---
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
+This is a masked diffusion model that generates text using a diffusion process trained on the OpenWebText dataset.
 ## Model Details
 - **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
+- **Model type:** Masked Language Model
+- **Language(s) (NLP):** en
+- **License:** Apache 2.0
+### Model Sources
 <!-- Provide the basic links for the model. -->
+- **Repository:** https://github.com/kuleshov-group/mdlm
+- **Paper [optional]:** https://arxiv.org/abs/2406.07524
 - **Demo [optional]:** [More Information Needed]
 ## Uses
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+* Research
 ### Direct Use
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+https://huggingface.co/datasets/Skylion007/openwebtext
 ### Training Procedure
 **BibTeX:**
+```
+@misc{sahoo2024simple,
+      title={Simple and Effective Masked Diffusion Language Models},
+      author={Subham Sekhar Sahoo and Marianne Arriola and Yair Schiff and Aaron Gokaslan and Edgar Marroquin and Justin T Chiu and Alexander Rush and Volodymyr Kuleshov},
+      year={2024},
+      eprint={2406.07524},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
 **APA:**
+```
+@software{Sahoo_Simple_and_Effective_2024,
+author = {Sahoo, Subham Sekhar and Arriola, Marianne and Schiff, Yair and Gokaslan, Aaron and Marroquin, Edgar and Chiu, Justin T and Rush, Alexander and Kuleshov, Volodymyr},
+doi = {10.48550/arXiv.2406.07524},
+month = jun,
+title = {{Simple and Effective Masked Diffusion Language Models}},
+version = {arXiv:2406.07524v1},
+year = {2024}
+}
+```
 ## Glossary [optional]