Upload README.md
README.md
CHANGED
```diff
@@ -5,7 +5,7 @@ license: mit
 
 # model-card-testing
 
-model-card-testing is a
+model-card-testing is a distilled language model that can be used for text generation. Users of this model card should also consider information about the design, training, and limitations of gpt2.
 
 ## Model Details
 
```
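The new description says the model is intended for text generation, but this hunk carries no usage snippet. Below is a minimal sketch of what such a snippet might look like, assuming a transformers-compatible checkpoint; `distilgpt2` is used purely as a stand-in model ID, since the card's repository name here is a placeholder.

```python
# Illustrative only: "distilgpt2" is a stand-in ID, not this card's actual repository.
from transformers import pipeline, set_seed

set_seed(42)  # make the sampled continuations reproducible
generator = pipeline("text-generation", model="distilgpt2")

# Generate two short continuations of a prompt.
print(generator("Hello, I'm a language model,", max_length=30, num_return_sequences=2))
```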
```diff
@@ -24,8 +24,6 @@ Use the code below to get started with the model.
 
 
 
-
-
 Here is how to use this model to get the features of a given text in PyTorch:
 
 NOTE: This will need customization/fixing.
```
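The "Here is how to use this model to get the features of a given text in PyTorch" line is still a placeholder, as the NOTE beneath it acknowledges. A minimal sketch of what that snippet might look like, assuming the checkpoint loads with the standard transformers Auto classes; `distilgpt2` again stands in for the real model ID:

```python
# Illustrative sketch: replace "distilgpt2" with the actual model repository ID.
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("distilgpt2")
model = AutoModel.from_pretrained("distilgpt2")

text = "Replace me by any text you'd like."
encoded_input = tokenizer(text, return_tensors="pt")
output = model(**encoded_input)

# last_hidden_state holds one feature vector per input token:
# shape (batch_size, sequence_length, hidden_size)
features = output.last_hidden_state
print(features.shape)
```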
```diff
@@ -78,6 +76,11 @@ Using the model in high-stakes settings is out of scope for this model. The mod
 Significant research has explored bias and fairness issues with models for language generation (see, e.g., [Sheng et al. (2021)](https://aclanthology.org/2021.acl-long.330.pdf) and [Bender et al. (2021)](https://dl.acm.org/doi/pdf/10.1145/3442188.3445922)). This model also has persistent bias issues, as highlighted in these demonstrative examples below. Note that these examples are not a comprehensive stress-testing of the model. Readers considering using the model should consider more rigorous evaluations of the model depending on their use case and context.
 
 
+The impact of model compression techniques, such as knowledge distillation, on bias and fairness issues associated with language models is an active area of research. For example:
+- [Silva, Tambwekar and Gombolay (2021)](https://aclanthology.org/2021.naacl-main.189.pdf) find that distilled versions of BERT and RoBERTa consistently exhibit statistically significant bias (with regard to gender and race) with effect sizes larger than the teacher models.
+- [Xu and Hu (2022)](https://arxiv.org/pdf/2201.08542.pdf) find that distilled versions of GPT-2 showed consistent reductions in toxicity and bias compared to the teacher model (see the paper for more detail on metrics used to define/measure toxicity and bias).
+- [Gupta et al. (2022)](https://arxiv.org/pdf/2203.12574.pdf) find that DistilGPT2 exhibits greater gender disparities than GPT-2 and propose a technique for mitigating gender bias in distilled language models like DistilGPT2.
+
 
 
 
```