J38 committed
Commit 39fea4e · 1 Parent(s): 68d2933

Update README.md

Files changed (1): README.md +7 -7
README.md CHANGED
@@ -4,17 +4,17 @@ widget:
  - text: 'Photosynthesis is'
 ---
 
-# Model Card for Pubmed GPT 2.7B
+# Model Card for PubmedGPT 2.7B
 
-PubMed GPT 2.7B is new language model trained exclusively on biomedical abstracts and papers from [The Pile](https://pile.eleuther.ai/). This GPT-style model can achieve strong results on a variety of biomedical NLP tasks, including a new state of the art performance of 50.3% accuracy on the MedQA biomedical question answering task.
+PubMedGPT 2.7B is new language model trained exclusively on biomedical abstracts and papers from [The Pile](https://pile.eleuther.ai/). This GPT-style model can achieve strong results on a variety of biomedical NLP tasks, including a new state of the art performance of 50.3% accuracy on the MedQA biomedical question answering task.
 
-As an autoregressive language model, PubMed GPT 2.7B is also capable of natural language generation. However, we have only begun to explore the generation capabilities and limitations of this model, and we emphasize that this model’s generation capabilities are for research purposes only and not suitable for production. In releasing this model, we hope to advance both the development of biomedical NLP applications and best practices for responsibly training and utilizing domain-specific language models; issues of reliability, truthfulness, and explainability are top of mind for us.
+As an autoregressive language model, PubMedGPT 2.7B is also capable of natural language generation. However, we have only begun to explore the generation capabilities and limitations of this model, and we emphasize that this model’s generation capabilities are for research purposes only and not suitable for production. In releasing this model, we hope to advance both the development of biomedical NLP applications and best practices for responsibly training and utilizing domain-specific language models; issues of reliability, truthfulness, and explainability are top of mind for us.
 
 This model was a joint collaboration of [Stanford CRFM](https://crfm.stanford.edu/) and [MosaicML](https://www.mosaicml.com/).
 
 # Table of Contents
 
-- [Model Card for Pubmed GPT 2.7B](#model-card-for--model_id-)
+- [Model Card for PubmedGPT 2.7B](#model-card-for--model_id-)
 - [Table of Contents](#table-of-contents)
 - [Model Details](#model-details)
 - [Model Description](#model-description)
@@ -37,9 +37,9 @@ This model was a joint collaboration of [Stanford CRFM](https://crfm.stanford.ed
 ## Model Description
 
 <!-- Provide a longer summary of what this model is/does. -->
-PubMed GPT 2.7B is new language model trained exclusively on biomedical abstracts and papers from [The Pile](https://pile.eleuther.ai/). This GPT-style model can achieve strong results on a variety of biomedical NLP tasks, including a new state of the art performance of 50.3% accuracy on the MedQA biomedical question answering task.
+PubMedGPT 2.7B is new language model trained exclusively on biomedical abstracts and papers from [The Pile](https://pile.eleuther.ai/). This GPT-style model can achieve strong results on a variety of biomedical NLP tasks, including a new state of the art performance of 50.3% accuracy on the MedQA biomedical question answering task.
 
-As an autoregressive language model, PubMed GPT 2.7B is also capable of natural language generation. However, we have only begun to explore the generation capabilities and limitations of this model, and we emphasize that this model’s generation capabilities are for research purposes only and not suitable for production. In releasing this model, we hope to advance both the development of biomedical NLP applications and best practices for responsibly training and utilizing domain-specific language models; issues of reliability, truthfulness, and explainability are top of mind for us.
+As an autoregressive language model, PubMedGPT 2.7B is also capable of natural language generation. However, we have only begun to explore the generation capabilities and limitations of this model, and we emphasize that this model’s generation capabilities are for research purposes only and not suitable for production. In releasing this model, we hope to advance both the development of biomedical NLP applications and best practices for responsibly training and utilizing domain-specific language models; issues of reliability, truthfulness, and explainability are top of mind for us.
 
 This model was a joint collaboration of [Stanford CRFM](https://crfm.stanford.edu/) and [MosaicML](https://www.mosaicml.com/).
 
@@ -131,7 +131,7 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 
 ## Model Architecture and Objective
 
-Pubmed GPT 2.7B is a standard GPT-2 implementation (trained with Flash Attention) with the following hyperparameters:
+PubmedGPT 2.7B is a standard GPT-2 implementation (trained with Flash Attention) with the following hyperparameters:
 
 | | |
 | ----------- | ----- |
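
The card text in this diff says the model is "also capable of natural language generation", and its widget front matter seeds the prompt 'Photosynthesis is'. Below is a minimal sampling sketch using the Hugging Face `transformers` library, assuming the model loads as a standard GPT-2 causal LM; the Hub id `stanford-crfm/pubmedgpt` is an assumption for illustration, not confirmed by this diff.

```python
# Minimal generation sketch; "stanford-crfm/pubmedgpt" is an assumed Hub id.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "stanford-crfm/pubmedgpt"  # hypothetical; substitute the real repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Seed with the widget prompt from the card's YAML front matter.
inputs = tokenizer("Photosynthesis is", return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    do_sample=True,
    top_p=0.9,
    pad_token_id=tokenizer.eos_token_id,  # GPT-2 tokenizers define no pad token
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

As the card stresses, generation is for research purposes only and not suitable for production use.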
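The final hunk describes the model as "a standard GPT-2 implementation (trained with Flash Attention)", but the hyperparameter table is truncated in this diff. One way to recover those values is to read them off the published config; again, the Hub id here is assumed:

```python
# Sketch: inspect the GPT-2 hyperparameters the truncated table refers to.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("stanford-crfm/pubmedgpt")  # assumed Hub id
print(config.model_type)                             # "gpt2" for a standard GPT-2 implementation
print(config.n_layer, config.n_head, config.n_embd)  # depth, attention heads, hidden size
print(config.n_positions, config.vocab_size)         # context length, vocabulary size
```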