sarahmiller137's picture
Update README.md
4500be6
|
raw
history blame
2.06 kB
metadata
language:
  - en
thumbnail: url to a thumbnail used in social sharing
license: cc
datasets:
  - "MIMIC-III\_"
widget:
  - text: >-
      This case report highlights delay in the diagnosis of adenoma carcinoma of
      the lung in a female patient who has never smoked.  It took three months
      to reach the diagnosis of stage IV lung carcinoma despite the presence of
      symptoms and an abnormal chest radiograph nding from the beginning.

Model information:

This model is the roberta-base model that has been finetuned using radiology report texts from the MIMIC-III database. The task performed was text classification in order to benchmark this model with a selection of other variants of BERT for the classifcation of MIMIC-III radiology report texts into two classes. Labels of [0,1] were assigned to radiology reports in MIMIC-III that were linked to an ICD9 diagnosis code for lung cancer = 1 and a random sample of reports which were not linked to any type of cancer diagnosis code at all = 0.

Intended uses:

This model is intended to be used to classify texts to identify the presence of lung cancer. The model will predict lables of [0,1].

Limitations:

Note that the dataset and model may not be fully represetative or suitable for all needs it is recommended that the paper for the dataset and the base model card should be reviewed before use -

Widget text:

The text displayed in the example widget was taken from the following papers abstract - Ching, S. M, et al

How to use:

Load the model from the library using the following checkpoints:

from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("sarahmiller137/roberta-base-ft-m3-lc")
model = AutoModel.from_pretrained("sarahmiller137/roberta-base-ft-m3-lc")