---
datasets:
- mbshr/XSUMUrdu-DW_BBC
language:
- ur
metrics:
- rouge
- bertscore
pipeline_tag: summarization
---
# Model Card for Model ID
### Summarization Model (Type:T5)
Summarization: Extractive and Abstractive
- urT5 is adapted from mT5 and keeps a monolingual vocabulary only: 40k Urdu tokens.
- Fine-tuned on https://huggingface.co/mbshr/XSUMUrdu-DW_BBC; see https://doi.org/10.48550/arXiv.2310.02790 for details.
## Model Details
### Model Description
<!-- Provide a longer summary of what this model is. -->
- **Developed by:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Model type:** urT5, an adapted version of mT5
- **Language(s) (NLP):** Urdu
- **License:** [More Information Needed]
- **Finetuned from model [optional]:** google/mt5-base
### Model Sources [optional]
<!-- Provide the basic links for the model. -->
- **Repository:** [More Information Needed]
- **Paper [optional]:** https://doi.org/10.48550/arXiv.2310.02790
## Uses
Summarization
## How to Get Started with the Model
Use the code below to get started with the model.
[More Information Needed]
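A minimal sketch, assuming the checkpoint loads through the standard `transformers` seq2seq classes. `MODEL_ID` is a hypothetical placeholder, not the real repository name, and the input length and beam-search settings are illustrative assumptions rather than the authors' configuration.

```python
# Hypothetical placeholder: replace with the actual Hub repository ID of this model.
MODEL_ID = "your-namespace/your-urt5-checkpoint"


def summarize(text: str, model_id: str = MODEL_ID, max_new_tokens: int = 64) -> str:
    """Generate a summary for one Urdu article with a seq2seq checkpoint."""
    # Imported here so the heavy dependency is only loaded when the function is called.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    # Truncate long articles; 512 tokens is an assumed input limit, not a documented one.
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)

    # Beam search with 4 beams is an illustrative default.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, num_beams=4)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(article_text)` with an Urdu news article returns the generated summary as a string. It requires `transformers` and `torch` to be installed and downloads the checkpoint on first use.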
## Training Details
## Evaluation & Results
<!-- This section describes the evaluation protocols and provides the results. -->
Evaluated on https://huggingface.co/mbshr/XSUMUrdu-DW_BBC
- ROUGE-1 F-score: 40.03 combined (46.35 on BBC Urdu datapoints only, 36.91 on DW Urdu datapoints only)
- BERTScore: 75.1 combined (77.0 on BBC Urdu datapoints only, 74.16 on DW Urdu datapoints only)
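
For readers unfamiliar with the metric, ROUGE-1 F is the harmonic mean of unigram precision and recall between a generated summary and its reference. The sketch below assumes plain whitespace tokenization, which may differ from the tokenization used to produce the scores above; it returns a value in [0, 1], whereas the figures above are on a 0 to 100 scale.

```python
from collections import Counter


def rouge1_f(candidate: str, reference: str) -> float:
    """ROUGE-1 F-score with whitespace tokenization (an assumption of this sketch)."""
    cand_counts = Counter(candidate.split())
    ref_counts = Counter(reference.split())

    # Clipped unigram overlap: each token counts at most as often as it appears in both.
    overlap = sum((cand_counts & ref_counts).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)
```

The same function works on Urdu text as long as words are separated by whitespace; multiply by 100 to compare with the numbers reported here.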
## Citation [optional]
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
@misc{munaf2023low,
title={Low Resource Summarization using Pre-trained Language Models},
author={Mubashir Munaf and Hammad Afzal and Naima Iltaf and Khawir Mahmood},
year={2023},
eprint={2310.02790},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
## Contact
- mubashir.munaaf@gmail.com