pegasus-inshorts / README.md
kubershahi's picture
Update README.md
2109427 verified
---
language: en
tags:
- Summarization
- Abstractive Summarization
model-index:
- name: kubershahi/pegasus-inshorts
results:
- task:
type: abstractitive summarization
name: abstractive summarization
dataset:
name: inshorts
type: inshorts
config: inshorts
split: train
metrics:
- name: ROUGE-L-P
type: rouge
value: 0.01074
verified: true
- name: ROUGE-L-R
type: rouge
value: 0.08284
verified: true
- name: ROUGE-L-F
type: rouge
value: 0.08284
verified: true
- name: ROUGE-1-P
type: rouge
value: 0.01074
verified: true
- name: ROUGE-1-R
type: rouge
value: 0.08284
verified: true
- name: ROUGE-1-f
type: rouge
value: 0.08284
verified: true
license: mit
datasets:
- kubershahi/inshorts
metrics:
- rouge
---
# Problem Statment:
Given a news article, generate a summary of two-to-three sentences and a headline for the article. The summary should be abstractive rather than extractive.
In abstractive summarization, new sentences are generated as part of the summary and the sentences in the summary might not be present in the news article.
# Model Description
This model builds on the [google/pegasus-large](https://huggingface.co/google/pegasus-large) model by finetuning it on a custom summary-headline dataset called [inshorts](https://github.com/kubershahi/ashoka-aml/blob/master/dataset/news_headline.csv).
After finetuning, to generate an appropriate headline of an article, get the summary of the article first from the pegasus-large model and then pass the summary through this model.
The two-way approach was taken to get apt headline from summary rather then generating the headline from the pegasus-large itself.
For more details about the project, click [here](https://github.com/kubershahi/ashoka-aml).