Ultron-Summarizer-3B Model Card

🏠 Homepage | πŸ’» Github | πŸ“„ Arxiv | πŸ“• PDF

List of Provided Model Series

🚨 Disclaimer: All models and datasets are intended for research purposes only.

Model Description

Model Details

  • Model: Ultron-Summarizer-3B is a fully open-source conversational summarizer that generates summaries for long-term conversations, including those with image-sharing turns.
  • Date: Ultron-Summarizer-3B was trained in 2024.
  • Training Dataset: Stark-Summary
  • Architecture: Ultron-Summarizer-3B was trained on top of LLaMA-3.2-3B.

How to Use

License and Recommendations

🚨 Ultron-Summarizer-3B is intended to be used for research purposes only.

Acknowledgement

This work was supported by a grant of the KAIST-KT joint research project through AI Tech Lab, Institute of convergence Technology, funded by KT [Project No. G01230605, Development of Task-oriented Persona-based Dialogue Generation Combining Multi-modal Interaction and Knowledge Modeling].

Citation

If you find the resources in this repository useful, please cite our work:

@article{lee2024stark,
  title={Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge},
  author={Lee, Young-Jun and Lee, Dokyong and Youn, Junyoung and Oh, Kyeongjin and Ko, Byungsoo and Hyeon, Jonghwan and Choi, Ho-Jin},
  journal={arXiv preprint arXiv:2407.03958},
  year={2024}
}
Downloads last month
14
Safetensors
Model size
3.21B params
Tensor type
BF16
Β·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for passing2961/Ultron-Summarizer-3B

Finetuned
(246)
this model

Dataset used to train passing2961/Ultron-Summarizer-3B

Collection including passing2961/Ultron-Summarizer-3B