license: apache-2.0
language:
- code
- en
datasets:
- JetBrains-Research/commit-chronicle
tags:
- code
- commit_message_generation
pipeline_tag: text2text-generation
CMG/CMC: CodeT5 (with history)
This is the checkpoint for the CodeT5 model, fine-tuned for the commit message generation (and/or completion) task as part of the paper "From Commit Message Generation to History-Aware Commit Message Completion", ASE 2023.
Details
For further details, please refer to the paper cited above.
This model is based on the Salesforce/codet5-base checkpoint from "CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation".
This model was trained with commit diffs as well as WITH commit message history.
This model was trained on the CommitChronicle dataset introduced in our study.
Our hyperparameter settings are mostly based on "RACE: Retrieval-augmented Commit Message Generation". The exact values are provided below:
Available checkpoints
We also released checkpoints for other models fine-tuned as part of our study.
- Models trained with commit message history:
  - CodeT5: JetBrains-Research/cmg-codet5-with-history (this model)
  - CodeReviewer: JetBrains-Research/cmg-codereviewer-with-history
  - RACE: JetBrains-Research/cmg-race-with-history
- Models trained without commit message history:
  - CodeT5: JetBrains-Research/cmg-codet5-without-history
  - CodeReviewer: JetBrains-Research/cmg-codereviewer-without-history
  - RACE: JetBrains-Research/cmg-race-without-history
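The checkpoint names listed above follow a regular pattern, so a small helper can build the repository id and load the model. The following is a minimal sketch, not the authors' inference pipeline. It assumes the CodeT5 checkpoints load with the generic `AutoTokenizer` and `AutoModelForSeq2SeqLM` classes from `transformers`, and it feeds only a raw diff to the encoder. The helper names `checkpoint_id` and `generate_message`, the 512-token truncation limit, and the generation length are illustrative assumptions. How commit message history should be formatted for the with-history models is not described here, so the sketch does not attempt it.

```python
ORG = "JetBrains-Research"
MODELS = ("codet5", "codereviewer", "race")


def checkpoint_id(model: str = "codet5", with_history: bool = True) -> str:
    """Build the repository id for one of the checkpoints listed above."""
    if model not in MODELS:
        raise ValueError(f"unknown model {model!r}; expected one of {MODELS}")
    suffix = "with-history" if with_history else "without-history"
    return f"{ORG}/cmg-{model}-{suffix}"


def generate_message(diff: str, repo_id: str, max_new_tokens: int = 32) -> str:
    """Generate a commit message for a diff (hypothetical helper).

    Assumption: the checkpoint is loadable through the generic Auto classes.
    Input truncation and generation length are illustrative values only.
    """
    # Imported lazily so that checkpoint_id() works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(repo_id)

    inputs = tokenizer(diff, return_tensors="pt", truncation=True, max_length=512)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # Only builds the id; calling generate_message() would download the weights.
    print(checkpoint_id("codet5", with_history=True))
```

Calling `generate_message(diff_text, checkpoint_id())` downloads the weights on first use. The RACE checkpoints may need a different loading path, so treat the sketch as applying to the CodeT5 variants only.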
Citation
TODO