license: apache-2.0
language:
- code
- en
datasets:
- JetBrains-Research/commit-chronicle
tags:
- code
- commit_message_generation
pipeline_tag: text2text-generation
CMG/CMC: CodeT5 (without history)
This is the checkpoint for the CodeT5 model, fine-tuned for the commit message generation (and/or completion) task as part of the paper "From Commit Message Generation to History-Aware Commit Message Completion" (ASE 2023).
Details
For further details, please refer to:
- Paper: TODO
- Repository: https://github.com/JetBrains-Research/commit_message_generation
This model is based on the Salesforce/codet5-base checkpoint from "CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation".
This model was trained with commit diffs, WITHOUT commit message history.
This model was trained on the CommitChronicle dataset introduced in our study.
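Because this checkpoint takes only a commit diff as input, it can in principle be loaded like any other sequence-to-sequence model in the `transformers` library. The snippet below is a minimal sketch under stated assumptions, not the authors' inference pipeline: it assumes the checkpoint ships its own tokenizer files, passes the raw diff text to the model without the preprocessing used in the linked repository, and uses illustrative values for `max_input_tokens`, `max_new_tokens`, and `num_beams`. The helper name `generate_commit_message` is hypothetical.

```python
MODEL_ID = "JetBrains-Research/cmg-codet5-without-history"


def generate_commit_message(
    diff: str,
    max_input_tokens: int = 512,  # assumed encoder limit, not taken from the model card
    max_new_tokens: int = 64,     # illustrative cap on the generated message length
    num_beams: int = 4,           # illustrative beam width
) -> str:
    """Generate a commit message for a diff.

    The checkpoint is downloaded from the Hugging Face Hub on first use.
    """
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    # Truncate long diffs so the input fits the encoder context.
    inputs = tokenizer(
        diff,
        truncation=True,
        max_length=max_input_tokens,
        return_tensors="pt",
    )
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        num_beams=num_beams,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_commit_message(diff_text)` with the output of `git diff` as a string returns a single candidate message. For results comparable to the paper, the diff should be preprocessed as in the repository linked above.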
Our hyperparameter setting is mostly based on "RACE: Retrieval-augmented Commit Message Generation". The exact values are provided below:
Available checkpoints
We also released checkpoints for other models fine-tuned as part of our study.
- Models trained with commit message history:
  - CodeT5: JetBrains-Research/cmg-codet5-with-history
  - CodeReviewer: JetBrains-Research/cmg-codereviewer-with-history
  - RACE: JetBrains-Research/cmg-race-with-history
- Models trained without commit message history:
  - CodeT5: JetBrains-Research/cmg-codet5-without-history (this model)
  - CodeReviewer: JetBrains-Research/cmg-codereviewer-without-history
  - RACE: JetBrains-Research/cmg-race-without-history
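The checkpoint names listed above follow a regular pattern, which makes it easy to switch between them when comparing models. The sketch below builds the Hub identifiers from that pattern. The helper `checkpoint_id` and the constants `ORG`, `MODELS`, and `ALL_CHECKPOINTS` are hypothetical names introduced for illustration only.

```python
ORG = "JetBrains-Research"
MODELS = ("codet5", "codereviewer", "race")


def checkpoint_id(model: str, with_history: bool) -> str:
    """Build the Hub identifier for one of the released checkpoints."""
    if model not in MODELS:
        raise ValueError(f"unknown model {model!r}, expected one of {MODELS}")
    suffix = "with-history" if with_history else "without-history"
    return f"{ORG}/cmg-{model}-{suffix}"


# All six released checkpoints: with-history variants first, then without-history.
ALL_CHECKPOINTS = [
    checkpoint_id(model, with_history)
    for with_history in (True, False)
    for model in MODELS
]
```

For instance, `checkpoint_id("codet5", with_history=False)` yields the identifier of this model.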
Citation
TODO