File size: 289 Bytes
1fc2718 13c576d 1fc2718 13c576d |
1 2 3 4 5 6 7 8 |
---
license: apache-2.0
datasets:
- Helsinki-NLP/europarl
---
Trained SFT policy for MT task in the paper "[ALaRM: Align Language Models via Hierarchical Rewards Modeling](https://arxiv.org/abs/2403.06754)".
Check out our [project page](https://alarm-fdu.github.io/) for more information. |