RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated Oct 1 • 5
Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published Jul 19 • 38
MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition Paper • 2302.13750 • Published Feb 27, 2023 • 2
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17 • 50
Article From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease Oct 21, 2022 • 15
Tuna: Instruction Tuning using Feedback from Large Language Models Paper • 2310.13385 • Published Oct 20, 2023 • 10
Datasets: A Community Library for Natural Language Processing Paper • 2109.02846 • Published Sep 7, 2021 • 10
Estimating Knowledge in Large Language Models Without Generating a Single Token Paper • 2406.12673 • Published Jun 18 • 7
A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models Paper • 2406.11289 • Published Jun 17 • 5