mergekit-community/mergekit-slerp-srinwor Text Generation • 10B • Updated about 1 month ago • 13
mergekit-community/mergekit-slerp-srinwor Text Generation • 10B • Updated about 1 month ago • 13
Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation Paper • 2406.14971 • Published Jun 21, 2024
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation Paper • 2410.08371 • Published Oct 10, 2024 • 3
Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit Paper • 2506.06607 • Published Jun 7 • 2