Victor Gallego

vicgalle

AI & ML interests

Preference fine-tuning, alignment & synthetic data. Building LLMs in general!

Posts (1)

Can you merge models of different sizes? ⚗️

Well, yes, if the models are somewhat compatible. Here is an experiment I did. I wanted to merge two of the best-performing models: mlabonne/NeuralBeagle14-7B and jeonsworld/CarbonVillain-en-10.7B-v4.

Here is my recipe:
1. Expand the layers of NeuralBeagle to 10.7B, à la frankenmerge (config sketch below).
2. DPO-tune the previous model with a high-quality preference dataset, argilla/distilabel-intel-orca-dpo-pairs (training sketch below).
3. Merge the previous model with CarbonVillain (needs --allow-crimes in mergekit! 🔪); see the merge sketch below.
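
For step 1, here is a minimal sketch of what the depth up-scaling could look like as a mergekit passthrough config, written from Python for convenience. The layer ranges follow the usual SOLAR-style expansion from 32 to 48 layers and are an assumption, not necessarily the exact recipe I used; the output path is also just a placeholder.

```python
# Sketch: step 1, depth up-scaling ("frankenmerge") of NeuralBeagle14-7B to ~10.7B.
# Layer ranges are an assumption (SOLAR-style 32 -> 48 layers), not the exact recipe.
frankenmerge_config = """
slices:
  - sources:
      - model: mlabonne/NeuralBeagle14-7B
        layer_range: [0, 24]
  - sources:
      - model: mlabonne/NeuralBeagle14-7B
        layer_range: [8, 32]
merge_method: passthrough
dtype: bfloat16
"""

with open("frankenmerge.yaml", "w") as f:
    f.write(frankenmerge_config)

# Then run mergekit's CLI on the config, e.g.:
#   mergekit-yaml frankenmerge.yaml ./franken-beagle-10.7b
```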
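For step 2, a hedged sketch of DPO tuning with the trl library on argilla/distilabel-intel-orca-dpo-pairs. The model path, column mapping, and hyperparameters are assumptions for illustration, and the trl API differs slightly across versions.

```python
# Sketch: step 2, DPO-tuning the expanded model on a preference dataset.
# Paths, column names, and hyperparameters are assumptions, not the exact setup.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "./franken-beagle-10.7b"  # hypothetical output of step 1
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# DPO expects "prompt", "chosen", "rejected" columns; the source column name is assumed here.
dataset = load_dataset("argilla/distilabel-intel-orca-dpo-pairs", split="train")
dataset = dataset.rename_column("input", "prompt")
dataset = dataset.remove_columns(
    [c for c in dataset.column_names if c not in ("prompt", "chosen", "rejected")]
)

args = DPOConfig(
    output_dir="./franken-beagle-10.7b-dpo",
    beta=0.1,                       # assumed DPO temperature
    learning_rate=5e-6,             # assumed
    per_device_train_batch_size=2,  # assumed
    num_train_epochs=1,
)
# Newer trl versions take processing_class= instead of tokenizer=.
trainer = DPOTrainer(model=model, args=args, train_dataset=dataset, tokenizer=tokenizer)
trainer.train()
trainer.save_model()
```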
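For step 3, a sketch of the final merge. Only the --allow-crimes flag is confirmed above; the merge method and weights below are assumptions (a plain linear merge is just one plausible choice once both checkpoints have the same 10.7B shape).

```python
# Sketch: step 3, merging the DPO-tuned 10.7B model with CarbonVillain.
# merge_method and weights are assumptions; only --allow-crimes is confirmed by the post.
final_merge_config = """
models:
  - model: ./franken-beagle-10.7b-dpo   # hypothetical path from step 2
    parameters:
      weight: 0.5
  - model: jeonsworld/CarbonVillain-en-10.7B-v4
    parameters:
      weight: 0.5
merge_method: linear
dtype: float16
"""

with open("final_merge.yaml", "w") as f:
    f.write(final_merge_config)

# --allow-crimes lets mergekit combine checkpoints it would normally refuse to mix:
#   mergekit-yaml final_merge.yaml ./CarbonBeagle-11B --allow-crimes
```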

And here is the resulting model, CarbonBeagle-11B, which ranked at the top of the leaderboard for its size class:
vicgalle/CarbonBeagle-11B