Fizz 🏳️‍⚧️'s picture

Fizz 🏳️‍⚧️ PRO

Fizzarolli

·

https://discord.gg/PPBMhF2vgC

AI & ML interests

None yet

Recent Activity

liked a model about 2 hours ago

Anzhc/MS-LC-EQ-D-VR_VAE

liked a model 1 day ago

Mawdistical/Kuwutu-7B

liked a model 2 days ago

bartowski/Aurore-Reveil_Koto-Small-7B-IT-GGUF

View all activity

Organizations

Posts 2

Post

2695

hi everyone!

i wanted to share an experiment i did with upcycling phi-3 mini into an moe recently.
while benchmarks are definitely within a margin of error and they performed similarly, i think it's an interesting base to try and see if you can improve phi's performance! (maybe looking into HuggingFaceFW/fineweb-edu could be interesting, i also left some other notes if anyone with more compute access wants to try it themselves)

check it out! Fizzarolli/phi3-4x4b-v1

Post

3116

Is anyone looking into some sort of decentralized/federated dataset generation or classification by humans instead of synthetically?

From my experience with trying models, a *lot* of modern finetunes are trained on what amounts to, in essence, GPT-4 generated slop that makes everything sound like a rip-off GPT-4 (refer to i.e. the Dolphin finetunes). I have a feeling that this is a lot of the reason people haven't been quite as successful as Meta's instruct tunes of Llama 3.

spaces 4

Dia 1.6B

Generate audio from text input with optional audio prompt

Paligemma2 3b E621 Tagger

Shuttle 3 Diffusion

Generate detailed images from text prompts

Molmo 7B O 0924

Interact with images using text questions

models 58

Fizzarolli/Koto-Small-7B-IT_exl3-8bpw-h8

Text Generation • Updated 4 days ago • 11 • 1

Fizzarolli/koto-nano-Q8_0-GGUF

1B • Updated 15 days ago • 17

Fizzarolli/AFM-Pretrain-Koto-Final-ckpt-Q8_0-GGUF

5B • Updated 20 days ago • 132

Fizzarolli/AFM-Koto-Q8_0-GGUF

5B • Updated 20 days ago • 162

Fizzarolli/iloveugfhrte4ughrtfughruetghurtgh-Q4_K_M-GGUF

22B • Updated 26 days ago • 149

Fizzarolli/Heater-2025-yourpolitclsiisagangsin-Q5_K_M-GGUF

22B • Updated 27 days ago • 147

Fizzarolli/lfm2-sft-ckpt-408-Q8_0-GGUF

1B • Updated Jul 15 • 15

Fizzarolli/Mistral-Small-3.2-24B-Instruct-2506-Text-Only

24B • Updated Jun 24 • 11

Fizzarolli/l3-8b-kto-ckpt144

Updated Jun 8 • 3

Fizzarolli/l3-8b-kto-ckpt125

Updated Jun 8 • 3

datasets 28

Fizzarolli/human-stories-rewardify

Viewer • Updated Jan 23 • 2.88k • 5 • 2

Fizzarolli/fse-reward-modelling

Viewer • Updated Jan 23 • 100 • 8 • 1

Fizzarolli/FallingThroughTheSkies-592k-Filtered-Filtered

Viewer • Updated Oct 8, 2024 • 139k • 2 • 8

Fizzarolli/filtered-wit-recaptioned

Viewer • Updated Sep 2, 2024 • 10 • 2

Fizzarolli/goofed_up_logs

Viewer • Updated Aug 28, 2024 • 82.9k

Fizzarolli/stheno-filtered-v1.1-filtereded

Viewer • Updated Aug 26, 2024 • 8.81k • 4 • 1

Fizzarolli/goofed_up_logs_orig

Viewer • Updated Aug 26, 2024 • 13.5k

Fizzarolli/hh-rlhf-helpful-only

Viewer • Updated Aug 10, 2024 • 118k • 1

Fizzarolli/hh-rlhf-h4-test-revised

Viewer • Updated Aug 8, 2024 • 10 • 4 • 1

Fizzarolli/dclm-baseline-1.0-2.5k

Viewer • Updated Aug 8, 2024 • 2.5k • 9

View 28 datasets