Little brother(s) of big DeepSeek-R1?

#124
by MrDevolver - opened

I know there are small distilled models, and I highly appreciate each one of them, but I feel they could have been much better if they had been created from scratch, just like the big R1 model. You know, something that would be completely your own awesome creation! If I could run the big model locally on my PC, I would. Unfortunately, I can only run smaller models of up to 32B parameters (heavily quantized at the high end).

So, could the big R1 model please get some real little brother(s) that are completely DeepSeek at the core? That would be awesome! ❤
