I'd just start with ModernBERT-large, though; it's easier and a strong base. Less faffing about. Also, big vocab <3
They do PCA (prior to the Zipf weighting) and explicitly state that they found it improved performance.
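For context, here's a minimal sketch of what that pipeline looks like: PCA over the static token embeddings, followed by Zipf-style frequency weighting. The shapes, component count, and exact weighting function are illustrative assumptions, not model2vec's published settings.

```python
import numpy as np
from sklearn.decomposition import PCA

# Hypothetical input: one static embedding per vocabulary token,
# rows assumed sorted by corpus frequency (row 0 = most frequent).
embeddings = np.random.randn(32_000, 768).astype(np.float32)

# 1) PCA first: decorrelate the dimensions (optionally reducing them).
pca = PCA(n_components=256)  # 256 is an illustrative choice
reduced = pca.fit_transform(embeddings)

# 2) Zipf weighting afterwards: under Zipf's law, frequency ~ 1 / rank,
#    so a smooth function of rank down-weights very frequent tokens.
ranks = np.arange(1, len(reduced) + 1)
weighted = reduced * np.log(ranks + 1)[:, None]
```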
Did you try potion/model2vec as a starting point? (Never mind ModernBERT, with its much larger vocab.)
This is really cool! I'm surprised you do better than model2vec. Is the difference really just the use of a (better) contrastive pretraining loss?
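For reference, the kind of contrastive pretraining loss being asked about is typically an in-batch InfoNCE objective. A minimal sketch follows; the embedding dimension, batch construction, and temperature are placeholder assumptions, not the authors' actual recipe.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(anchors: torch.Tensor, positives: torch.Tensor,
                  temperature: float = 0.05) -> torch.Tensor:
    """In-batch InfoNCE: row i of `positives` is the positive for row i
    of `anchors`; every other row in the batch acts as a negative."""
    a = F.normalize(anchors, dim=-1)
    p = F.normalize(positives, dim=-1)
    logits = a @ p.T / temperature                   # (batch, batch) cosine sims
    labels = torch.arange(len(a), device=a.device)   # diagonal entries are positives
    return F.cross_entropy(logits, labels)

# Toy usage with random "sentence" embeddings:
loss = info_nce_loss(torch.randn(8, 256), torch.randn(8, 256))
```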