Article: Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 — By tomaarsen and 1 other • Jul 1
Reply: Thanks for sharing! I might have spotted a minor mistake: should "To ensure full coverage of all domains in the non-thinking dataset" really be "To ensure full coverage of all domains in the thinking dataset"?
Space: The Ultra-Scale Playbook 🌌 — The ultimate guide to training LLMs on large GPU clusters