126 78 1970

Nicky

NickyNicky

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago

stepfun-ai/Step-Audio-Chat

reacted to Jaward's post with 🔥 about 5 hours ago

Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram. Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

liked a model 2 days ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview-GGUF

View all activity

Organizations

NickyNicky's activity

liked a model about 3 hours ago

stepfun-ai/Step-Audio-Chat

Audio-Text-to-Text • Updated about 4 hours ago • 2 • 91

reacted to Jaward's post with 🔥 about 5 hours ago

Post

957

Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram.
Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

liked a model 2 days ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview-GGUF

Updated about 23 hours ago • 9.09k • 56

liked a model 3 days ago

agentica-org/DeepScaleR-1.5B-Preview

Updated 7 days ago • 9.68k • 402

updated a dataset 4 days ago

NickyNicky/reasoning-orca-agentinstruct-13k-len_token-llama-percentil-20

Viewer • Updated 4 days ago • 12.6k

published a dataset 4 days ago

NickyNicky/reasoning-orca-agentinstruct-13k-len_token-llama-percentil-20

Viewer • Updated 4 days ago • 12.6k

updated a dataset 4 days ago

NickyNicky/reasoning-orca-agentinstruct-62k-len_token-llama

Viewer • Updated 4 days ago • 62.9k

published a dataset 4 days ago

NickyNicky/reasoning-orca-agentinstruct-62k-len_token-llama

Viewer • Updated 4 days ago • 62.9k

updated a dataset 4 days ago

NickyNicky/reasoning-orca-agentinstruct-62k

Viewer • Updated 4 days ago • 62.9k

published a dataset 4 days ago

NickyNicky/reasoning-orca-agentinstruct-62k

Viewer • Updated 4 days ago • 62.9k

liked 4 models 4 days ago

liked a model 5 days ago

allenai/Llama-3.1-Tulu-3.1-8B

Text Generation • Updated 7 days ago • 400 • 20

liked a dataset 5 days ago

sequelbox/Raiden-DeepSeek-R1

Viewer • Updated 7 days ago • 62.9k • 60 • 25

reacted to sequelbox's post with 🔥 5 days ago

Post

2590

Raiden is here! 63k creative-reasoning and analytic-reasoning prompts answered by DeepSeek's 685b R1 model!

- All prompts from microsoft/orca-agentinstruct-1M-v1 and all responses from deepseek-ai/DeepSeek-R1
- A deep look at R1's reasoning skills! Use as you will.

Get it now: sequelbox/Raiden-DeepSeek-R1

for everyone :)

liked 3 models 5 days ago

open-thoughts/OpenThinker-32B

Text Generation • Updated 4 days ago • 717 • 102

Nicky

AI & ML interests

Recent Activity

Organizations

NickyNicky's activity

stepfun-ai/Step-Audio-Chat

NousResearch/DeepHermes-3-Llama-3-8B-Preview-GGUF

agentica-org/DeepScaleR-1.5B-Preview

NickyNicky/reasoning-orca-agentinstruct-13k-len_token-llama-percentil-20

NickyNicky/reasoning-orca-agentinstruct-13k-len_token-llama-percentil-20

NickyNicky/reasoning-orca-agentinstruct-62k-len_token-llama

NickyNicky/reasoning-orca-agentinstruct-62k-len_token-llama

NickyNicky/reasoning-orca-agentinstruct-62k

NickyNicky/reasoning-orca-agentinstruct-62k

NousResearch/DeepHermes-3-Llama-3-8B-Preview

teknium/Llama-3.1-AlternateTokenizer

answerdotai/ModernBERT-Large-Instruct

HKUSTAudio/Llasa-1B-two-speakers-kore-puck

allenai/Llama-3.1-Tulu-3.1-8B

sequelbox/Raiden-DeepSeek-R1

open-thoughts/OpenThinker-32B

nomic-ai/nomic-embed-text-v2-moe-unsupervised

nomic-ai/nomic-embed-text-v2-moe