8 14 7

Chenyang Song

Raincleared

AI & ML interests

None yet

Recent Activity

New activity 23 days ago

openbmb/MiniCPM-S-1B-sft:Adding `safetensors` variant of this model

authored a paper 23 days ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

updated a model 24 days ago

SparseLLM/sparsing-law-0.1b-relu

View all activity

Organizations

Raincleared's activity

New activity in openbmb/MiniCPM-S-1B-sft 23 days ago

Adding `safetensors` variant of this model

#1 opened 23 days ago by

SFconvertbot

authored a paper 23 days ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published 24 days ago • 11

updated a model 24 days ago

SparseLLM/sparsing-law-0.1b-relu

Text Generation • Updated 24 days ago • 35 • 1

upvoted a paper 24 days ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published 24 days ago • 11

commented a paper 24 days ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published 24 days ago • 11 •

New activity in SparseLLM/prosparse-llama-2-7b about 1 month ago

why does this model use FP32??

#6 opened about 1 month ago by

purejomo

upvoted a paper 3 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4 • 27

authored a paper 3 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4 • 27

liked a model 4 months ago

mistralai/Mistral-Large-Instruct-2407

Updated Oct 16 • 16.1k • 809

updated a model 5 months ago

openbmb/MiniCPM-S-1B-sft-gguf

Updated Jul 4 • 31 • 6

updated a collection 5 months ago

MiniCPM

Collection

The MiniCPM family of LLMs and VLLMs. • 31 items • Updated Oct 22 • 54

liked a model 5 months ago

openbmb/MiniCPM-S-1B-sft-gguf

Updated Jul 4 • 31 • 6

upvoted a paper 5 months ago

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Paper • 2406.15718 • Published Jun 22 • 14

updated a collection 6 months ago

MiniCPM

Collection

The MiniCPM family of LLMs and VLLMs. • 31 items • Updated Oct 22 • 54

liked a model 6 months ago

openbmb/MiniCPM-S-1B-sft-llama-format

Text Generation • Updated Sep 7 • 13 • 4

upvoted 2 papers 6 months ago

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10 • 22

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10 • 36

updated a collection 6 months ago

MiniCPM

Collection

The MiniCPM family of LLMs and VLLMs. • 31 items • Updated Oct 22 • 54

updated a model 6 months ago

SparseLLM/ProSparse-MiniCPM-1B-sft

Text Generation • Updated Jun 3 • 12 • 2