1 247 99

Richrich

RichardForests

AI & ML interests

None yet

Recent Activity

upvoted an article 24 days ago

Open-source DeepResearch – Freeing our search agents

upvoted a paper about 2 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

liked a model 2 months ago

thenlper/gte-base

View all activity

Organizations

RichardForests's activity

upvoted an article 24 days ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.16k

upvoted a paper about 2 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 49

upvoted an article 3 months ago

Article

Merge Large Language Models with mergekit

•

Jan 9, 2024

• 102

upvoted a collection 3 months ago

xLAM models

Collection

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 24 days ago • 47

upvoted a paper 3 months ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published Dec 11, 2024 • 18

upvoted 3 articles 3 months ago

Article

Better RAG 3: The text is your friend

•

Mar 14, 2024

• 7

Article

Better RAG 2: Single-shot is not good enough

•

Mar 14, 2024

• 12

Article

Better RAG 1: Advanced Basics

•

Mar 14, 2024

• 24

upvoted a paper 4 months ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 52

upvoted 2 collections 4 months ago

MoE_Papers

Collection

4 items • Updated Dec 25, 2024 • 1

LLM

Collection

37 items • Updated 13 days ago • 1

upvoted 2 papers 4 months ago

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15, 2024 • 35

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Paper • 2402.05099 • Published Feb 7, 2024 • 20

upvoted an article 4 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7, 2024

• 60

upvoted a paper 4 months ago

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

Paper • 2210.14986 • Published Oct 26, 2022 • 5

upvoted a paper 8 months ago

KAN or MLP: A Fairer Comparison

Paper • 2407.16674 • Published Jul 23, 2024 • 43

upvoted an article 9 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 126

upvoted 3 papers 9 months ago

Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models

Paper • 2406.13099 • Published Jun 18, 2024 • 4

ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

Paper • 2406.14130 • Published Jun 20, 2024 • 10

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20