Evan's picture

Evan

evdcush

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

liked a model 5 days ago

liked a dataset 5 days ago

microsoft/orca-agentinstruct-1M-v1

Organizations

evdcush's activity

upvoted a paper about 15 hours ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published 2 days ago • 36

upvoted 2 collections 5 days ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 139

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 223

upvoted a collection 6 days ago

Common Corpus

Largest multilingual pretraining data. • 1 item • Updated 8 days ago • 7

upvoted a paper 9 days ago

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published 10 days ago • 60

upvoted a collection 10 days ago

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 7 items • Updated 2 days ago • 35

upvoted a paper 10 days ago

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 14 days ago • 34

upvoted a collection 11 days ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 9 items • Updated 4 days ago • 70

upvoted a paper 13 days ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted a collection about 2 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 28 days ago • 482

upvoted a collection 2 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 218

upvoted 3 papers 2 months ago

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

Paper • 2409.08239 • Published Sep 12 • 16

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Paper • 2409.04109 • Published Sep 6 • 43

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12 • 66

upvoted 6 papers 3 months ago

Diffusion Policy Policy Optimization

Paper • 2409.00588 • Published Sep 1 • 19

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3 • 35

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

LongRecipe: Recipe for Efficient Long Context Generalization in Large Languge Models

Paper • 2409.00509 • Published Aug 31 • 38

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28 • 83

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28 • 42