bloc97 (Bowen Peng)

upvoted a collection 10 months ago

Nemotron-UltraLong

Collection

3 items • Updated 8 days ago • 19

upvoted a paper about 1 year ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21, 2025 • 64

upvoted 2 papers over 1 year ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 56

Wavelets Are All You Need for Autoregressive Image Generation

Paper • 2406.19997 • Published Jun 28, 2024 • 31

upvoted a collection almost 2 years ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 888

upvoted 6 papers almost 2 years ago

Resonance RoPE: Improving Context Length Generalization of Large Language Models

Paper • 2403.00071 • Published Feb 29, 2024 • 24

Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29, 2024 • 53

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 116

upvoted a paper about 2 years ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 260

upvoted 3 papers over 2 years ago

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 39

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 89

Bowen Peng

AI & ML interests

Organizations