23 147

Zhimeng Guo

zhimeng

https://zhimeng.page

AI & ML interests

Machine Learning

Recent Activity

liked a model 30 days ago

Qwen/Qwen3-235B-A22B

liked a dataset about 1 month ago

cais/mmlu

liked a model about 1 month ago

ai21labs/AI21-Jamba-Large-1.5

View all activity

Organizations

upvoted an article about 2 months ago

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Jan 5

•

upvoted an article 4 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

Sep 22, 2025

•

129

upvoted an article about 1 year ago

Article

Open R1: Update #2

Feb 10, 2025

•

218

upvoted a paper over 1 year ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

upvoted 8 papers almost 2 years ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 65

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

Paper • 2403.05438 • Published Mar 8, 2024 • 20

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Paper • 2401.11708 • Published Jan 22, 2024 • 30

upvoted 5 papers about 2 years ago

Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20, 2024 • 31

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Paper • 2402.08017 • Published Feb 12, 2024 • 27

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

Paper • 2402.07456 • Published Feb 12, 2024 • 46

Model Editing with Canonical Examples

Paper • 2402.06155 • Published Feb 9, 2024 • 13

upvoted a collection about 2 years ago

OLMo Suite

Collection

Artifacts for the first set of OLMo models. • 18 items • Updated Dec 23, 2025 • 75

upvoted 2 papers about 2 years ago

Scavenging Hyena: Distilling Transformers into Long Convolution Models

Paper • 2401.17574 • Published Jan 31, 2024 • 17

Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach

Paper • 2401.02987 • Published Jan 2, 2024 • 10

Zhimeng Guo

AI & ML interests

Recent Activity

Organizations

zhimeng's activity

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Gaia2 and ARE: Empowering the community to study agents

Open R1: Update #2