蓋瑞王's picture

蓋瑞王

gary109

·

AI & ML interests

GAN,Music

Recent Activity

liked a Space 7 days ago

AI4Editing/MagicQuill

liked a model 15 days ago

microsoft/LLM2CLIP-Openai-L-14-336

upvoted a collection 15 days ago

View all activity

Organizations

None yet

gary109's activity

upvoted a collection 15 days ago

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 7 items • Updated 7 days ago • 37

upvoted a paper 16 days ago

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published 18 days ago • 14

upvoted 3 papers 2 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

Seeing Faces in Things: A Model and Dataset for Pareidolia

Paper • 2409.16143 • Published Sep 24 • 15

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Paper • 2409.14674 • Published Sep 23 • 41

upvoted 15 papers 3 months ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29 • 47

Scaling Up Diffusion and Flow-based XGBoost Models

Paper • 2408.16046 • Published Aug 28 • 9

Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17 • 21

TurboEdit: Instant text-based image editing

Paper • 2408.08332 • Published Aug 14 • 19

Can Large Language Models Understand Symbolic Graphics Programs?

Paper • 2408.08313 • Published Aug 15 • 7

D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning

Paper • 2408.08441 • Published Aug 15 • 7

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Paper • 2408.08459 • Published Aug 15 • 44

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15 • 38

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Paper • 2408.07547 • Published Aug 14 • 7

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space

Paper • 2408.07416 • Published Aug 14 • 6

Aquila2 Technical Report

Paper • 2408.07410 • Published Aug 14 • 13

3D Gaussian Editing with A Single Image

Paper • 2408.07540 • Published Aug 14 • 10

InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning

Paper • 2408.07089 • Published Aug 9 • 13

Generative Photomontage

Paper • 2408.07116 • Published Aug 13 • 19

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

Paper • 2408.06019 • Published Aug 12 • 13