WangXM's picture

39 7

WangXM

MINGXiao12

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

GeoGalactica: A Scientific Large Language Model in Geoscience

upvoted a paper 18 days ago

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

upvoted a paper 18 days ago

SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity

View all activity

Organizations

None yet

MINGXiao12's activity

upvoted 20 papers 18 days ago

GeoGalactica: A Scientific Large Language Model in Geoscience

Paper • 2401.00434 • Published Dec 31, 2023 • 10

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

Paper • 2401.01117 • Published Jan 2, 2024 • 10

SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity

Paper • 2401.00604 • Published Dec 31, 2023 • 6

Unicron: Economizing Self-Healing LLM Training at Scale

Paper • 2401.00134 • Published Dec 30, 2023 • 11

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Paper • 2401.00849 • Published Jan 1, 2024 • 17

TrailBlazer: Trajectory Control for Diffusion-Based Video Generation

Paper • 2401.00896 • Published Dec 31, 2023 • 16

Boosting Large Language Model for Speech Synthesis: An Empirical Study

Paper • 2401.00246 • Published Dec 30, 2023 • 13

A Comprehensive Study of Knowledge Editing for Large Language Models

Paper • 2401.01286 • Published Jan 2, 2024 • 18

VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM

Paper • 2401.01256 • Published Jan 2, 2024 • 21

Are Vision-Language Models Truly Understanding Multi-vision Sensor?

Paper • 2412.20750 • Published Dec 30, 2024 • 20

HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

Paper • 2412.20735 • Published Dec 30, 2024 • 11

VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control

Paper • 2412.20800 • Published Dec 30, 2024 • 10

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published Dec 30, 2024 • 14

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published Dec 30, 2024 • 13

Facilitating large language model Russian adaptation with Learned Embedding Propagation

Paper • 2412.21140 • Published Dec 30, 2024 • 16

PERSE: Personalized 3D Generative Avatars from A Single Portrait

Paper • 2412.21206 • Published Dec 30, 2024 • 17

Edicho: Consistent Image Editing in the Wild

Paper • 2412.21079 • Published Dec 30, 2024 • 22

Bringing Objects to Life: 4D generation from 3D objects

Paper • 2412.20422 • Published Dec 29, 2024 • 35

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 35

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 45