DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding • Paper • 2411.14347 • Published 3 days ago • 3
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation • Paper • 2411.14384 • Published 3 days ago • 3
MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control • Paper • 2411.13807 • Published 4 days ago • 7
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training • Paper • 2411.13476 • Published 4 days ago • 12
ORID: Organ-Regional Information Driven Framework for Radiology Report Generation • Paper • 2411.13025 • Published 4 days ago • 2
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory • Paper • 2411.11922 • Published 6 days ago • 13
Stylecodes: Encoding Stylistic Information For Image Generation • Paper • 2411.12811 • Published 5 days ago • 7
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations • Paper • 2411.10818 • Published 8 days ago • 19
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages • Paper • 2411.12240 • Published 5 days ago • 5
Building Trust: Foundations of Security, Safety and Transparency in AI • Paper • 2411.12275 • Published 5 days ago • 10
SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning • Paper • 2411.10161 • Published 9 days ago • 6
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements • Paper • 2411.12044 • Published 6 days ago • 13
Continuous Speculative Decoding for Autoregressive Image Generation • Paper • 2411.11925 • Published 6 days ago • 13
RedPajama: an Open Dataset for Training Large Language Models • Paper • 2411.12372 • Published 5 days ago • 42
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing • Paper • 2411.11045 • Published 7 days ago • 9
SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers • Paper • 2411.10510 • Published 9 days ago • 8
AnimateAnything: Consistent and Controllable Animation for Video Generation • Paper • 2411.10836 • Published 8 days ago • 18