view article Article Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel Sep 17 • 6
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning Paper • 2509.01644 • Published Sep 1 • 33
Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval Paper • 2509.09118 • Published Sep 11 • 8
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models +1 Jun 24, 2024 • 204
An Image is Worth 32 Tokens for Reconstruction and Generation Paper • 2406.07550 • Published Jun 11, 2024 • 60
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 654
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion Paper • 2310.08579 • Published Oct 12, 2023 • 17
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Paper • 2307.06949 • Published Jul 13, 2023 • 51