Art Atk

ArtAtk

AI & ML interests

Multimodal Models

Recent Activity

liked a Space about 18 hours ago

ByteDance/SeedEdit-APP

liked a Space about 20 hours ago

Lightricks/LTX-Video-Playground

Organizations

None yet

ArtAtk's activity

upvoted a paper 1 day ago

Stable Flow: Vital Layers for Training-Free Image Editing

Paper • 2411.14430 • Published 3 days ago • 10

upvoted 2 papers 4 days ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published 8 days ago • 43

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published 7 days ago • 13

upvoted a paper 18 days ago

Controlling Language and Diffusion Models by Transporting Activations

Paper • 2410.23054 • Published 25 days ago • 16

upvoted a paper 27 days ago

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published Oct 23 • 49

upvoted 3 papers about 1 month ago

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Paper • 2410.13276 • Published Oct 17 • 25

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Paper • 2410.05603 • Published Oct 8 • 11

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Paper • 2410.10306 • Published Oct 14 • 52

upvoted 7 papers about 2 months ago

Pyramidal Flow Matching for Efficient Video Generative Modeling

Paper • 2410.05954 • Published Oct 8 • 37

IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Paper • 2410.07171 • Published Oct 9 • 41

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

Distilling an End-to-End Voice Assistant Without Instruction Training Data

Paper • 2410.02678 • Published Oct 3 • 22

Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models

Paper • 2410.02416 • Published Oct 3 • 25

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3 • 36

Pixel-Space Post-Training of Latent Diffusion Models

Paper • 2409.17565 • Published Sep 26 • 19

upvoted 5 papers 2 months ago

DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control

Paper • 2409.12192 • Published Sep 18 • 4

V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians

Paper • 2409.13648 • Published Sep 20 • 9

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published Sep 20 • 67

FlexiTex: Enhancing Texture Generation with Visual Guidance

Paper • 2409.12431 • Published Sep 19 • 11

Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization

Paper • 2409.12903 • Published Sep 19 • 21