4 52 20

sdtana

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

APOLLO: SGD-like Memory, AdamW-level Performance

upvoted a paper 10 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

updated a dataset 12 days ago

sdtana/aesthetic_anime_curated_8.5k

View all activity

Organizations

None yet

sdtana's activity

upvoted 2 papers 10 days ago

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published 14 days ago • 38

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 14 days ago • 111

updated a dataset 12 days ago

sdtana/aesthetic_anime_curated_8.5k

Preview • Updated 12 days ago • 59

upvoted a paper 14 days ago

Negative Token Merging: Image-based Adversarial Feature Guidance

Paper • 2412.01339 • Published 18 days ago • 21

New activity in sdtana/aesthetic_anime_curated_8.5k 19 days ago

delete readme

#1 opened 19 days ago by

sdtana

upvoted 3 papers 22 days ago

Adaptive Blind All-in-One Image Restoration

Paper • 2411.18412 • Published 23 days ago • 4

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published 23 days ago • 82

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis

Paper • 2411.17769 • Published 24 days ago • 7

upvoted a paper 25 days ago

Style-Friendly SNR Sampler for Style-Driven Generation

Paper • 2411.14793 • Published 28 days ago • 36

upvoted a paper about 1 month ago

SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Paper • 2411.05007 • Published Nov 7 • 16

liked a model 3 months ago

Laxhar/noobai-xl-EarlyAccess

Updated Nov 15 • 114

upvoted 3 papers 3 months ago

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Paper • 2409.11355 • Published Sep 17 • 28

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17 • 108

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

Paper • 2409.04410 • Published Sep 6 • 23

upvoted a paper 4 months ago

FLUX that Plays Music

Paper • 2409.00587 • Published Sep 1 • 31

liked a model 4 months ago

cledoux42/Ethnicity_Test_v003

Image Classification • Updated Apr 9, 2023 • 80.3k • 13

upvoted 2 papers 5 months ago

Efficient Training with Denoised Neural Weights

Paper • 2407.11966 • Published Jul 16 • 8

Vision language models are blind

Paper • 2407.06581 • Published Jul 9 • 82

upvoted 2 papers 6 months ago

Dataset Size Recovery from LoRA Weights

Paper • 2406.19395 • Published Jun 27 • 18

Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering

Paper • 2406.10208 • Published Jun 14 • 21