6 127 48

rotem israeli

irotem98

https://rotem154154.github.io

rotem154154

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

Efficient-Large-Model/Sana_1600M_1024px

liked a model about 15 hours ago

Efficient-Large-Model/Sana_1600M_512px

liked a Space 7 days ago

timm/leaderboard

View all activity

Organizations

None yet

irotem98's activity

upvoted 6 papers 10 days ago

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Paper • 2411.08380 • Published 11 days ago • 24

SAMPart3D: Segment Any Part in 3D Objects

Paper • 2411.07184 • Published 13 days ago • 25

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published 12 days ago • 24

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published 13 days ago • 21

Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published 12 days ago • 13

Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings

Paper • 2411.08017 • Published 12 days ago • 11

upvoted 2 papers 19 days ago

GPT or BERT: why not both?

Paper • 2410.24159 • Published 24 days ago • 13

Randomized Autoregressive Visual Generation

Paper • 2411.00776 • Published 23 days ago • 17

upvoted an article 22 days ago

Article

Trick or ResNet Treat

•

24 days ago

• 3

upvoted a paper 23 days ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published 27 days ago • 74

upvoted a collection 24 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated 3 days ago • 177

upvoted 2 papers 26 days ago

GPT-4o System Card

Paper • 2410.21276 • Published 30 days ago • 79

A Survey of Small Language Models

Paper • 2410.20011 • Published 30 days ago • 38

upvoted a collection 27 days ago

timm tiny test models

Collection

A collection of very small (~300-500k parameter) models at 160x160 resolution, for testing purposes. Trained on ImageNet-1k. • 13 items • Updated Oct 2 • 3

upvoted 6 papers about 1 month ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23 • 34

WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23 • 17

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published Oct 22 • 43