Al-Hussein

AlHussein

AI & ML interests

Knowledge Distillation, Self-Supervised Learning, Semi-Supervised Learning

Recent Activity

upvoted a paper 3 days ago

Video Depth without Video Models

upvoted a paper 3 days ago

Phi-4 Technical Report

upvoted a paper 20 days ago

PaliGemma: A versatile 3B VLM for transfer

View all activity

Organizations

None yet

AlHussein's activity

upvoted 2 papers 3 days ago

Video Depth without Video Models

Paper • 2411.19189 • Published 20 days ago • 32

Phi-4 Technical Report

Paper • 2412.08905 • Published 6 days ago • 82

upvoted 2 papers 20 days ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10 • 68

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28 • 76

upvoted a paper 26 days ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15 • 61

liked a model about 2 months ago

timm/resnet50.a1_in1k

Image Classification • Updated Feb 10 • 25.6M • 35

upvoted 2 papers 2 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17 • 72

Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations

Paper • 2410.02762 • Published Oct 3 • 9

upvoted a paper 3 months ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16 • 39

upvoted 3 papers 5 months ago

upvoted a paper 6 months ago

AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis

Paper • 2406.08920 • Published Jun 13 • 7

upvoted 5 papers 7 months ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30 • 108

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation

Paper • 2404.14396 • Published Apr 22 • 18

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8 • 81

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 104

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3 • 100

upvoted 2 papers 8 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8 • 39

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23 • 70