Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Hasan Arif's picture

3 19 6

Hasan Arif

hasanar1f

21world's profile picture

·

hasanar1f
kazi-hasan-ibn-arif-8b78a61a3

AI & ML interests

Efficient training and inference

Organizations

hasanar1f 's collections 4

Multi Agent based Medical Assistant for Edge Devices

Paper • 2503.05397 • Published Mar 7 • 7

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 66
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers

Paper • 2501.02393 • Published Jan 4 • 8
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published Jan 3 • 34
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published Dec 19, 2024 • 16

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23 • 23
M3: 3D-Spatial MultiModal Memory

Paper • 2503.16413 • Published Mar 20 • 15
Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 73

ML Optimization Papers

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16 • 25
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Paper • 2501.06842 • Published Jan 12 • 16
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 53

Multi Agent based Medical Assistant for Edge Devices

Paper • 2503.05397 • Published Mar 7 • 7

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23 • 23
M3: 3D-Spatial MultiModal Memory

Paper • 2503.16413 • Published Mar 20 • 15
Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 73

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 66
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers

Paper • 2501.02393 • Published Jan 4 • 8
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published Jan 3 • 34
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published Dec 19, 2024 • 16

ML Optimization Papers

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16 • 25
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Paper • 2501.06842 • Published Jan 12 • 16
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 53

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs