1 37 6

Irina Abdullaeva

IrinaAbdullaeva

IrinaArmstrong

AI & ML interests

NLP, DL, Multi-modality

Recent Activity

upvoted a paper 4 days ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

upvoted a paper 25 days ago

CLEAR: Character Unlearning in Textual and Visual Modalities

upvoted a paper 25 days ago

LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior

View all activity

Organizations

IrinaAbdullaeva's activity

upvoted a paper 4 days ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 103

upvoted 4 papers 25 days ago

upvoted a paper about 2 months ago

Visual Context Window Extension: A New Perspective for Long Video Understanding

Paper • 2409.20018 • Published Sep 30 • 9

upvoted 3 papers 2 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19 • 24

LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published Sep 18 • 30

On the limits of agency in agent-based models

Paper • 2409.10568 • Published Sep 14 • 12

updated a model 2 months ago

IrinaAbdullaeva/VideoLLava-demo

Updated Sep 15

upvoted a paper 3 months ago

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2 • 94

liked a Space 4 months ago

Running on CPU Upgrade

817

🚀

Model Memory Utility

upvoted 2 papers 4 months ago

CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis

Paper • 2407.13301 • Published Jul 18 • 54

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding

Paper • 2407.15754 • Published Jul 22 • 19

upvoted 4 papers 5 months ago

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5 • 31

Unveiling Encoder-Free Vision-Language Models

Paper • 2406.11832 • Published Jun 17 • 49

AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5 • 26

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

updated 2 Spaces 5 months ago

Running

🥇

MindShift

Running

🥇