Josiah Aklilu

josaklil-ai

https://josaklil-ai.github.io/

AI & ML interests

computer vision & language for enhancing surgical practice

Organizations

None yet

josaklil-ai's activity

upvoted a paper 5 days ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published 7 days ago • 21

authored 2 papers 13 days ago

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published 17 days ago • 49

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

Paper • 2501.03225 • Published 24 days ago • 7

upvoted a paper 15 days ago

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published 17 days ago • 49

upvoted a paper about 2 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 139

authored a paper 7 months ago

Revisiting Active Learning in the Era of Vision Foundation Models

Paper • 2401.14555 • Published Jan 25, 2024