Shikhar Singh

AxAI

AI & ML interests

Commonsense & Language Grounding

Recent Activity

reacted to merve's post with 🔥 about 3 hours ago
liked a model about 4 hours ago
HuggingFaceM4/Idefics3-8B-Llama3
liked a dataset 1 day ago
detection-datasets/coco

Organizations

None yet

AxAI's activity

upvoted an article about 2 months ago

Llama can now see and run on your device - welcome Llama 3.2

• 169
upvoted 3 articles 4 months ago

How NuminaMath Won the 1st AIMO Progress Prize

• 104

Docmatix - a huge dataset for Document Visual Question Answering

• 67

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By manu • 161
upvoted an article 5 months ago

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

• 177
upvoted 2 articles 6 months ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

• 210

Fine-tuning Llama 2 70B using PyTorch FSDP

• 14
upvoted 5 articles 7 months ago

Optimizing your LLM in production

• 15

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

• 166

Accelerating Document AI

• 36

A Dive into Pretraining Strategies for Vision-Language Models

• 48

Design choices for Vision Language Models in 2024

By gigant • 25