Shikhar Singh's picture

47 376

Shikhar Singh

AxAI

·

axe--

AI & ML interests

Commonsense & Language Grounding

Recent Activity

reacted to merve's post with 🔥 about 3 hours ago

liked a model about 4 hours ago

HuggingFaceM4/Idefics3-8B-Llama3

liked a dataset 1 day ago

detection-datasets/coco

Organizations

None yet

AxAI's activity

upvoted a paper 10 days ago

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

Paper • 2308.12966 • Published Aug 24, 2023 • 7

upvoted a collection 10 days ago

LLaVA-Critic

as a general evaluator for assessing model performance • 6 items • Updated Oct 6 • 8

upvoted an article about 2 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 169

upvoted a paper 2 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3 • 82

upvoted a collection 3 months ago

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated Sep 18 • 45

upvoted 3 articles 4 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11

• 104

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18

• 67

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5

• 161

upvoted a collection 5 months ago

Gemma 2 Release

15 items • Updated Sep 9 • 197

upvoted an article 5 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 177

upvoted a collection 6 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 347

upvoted a paper 6 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23 • 35

upvoted 2 articles 6 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 210

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

Sep 13, 2023

• 14

upvoted a collection 6 months ago

Yi-1.5 (2024/05)

10 items • Updated May 20 • 90

upvoted 5 articles 7 months ago

Article

Optimizing your LLM in production

Sep 15, 2023

• 15

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 166

Article

Accelerating Document AI

Nov 21, 2022

• 36

Article

A Dive into Pretraining Strategies for Vision-Language Models

Feb 3, 2023

• 48

Article

Design choices for Vision Language Models in 2024

By

•

Apr 16

• 25