Shahrukh Khan

shahrukhx01 · https://github.com/shahrukhx01

AI & ML interests

NLP

Recent Activity

liked a model 7 days ago

pipecat-ai/smart-turn-v3

upvoted a collection 8 days ago

liked a model 8 days ago

facebook/MobileLLM-R1-950M

upvoted a collection 8 days ago

MobileLLM-R1

MobileLLM-R1, a series of sub-billion parameter reasoning models • 6 items • Updated 7 days ago • 18

upvoted a collection about 1 month ago

Gemma 3 Release

28 items • Updated Aug 11 • 505

upvoted a collection 2 months ago

Encoders vs Decoders: the Ettin Suite

A collection of SOTA, open-data, paired encoder-only and decoder-only models ranging from 17M to 1B params. See the paper at https://arxiv.org/abs/250 • 32 items • Updated Jul 16 • 19

upvoted a collection 3 months ago

GLiNER-X

A multilingual Named Entity Recognition (NER) model capable of identifying any entity type. • 6 items • Updated Jun 24 • 21

upvoted an article 3 months ago

Article

Transformers backend integration in SGLang

By … and 4 others • Jun 23 • 53

upvoted 3 collections 4 months ago

Qwen3-Reranker

3 items • Updated Jul 21 • 63

Qwen3-Embedding

6 items • Updated Jul 21 • 128

Falcon-H1

Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 38 items • Updated 8 days ago • 54

upvoted an article 4 months ago

Article

🥬 LettuceDetect Goes Multilingual: Fine-tuning EuroBERT on Synthetic Translations

By … and 1 other • May 19 • 9

upvoted an article 5 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024 • 366

upvoted 2 collections 5 months ago

Deepseek Papers

Deepseek papers collection • 24 items • Updated about 8 hours ago • 273

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated Jul 10 • 209

upvoted 8 collections 6 months ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated Apr 10 • 100

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Jul 1 • 76

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 624

Nomic Embed Multimodal

Multimodal models allowing you to search over interleaved text, PDFs, charts, and images! • 16 items • Updated Jun 3 • 24

Orpheus TTS

TTS Towards Human-Sounding Speech • 2 items • Updated Mar 18 • 70

Zonos-v0.1

3 items • Updated Feb 12 • 29

Ultravox v0.5

Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone. • 4 items • Updated 11 days ago • 19

reranking series v2

V2 crispy rerank series • 3 items • Updated Jun 25 • 24