Dhruvajyoti Sarma

dhruva-sarma

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

updated a collection about 1 month ago

Computer vision

updated a collection about 1 month ago

Computer vision

Organizations

None yet

dhruva-sarma's activity

upvoted an article about 1 month ago

Article

How to build a custom text classifier without days of human labeling

•

Oct 17

• 55

upvoted a paper about 1 month ago

A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond

Paper • 2410.02362 • Published Oct 3 • 16

upvoted a paper about 2 months ago

Not All LLM Reasoners Are Created Equal

Paper • 2410.01748 • Published Oct 2 • 27

upvoted 3 papers 2 months ago

upvoted a collection 2 months ago

Parler-TTS: fully open-source high-quality TTS

Collection

If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. • 7 items • Updated Aug 8 • 46

upvoted a paper 2 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 72

upvoted 3 papers 3 months ago

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published Aug 27 • 13

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Paper • 2408.14717 • Published Aug 27 • 24

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 118

upvoted 3 papers 4 months ago

Scaling Retrieval-Based Language Models with a Trillion-Token Datastore

Paper • 2407.12854 • Published Jul 9 • 29

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 52

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 128

upvoted 2 articles 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 265

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11

• 104

upvoted 4 papers 5 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28 • 95

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26 • 12

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26 • 47

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24 • 67