Suresh Veeragoni's picture

Suresh Veeragoni

veeragoni

·

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago

microsoft/orca-agentinstruct-1M-v1

liked a model 5 days ago

unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF

liked a model 6 days ago

Nexusflow/Athene-V2-Agent

Organizations

veeragoni's activity

upvoted a collection 2 months ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 15 items • Updated 19 days ago • 76

upvoted a collection 3 months ago

⛈️ Llama-3.1 Storm Models

Fine-tuned Llama 3.1 8B model with superior reasoning, conversation abilities, and function calling! • 3 items • Updated Aug 25 • 15

upvoted 2 collections 4 months ago

Llama 3.1

12 items • Updated Jul 23 • 12

Tulu V2.5 Suite

A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated 7 days ago • 14

upvoted 2 collections 5 months ago

4M Models

Multimodal models from https://4m.epfl.ch/ • 14 items • Updated Jun 14 • 29

Magpie-Pro Datasets (Llama-3)

Dataset built with Meta Llama 3 70B. Models are fine-tuned from Llama 3 8B. • 6 items • Updated Sep 20 • 16

upvoted a collection 6 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 137

upvoted a collection 10 months ago

Comparing DPO with IPO and KTO

A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 9 • 31

upvoted a paper 11 months ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

upvoted 2 collections 11 months ago

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 65

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 119