krishna praveen's picture

krishna praveen

krishnapraveen

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

physical-intelligence/fast

liked a model 4 days ago

hexgrad/Kokoro-82M

liked a Space 4 days ago

hexgrad/Kokoro-TTS

View all activity

Organizations

None yet

krishnapraveen's activity

upvoted a collection 10 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 1 day ago • 233

upvoted a paper 19 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 24 days ago • 94

upvoted a collection about 2 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 70

upvoted a paper 2 months ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 46

upvoted a collection 4 months ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 1 day ago • 51

upvoted a paper 4 months ago

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 85

upvoted a collection 5 months ago

CogVideo

10 items • Updated Nov 27, 2024 • 47

upvoted 3 papers 5 months ago

MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model

Paper • 2408.10198 • Published Aug 19, 2024 • 33

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Paper • 2408.01584 • Published Aug 2, 2024 • 8

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Paper • 2408.02555 • Published Aug 5, 2024 • 29

upvoted a collection 6 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 640

upvoted a paper 6 months ago

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

upvoted an article 6 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18, 2024

• 72

upvoted 3 collections 6 months ago

xLAM models

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 29 days ago • 45

LLaVa-Interleave

LLaVa models that extends the model capabilities to Multi-image, Multi-frame (videos), Multi-patch (single-image) scenarios. • 3 items • Updated Jul 10, 2024 • 14

Navarasa 2.0 Models

Collection of models Navarasa 2.0 Models finetuned with Gemma on 15 Indian languages • 5 items • Updated Mar 18, 2024 • 18

upvoted a paper 6 months ago

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Paper • 2407.11398 • Published Jul 16, 2024 • 9

upvoted an article 6 months ago

Article

Faster fine-tuning using TRL & Unsloth

Jan 10, 2024

• 44

upvoted a collection 6 months ago

Optimizing diffusion models

Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 21 items • Updated Aug 22, 2024 • 19

upvoted a collection 7 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 225