Prince Canuma's picture

Prince Canuma

prince-canuma

·

AI & ML interests

None yet

Recent Activity

updated a model 9 minutes ago

mlx-community/Mistral-Small-24B-Instruct-2501-4bit

published a model 32 minutes ago

mlx-community/Mistral-Small-24B-Instruct-2501-8bit

published a model 32 minutes ago

mlx-community/Mistral-Small-24B-Instruct-2501-6bit

View all activity

Organizations

prince-canuma's activity

upvoted a paper 6 days ago

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Paper • 2406.07394 • Published Jun 11, 2024 • 27

upvoted a paper 8 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 9 days ago • 61

upvoted 2 papers 10 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 14 days ago • 100

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published 13 days ago • 16

upvoted 7 papers 13 days ago

Multimodal LLMs Can Reason about Aesthetics in Zero-Shot

Paper • 2501.09012 • Published 15 days ago • 10

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Paper • 2501.08828 • Published 15 days ago • 30

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 14 days ago • 36

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published 14 days ago • 33

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published 14 days ago • 23

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published 14 days ago • 47

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 14 days ago • 66

upvoted 6 papers 15 days ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 19 days ago • 29

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 22 days ago • 53

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 19 days ago • 79

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 17 days ago • 89

A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following

Paper • 2501.08187 • Published 16 days ago • 24

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 16 days ago • 271

upvoted a paper 16 days ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published 20 days ago • 42

upvoted 2 papers 17 days ago

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published 23 days ago • 53

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published 20 days ago • 59