Anthonny Olime's picture

18 135

Anthonny Olime

Aviv-anthonnyolime

·

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

google/Gemma-Embeddings-v1.0

liked a model about 19 hours ago

tiiuae/Falcon3-1B-Instruct-GGUF

liked a model about 19 hours ago

tiiuae/Falcon3-1B-Instruct

View all activity

Organizations

Aviv-anthonnyolime's activity

upvoted a paper 2 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 4 days ago • 118

upvoted a paper 5 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 6 days ago • 82

upvoted 2 papers 8 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 9 days ago • 62

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 8 days ago • 54

upvoted a collection 21 days ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 15 items • Updated 5 days ago • 52

upvoted a collection 27 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 16 days ago • 191

upvoted a collection 30 days ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 224

upvoted a collection about 1 month ago

AMD-OLMo

AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated Oct 31 • 17

upvoted a collection about 2 months ago

Stable Diffusion 3.5

6 items • Updated Oct 29 • 109

upvoted 2 papers about 2 months ago

Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

Paper • 2410.15316 • Published Oct 20 • 10

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

Paper • 2410.11190 • Published Oct 15 • 20

upvoted 2 papers 2 months ago

CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs

Paper • 2410.01999 • Published Oct 2 • 10

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 167

upvoted a collection 2 months ago

SAM 2.1

Collection of SAM 2.1 model checkpoints • 8 items • Updated Oct 6 • 8

upvoted 2 papers 4 months ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5 • 26

Language Model Can Listen While Speaking

Paper • 2408.02622 • Published Aug 5 • 37

upvoted 2 articles 5 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 224

Article

How I train a LoRA: m3lt style training overview

By

•

Jul 1

• 47