Joao Pedro Silva Dias Moura Mesquita's picture

Joao Pedro Silva Dias Moura Mesquita

inkasaras

·

joaopedrosdmm

AI & ML interests

None yet

Recent Activity

liked a model about 20 hours ago

deepseek-ai/Janus-1.3B

upvoted an article about 20 hours ago

Janus Pro: DeepSeek's Revolutionary Multimodal AI Model

liked a model 3 days ago

albert/albert-base-v2

View all activity

Organizations

None yet

inkasaras's activity

upvoted an article about 20 hours ago

Article

Janus Pro: DeepSeek's Revolutionary Multimodal AI Model

By

•

3 days ago

• 27

upvoted a collection 7 days ago

Albertina

Albertina family of encoders for Portuguese • 9 items • Updated Jul 26, 2024 • 2

upvoted an article 7 days ago

Article

Mastering Long Contexts in LLMs with KVPress

By

•

7 days ago

• 58

upvoted a collection 23 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 13 days ago • 252

upvoted 3 collections about 1 month ago

🍃 MINT-1T

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 58

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated 29 days ago • 42

DeepSeek-VL2

4 items • Updated Dec 18, 2024 • 40

upvoted a collection about 2 months ago

GTE models

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated 9 days ago • 20

upvoted 2 collections 2 months ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 24 days ago • 55

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 24 days ago • 77

upvoted a collection 3 months ago

Sparsh

Models and datasets for Sparsh: Self-supervised touch representations for vision-based tactile sensing • 15 items • Updated Oct 24, 2024 • 12

upvoted a collection 4 months ago

Core ML Segment Anything 2

8 items • Updated Oct 4, 2024 • 27

upvoted a paper 4 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 73

upvoted an article 4 months ago

Article

Exploring the Daily Papers Page on Hugging Face

Sep 23, 2024

• 47

upvoted a paper 4 months ago

MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling

Paper • 2409.16160 • Published Sep 24, 2024 • 33

upvoted an article 4 months ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

• 38

upvoted 3 collections 4 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 486

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 269

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated 17 days ago • 66

upvoted a collection 6 months ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21, 2024 • 70