Fabio Dias Rollo

fabiodr

AI & ML interests

Image synthesis, computer vision, physics simulation

Recent Activity

liked a Space 2 days ago

tencent/Hunyuan3D-2

liked a Space 2 days ago

AP123/Janus-Pro-7b

liked a model 2 days ago

m-a-p/YuE-s1-7B-anneal-en-cot

Organizations

None yet

fabiodr's activity

upvoted an article 4 days ago

Article

We now support VLMs in smolagents!

6 days ago

• 63

upvoted a collection 6 days ago

ColSmolVLM

Pre-trained checkpoints for the ColVision models with a ColSmolVLM backbone. • 2 items • Updated 7 days ago • 1

upvoted a collection 7 days ago

Eagle 2

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 7 days ago • 27

upvoted a paper 17 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 21 days ago • 83

upvoted a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 344

upvoted a collection about 2 months ago

Meta Motivo

A first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks. • 6 items • Updated Dec 10, 2024 • 9

upvoted a paper about 2 months ago

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 56

upvoted a collection 2 months ago

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 56

upvoted a paper 2 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

upvoted a collection 2 months ago

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 31

upvoted a paper 3 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 93

upvoted a collection 3 months ago

LipSync and Face Operations

13 items • Updated 24 days ago • 41

upvoted a paper 3 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 141

upvoted a collection 3 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 268

upvoted 4 papers 3 months ago

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 114

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2, 2024 • 12

PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

Paper • 2404.13026 • Published Apr 19, 2024 • 24

AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21, 2024 • 59

upvoted 2 articles 3 months ago

Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

Oct 22, 2024

• 67

Article

Deploying Speech-to-Speech on Hugging Face

Oct 22, 2024

• 36