Raúl Garrido's picture

68 414

Raúl Garrido

happybydefault

·

https://happybydefault.com

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

allenai/OLMo-2-1124-7B-RM

liked a dataset about 19 hours ago

allenai/RLVR-GSM-MATH-IF-Mixed-Constraints

liked a dataset about 19 hours ago

allenai/tulu-3-sft-olmo-2-mixture

View all activity

Organizations

happybydefault's activity

upvoted a collection about 19 hours ago

OLMo 2

Artifacts for the second set of OLMo models. • 17 items • Updated about 24 hours ago • 27

upvoted a collection 4 days ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated 5 days ago • 55

upvoted a collection 10 days ago

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 7 items • Updated 8 days ago • 38

upvoted an article 14 days ago

Article

Releasing the largest multilingual open pretraining dataset

By

•

14 days ago

• 95

upvoted a collection 17 days ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated 4 days ago • 74

upvoted a paper 25 days ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3 • 50

upvoted a collection 26 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated about 16 hours ago • 97

upvoted 8 collections about 1 month ago

LongVU

7 items • Updated 27 days ago • 27

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated Oct 24 • 26

C4AI Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Aug 6 • 50

Solar Pro

The most intelligent LLM on a single GPU • 4 items • Updated 13 days ago • 14

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated 9 days ago • 162

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 23 days ago • 92

MulitUI

MultiUI: 7M multimodal UI instructions • 5 items • Updated Oct 19 • 7

LayerSkip

Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated 6 days ago • 43

upvoted a paper about 1 month ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2 • 21

upvoted 2 collections about 1 month ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 143

Gemma-APS Release

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Oct 15 • 19

upvoted an article about 2 months ago

Article

Faster Assisted Generation with Dynamic Speculation

Oct 8

• 34

upvoted a paper about 2 months ago

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 8