Santiago Garcia

santyzenith

AI & ML interests

Large language models, natural language processing, computer vision, Spanish large language models.

Recent Activity

liked a model about 15 hours ago

facebook/seamless-m4t-v2-large

liked a Space 1 day ago

hf-audio/open_asr_leaderboard

santyzenith's activity

upvoted a collection 1 day ago

RLHF

Collection

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated Oct 1 • 5

upvoted a collection 3 months ago

LLM2Vec

Collection

16 items • Updated Oct 8 • 36

upvoted 3 articles 3 months ago

Article

Train a Llama model from scratch

Jul 29

• 47

Article

Vision Language Models Explained

Apr 11

• 231

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 34

upvoted 2 papers 4 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19 • 38

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Paper • 2302.13750 • Published Feb 27, 2023 • 2

upvoted 3 articles 4 months ago

Article

Introduction to Graph Machine Learning

Jan 3, 2023

• 18

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 224

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 124

upvoted a paper 5 months ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17 • 50

upvoted an article 5 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 289

upvoted an article 6 months ago

Article

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

Oct 21, 2022

• 15

upvoted a paper 6 months ago

Tuna: Instruction Tuning using Feedback from Large Language Models

Paper • 2310.13385 • Published Oct 20, 2023 • 10

upvoted a collection 6 months ago

Knowledge distillation

Collection

88 items • Updated Feb 7 • 6

upvoted 2 articles 6 months ago

Article

Putting RL back in RLHF

Jun 12

• 65

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 129

upvoted 3 papers 6 months ago

Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 10

Estimating Knowledge in Large Language Models Without Generating a Single Token

Paper • 2406.12673 • Published Jun 18 • 7

A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models

Paper • 2406.11289 • Published Jun 17 • 5