47 29 74

Kashif Rasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

liked a model 1 day ago

apple/aimv2-large-patch14-448

liked a dataset 3 days ago

Maple728/Time-300B

liked a Space 12 days ago

Salesforce/GIFT-Eval

View all activity

Articles

Organizations

kashif's activity

liked a model 1 day ago

apple/aimv2-large-patch14-448

Image Feature Extraction • Updated 1 day ago • 113 • 1

liked a dataset 3 days ago

Maple728/Time-300B

Preview • Updated Oct 22 • 3.32k • 9

liked a Space 12 days ago

Running

🥇

GIFT Eval

GIFT-Eval: A Benchmark for General Time Series Forecasting

liked a model about 1 month ago

jimmycarter/LibreFLUX

Text-to-Image • Updated about 1 month ago • 3.29k • 147

upvoted a paper about 1 month ago

A Rate-Distortion View of Uncertainty Quantification

Paper • 2406.10775 • Published Jun 16 • 1

updated a dataset about 2 months ago

kashif/chronos-preference

Preview • Updated Sep 26 • 38

upvoted a paper 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

updated 3 models 3 months ago

kashif/gkd-model

Updated Sep 8 • 3

kashif/pythia-1b-deduped-tldr-xpo

Updated Sep 7 • 4

kashif/pythia-1b-deduped-tldr-online-dpo

Updated Sep 6

upvoted a paper 3 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 7

upvoted a collection 3 months ago

Power-LM

Collection

Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17 • 15

upvoted 2 papers 3 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 20

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 115

commented 2 papers 4 months ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 16 •

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25 • 31 •

upvoted a paper 4 months ago

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Paper • 2405.21046 • Published May 31 • 3

updated a model 4 months ago

hf-internal-testing/llama-tokenizer

Updated Jul 29 • 12

New activity in hf-internal-testing/llama-tokenizer 4 months ago

chat template is none

#4 opened 4 months ago by

kashif

updated a model 4 months ago

kashif/gkd_openassistant-guanaco

Text Generation • Updated Jul 21 • 13

Kashif Rasul

AI & ML interests

Recent Activity

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

🧨 Diffusers welcomes Stable Diffusion 3

Patch Time Series Transformer in Hugging Face

Constitutional AI with Open LLMs

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model

Organizations

kashif's activity

GIFT Eval

chat template is none