Rykov Elisei

lmeribal

AI & ML interests

NLP, Multimodality

lmeribal's activity

upvoted an article 9 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 70

upvoted a paper 11 days ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 17 days ago • 44

upvoted a paper 21 days ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 22 days ago • 85

liked a model 23 days ago

deepvk/RuModernBERT-base

Fill-Mask • Updated 23 days ago • 6.8k • 29

upvoted a paper 28 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 86

liked a model about 1 month ago

fava-uw/fava-model

Text Generation • Updated Dec 1, 2024 • 222 • 16

upvoted 2 papers about 1 month ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 58

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 112

upvoted a collection about 2 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 127

liked a model about 2 months ago

MoritzLaurer/ModernBERT-large-zeroshot-v2.0

Text Classification • Updated Jan 16 • 172k • 42

liked a dataset about 2 months ago

MERA-evaluation/WEIRD

Viewer • Updated Dec 10, 2024 • 824 • 197 • 1

upvoted a paper about 2 months ago

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Paper • 2501.08292 • Published Jan 14 • 17

upvoted a paper 2 months ago

Fine-grained Hallucination Detection and Editing for Language Models

Paper • 2401.06855 • Published Jan 12, 2024 • 4

liked a model 2 months ago

microsoft/phi-4

Text Generation • Updated 18 days ago • 502k • 1.9k

liked a dataset 2 months ago

fava-uw/fava-data

Viewer • Updated Dec 1, 2024 • 30.1k • 188 • 13

liked 2 datasets 3 months ago

microsoft/wiki_qa

Viewer • Updated Jan 4, 2024 • 29.3k • 3.84k • 56

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 98.6k • 759

liked a model 3 months ago

dslim/distilbert-NER

Token Classification • Updated Oct 8, 2024 • 28.3k • 30

liked 2 datasets 3 months ago

potsawee/wiki_bio_gpt3_hallucination

Viewer • Updated May 29, 2023 • 238 • 377 • 25

ServiceNow/repliqa

Viewer • Updated Feb 11 • 53.9k • 1.36k • 8