7 60 18

Nikita Sushko

chameleon-lizard

chameleon-lizard

AI & ML interests

NLP, Multilingual Models, Multiagent Systems

Recent Activity

liked a dataset 12 days ago

HuggingFaceFW/finewiki

liked a model 19 days ago

ai-forever/FRIDA

upvoted a paper 22 days ago

Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs

View all activity

Organizations

upvoted a paper 22 days ago

Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs

Paper • 2510.11288 • Published 29 days ago • 46

upvoted a paper 25 days ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published Oct 6 • 111

upvoted a paper about 1 month ago

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features

Paper • 2509.22033 • Published Sep 26 • 18

upvoted a paper 2 months ago

<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs

Paper • 2509.08358 • Published Sep 10 • 13

upvoted 4 papers 3 months ago

CAMAR: Continuous Actions Multi-Agent Routing

Paper • 2508.12845 • Published Aug 18 • 7

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Paper • 2508.11383 • Published Aug 15 • 40

HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds

Paper • 2508.12782 • Published Aug 18 • 25

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7 • 46

upvoted 4 papers 4 months ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 123

A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models

Paper • 2507.13563 • Published Jul 17 • 52

RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization

Paper • 2507.12142 • Published Jul 16 • 36

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8 • 118

upvoted an article 4 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 718

upvoted 3 papers 4 months ago

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Paper • 2507.02321 • Published Jul 3 • 39

Listener-Rewarded Thinking in VLMs for Image Preferences

Paper • 2506.22832 • Published Jun 28 • 23

Learning to Skip the Middle Layers of Transformers

Paper • 2506.21103 • Published Jun 26 • 18

upvoted 4 papers 5 months ago

Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models

Paper • 2506.19103 • Published Jun 23 • 42

DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization

Paper • 2505.20975 • Published May 27 • 36

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 131

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

Nikita Sushko

AI & ML interests

Recent Activity

Organizations

chameleon-lizard's activity

SmolLM3: smol, multilingual, long-context reasoner