Melisa Russak

melisa

melisa-writer

AI & ML interests

I love definitions

melisa's activity

upvoted a paper 1 day ago

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published 7 days ago • 10

upvoted an article 2 days ago

Fine-tuning LLMs with Singular Value Decomposition

Article • Jun 2 • 8

upvoted a paper 5 days ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published 8 days ago • 37

upvoted 2 papers 22 days ago

Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse

Paper • 2410.21333 • Published 25 days ago • 9

Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation

Paper • 2410.18565 • Published 28 days ago • 42

upvoted a paper about 1 month ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17 • 88

upvoted a paper about 2 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30 • 53

upvoted a paper 2 months ago

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 88

upvoted 4 papers 3 months ago

upvoted an article 3 months ago

Using Writer Framework with Hugging Face Spaces

Article • Aug 20 • 30

upvoted a paper 5 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

upvoted 4 papers 6 months ago

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4 • 37

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26 • 22

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 85

Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 50

upvoted a collection 6 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 8 days ago • 497

upvoted a paper 6 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87