Yedidia AGNIMO

Yedson54

AI & ML interests

Reinforcement Learning, Federated Learning

Recent Activity

updated a collection about 1 month ago

Transfer Learning - FineTuning SFT - Instruction

updated a collection about 1 month ago

Model Training - Learning Scheme

updated a collection about 1 month ago

Reinforcement Learning (RL / RLHF)

Organizations

Yedson54's activity

upvoted 3 papers about 2 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30 • 53

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20 • 48

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted 5 papers 2 months ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16 • 39

upvoted a paper 3 months ago

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges

Paper • 2408.08946 • Published Aug 16 • 11

upvoted 9 papers 4 months ago

Patch-Level Training for Large Language Models

Paper • 2407.12665 • Published Jul 17 • 16

Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation

Paper • 2407.10817 • Published Jul 15 • 13

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

Paper • 2407.10457 • Published Jul 15 • 22

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers

Paper • 2407.09413 • Published Jul 12 • 9

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12 • 60

MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Paper • 2407.09435 • Published Jul 12 • 20

H2O-Danube3 Technical Report

Paper • 2407.09276 • Published Jul 12 • 18

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5 • 31

Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7 • 7

upvoted 2 papers 5 months ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15 • 82

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 24