linkedin-xfact

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

lancerts authored a paper about 1 month ago

Liger Kernel: Efficient Triton Kernels for LLM Training

lancerts authored a paper about 1 month ago

AlphaPO -- Reward shape matters for LLM alignment

lancerts authored a paper about 1 month ago

LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding

View all activity

lancerts

authored 5 papers about 1 month ago

Liger Kernel: Efficient Triton Kernels for LLM Training

Paper • 2410.10989 • Published Oct 14, 2024 • 1

AlphaPO -- Reward shape matters for LLM alignment

Paper • 2501.03884 • Published Jan 7 • 2

LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding

Paper • 2508.01617 • Published Aug 3

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction

Paper • 2509.12464 • Published Sep 15

Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs

Paper • 2509.25779 • Published Sep 30 • 16

JW17

authored 2 papers 4 months ago

AlphaPO -- Reward shape matters for LLM alignment

Paper • 2501.03884 • Published Jan 7 • 2

Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

Paper • 2504.03380 • Published Apr 4

JW17

authored a paper 6 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

amphora

authored a paper 6 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

eunkey

authored a paper 8 months ago

Sightation Counts: Leveraging Sighted User Feedback in Building a BLV-aligned Dataset of Diagram Descriptions

Paper • 2503.13369 • Published Mar 17 • 7

amphora

authored a paper 8 months ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 26

nlee-208

updated a dataset 10 months ago

linkedin-xfact/ultrachat_200k_dedup

Viewer • Updated Jan 9 • 207k • 3

JW17

authored 2 papers 11 months ago

Stable Language Model Pre-training by Reducing Embedding Variability

Paper • 2409.07787 • Published Sep 12, 2024

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Paper • 2410.18027 • Published Oct 23, 2024

nlee-208

authored 3 papers about 1 year ago

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Paper • 2410.18027 • Published Oct 23, 2024

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10, 2024 • 16

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Paper • 2406.05761 • Published Jun 9, 2024 • 3

JW17

authored 2 papers over 1 year ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10, 2024 • 16

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 69

nlee-208

authored a paper over 1 year ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 69

AI & ML interests

Recent Activity

Team members 7

linkedin-xfact's activity