1 21 126

peng

superpeng

AI & ML interests

None yet

Recent Activity

upvoted an article 14 days ago

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

liked a dataset 18 days ago

O1-OPEN/OpenO1-SFT

liked a dataset 23 days ago

medalpaca/medical_meadow_wikidoc

View all activity

Organizations

None yet

superpeng's activity

upvoted an article 14 days ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19

• 74

upvoted a collection about 2 months ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12 • 11

upvoted 2 papers 4 months ago

HelpSteer2: Open-source dataset for training top-performing reward models

Paper • 2406.08673 • Published Jun 12 • 16

Xwin-LM: Strong and Scalable Alignment Practice for LLMs

Paper • 2405.20335 • Published May 30 • 17

upvoted a collection 5 months ago

Biomedical NLP papers

Collection

Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 171 items • Updated 12 days ago • 35

upvoted 2 papers 5 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 160

Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10 • 52

upvoted a collection 6 months ago

Tulu 2 Llama 3 Update

Collection

Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5). • 12 items • Updated Aug 15 • 2

upvoted 2 papers 7 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 66

upvoted a paper 8 months ago

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

Paper • 2309.07430 • Published Sep 14, 2023 • 27

upvoted 2 articles 8 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 125

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 68

upvoted 2 papers 9 months ago

MathScale: Scaling Instruction Tuning for Mathematical Reasoning

Paper • 2403.02884 • Published Mar 5 • 15

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 183

upvoted a paper 10 months ago

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26 • 26

upvoted a collection 10 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated 4 days ago • 326

upvoted 3 papers 10 months ago

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Paper • 2402.10524 • Published Feb 16 • 22

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77

How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15 • 40