Chujie Zheng's picture

Chujie Zheng

chujiezheng

·

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

upvoted an article 28 days ago

liked a dataset about 1 month ago

KingNish/reasoning-base-20k

upvoted a paper about 1 month ago

Organizations

chujiezheng's activity

upvoted an article 28 days ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

By

•

28 days ago

• 8

upvoted a paper about 1 month ago

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17 • 14

upvoted a paper 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 136

upvoted a paper 3 months ago

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Paper • 2408.08072 • Published Aug 15 • 32

upvoted a collection 5 months ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 19 days ago • 158

upvoted a paper 7 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25 • 11

upvoted 3 collections 7 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated about 1 month ago • 24

Weak-to-Strong Extrapolation Expedites Alignment

Better aligned models obtained by weak-to-strong model extrapolation (ExPO) • 25 items • Updated 25 days ago • 16

Model Checkpoints in the ExPO Paper

15 items • Updated May 19 • 2

upvoted 2 collections 10 months ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Sep 18 • 206

Quyen

State-of-the-arts General LLMs - based on Qwen1.5 • 26 items • Updated Feb 13 • 12

upvoted a paper 11 months ago

Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

Paper • 2312.04724 • Published Dec 7, 2023 • 20

upvoted a collection about 1 year ago

Pythia Scaling Suite

Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Nov 21, 2023 • 24