8 3 6

Jaesun Park

jaesun

jaesuny

AI & ML interests

None yet

Recent Activity

authored a paper 8 days ago

HyperCLOVA X Technical Report

authored a paper 8 days ago

Kanana: Compute-efficient Bilingual Language Models

liked a Space 8 days ago

nanotron/ultrascale-playbook

View all activity

Organizations

jaesun's activity

authored 2 papers 8 days ago

HyperCLOVA X Technical Report

Paper • 2404.01954 • Published Apr 2, 2024 • 23

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published 9 days ago • 59

liked a Space 8 days ago

2.1k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 4 months ago

stas/ml-engineering-book

Updated Jan 22 • 16

upvoted a paper 6 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 123

liked a model 12 months ago

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 1.21k • 2.28k

upvoted 2 papers about 1 year ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 142

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29, 2024 • 49

liked a dataset about 2 years ago

bigcode/the-stack-dedup

Viewer • Updated Aug 17, 2023 • 237M • 8.65k • 344

liked a model over 2 years ago

bigscience/bloom

Text Generation • Updated Jul 28, 2023 • 1.1M • 4.86k

liked a Space almost 3 years ago

5.55k

DALL·E mini

🥑