Jiwoo Hong's picture

Jiwoo Hong

JW17

·

https://jiwooya1000.github.io/

AI & ML interests

NLP, LLM, and any related topics

Recent Activity

upvoted a paper 27 days ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

updated a model 27 days ago

JW17/L31-8B-It-ICRM-3Epoch-v0.1

published a model 28 days ago

JW17/L31-8B-It-ICRM-3Epoch-v0.1

View all activity

Organizations

upvoted a paper 27 days ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Paper • 2510.04230 • Published Oct 5 • 26

upvoted 2 papers 6 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 120

upvoted a paper 8 months ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 26

upvoted a collection about 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 652

upvoted an article about 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 425

upvoted an article over 1 year ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 106

upvoted a paper over 1 year ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10, 2024 • 16

upvoted a collection over 1 year ago

MaPO

This collection includes the models and datasets as a part of the MaPO release. • 9 items • Updated Jun 12, 2024 • 5

upvoted a paper over 1 year ago

Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29, 2024 • 10

upvoted 2 articles over 1 year ago

Article

How to Finetune phi-3 on MacBook Pro

By

•

Apr 24, 2024

• 68

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 239

upvoted 2 collections over 1 year ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 864

Zephyr ORPO

Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12, 2024 • 18

upvoted a paper over 1 year ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 69

upvoted a collection over 1 year ago

ORPO

This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model". • 5 items • Updated Apr 12, 2024 • 11