
Dooroo_2508: A Chatbot Model Specialized in Yeosu Tourism

This model is fine-tuned from unsloth/Qwen3-4B-Instruct-2507 to carry specialized knowledge of tourism and island information for Yeosu, South Korea.

Training used the Unsloth library with LoRA (Low-Rank Adaptation) for efficiency. The goal is to generate natural, accurate answers to questions about traveling in Yeosu.

🛠️ Training Procedure

1. Base Model

  • Model: unsloth/Qwen3-4B-Instruct-2507
  • Library: Unsloth, used to reduce memory usage and significantly speed up training.

2. Dataset

Training used the two datasets below, merged together. After the train and test splits of each dataset were combined, the train dataset was shuffled randomly so that the model would not become biased toward a particular topic.

  • kingkim/yeosu_tour: data on Yeosu tourist attractions
  • kingkim/yeosu_island: data on the islands of Yeosu
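
The merge-and-shuffle step described above can be sketched as follows. This is a minimal illustration that uses plain Python lists as stand-ins for the dataset splits; the record contents, the `merge_and_shuffle` helper, and the seed are hypothetical and are not taken from the original training script. With the Hugging Face `datasets` library, the corresponding calls would be `concatenate_datasets` and `Dataset.shuffle(seed=...)`.

```python
import random


def merge_and_shuffle(tour_split, island_split, seed=42):
    """Concatenate the same split from both datasets, then shuffle it.

    The seed value is an assumption; the model card does not state one.
    """
    merged = list(tour_split) + list(island_split)
    random.Random(seed).shuffle(merged)  # seeded so the order is reproducible
    return merged


# Hypothetical stand-in records, not real rows from the datasets.
tour_train = [{"source": "yeosu_tour", "id": i} for i in range(4)]
island_train = [{"source": "yeosu_island", "id": i} for i in range(4)]

train = merge_and_shuffle(tour_train, island_train)
print(len(train), [row["source"] for row in train])
```

Shuffling after concatenation interleaves the two sources, so a training batch is unlikely to contain examples from only one dataset.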

3. Hyperparameters

The main hyperparameters used for training are listed below.

LoRA Settings

| Parameter | Value | Description |
| --- | --- | --- |
| r | 16 | Rank of the LoRA matrices |
| lora_alpha | 32 | LoRA scaling factor |
| lora_dropout | 0.05 | Dropout rate of the LoRA layers |
| target_modules | q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj | Modules to which LoRA is applied |
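
To make the LoRA settings concrete, the sketch below computes two quantities that follow from them under the standard LoRA formulation, where an adapted weight is W + (lora_alpha / r) · B · A. The layer dimensions (1024 × 1024) and both helper functions are illustrative assumptions; they are not the actual shapes of the Qwen3 projection layers and do not come from the training script.

```python
def lora_scaling(r: int, lora_alpha: int) -> float:
    """Scale applied to the low-rank update B @ A in standard LoRA."""
    return lora_alpha / r


def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters added to one linear layer: A is (r x d_in), B is (d_out x r)."""
    return r * (d_in + d_out)


scale = lora_scaling(r=16, lora_alpha=32)

# Hypothetical layer shape, chosen only for illustration.
d_in, d_out = 1024, 1024
extra = lora_param_count(d_in, d_out, r=16)
full = d_in * d_out

print(f"scaling = {scale}")
print(f"adapter params = {extra} ({extra / full:.2%} of the full weight matrix)")
```

Because only the A and B matrices are trained, the number of trainable parameters grows linearly with r rather than with the size of the full weight matrix.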

Training Arguments

| Parameter | Value | Description |
| --- | --- | --- |
| num_train_epochs | 15 | Total number of training epochs |
| learning_rate | 4e-6 | Learning rate |
| per_device_train_batch_size | 32 | Training batch size per device |
| gradient_accumulation_steps | 2 | Gradient accumulation steps |
| optimizer | adamw_8bit | 8-bit AdamW optimizer |
| lr_scheduler_type | linear | Linear learning rate scheduler |
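
The batch-size and scheduler settings interact as sketched below. This is an illustration under stated assumptions: a single training device, no warmup, and a learning rate that decays linearly to zero. None of these is stated in the model card, and `total_steps` is a hypothetical value.

```python
def effective_batch_size(per_device: int, grad_accum: int, num_devices: int = 1) -> int:
    """Number of examples that contribute to each optimizer update."""
    return per_device * grad_accum * num_devices


def linear_lr(step: int, total_steps: int, peak_lr: float = 4e-6) -> float:
    """Linear decay from peak_lr to zero (assumes no warmup phase)."""
    remaining = max(0.0, 1.0 - step / total_steps)
    return peak_lr * remaining


batch = effective_batch_size(per_device=32, grad_accum=2)  # single device assumed
print(f"effective batch size: {batch}")

total_steps = 1000  # hypothetical; the real step count depends on dataset size
for step in (0, 500, 1000):
    print(step, linear_lr(step, total_steps))
```

Gradient accumulation lets each update see 64 examples per device while only 32 are held in memory at a time.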

📊 Evaluation Results

These are the final evaluation results on eval_dataset. Loss measures the difference between the model's predictions and the actual values; lower values indicate better performance.

| Metric | Value |
| --- | --- |
| eval_loss | 1.5407 |
| eval_runtime | 30.8676 s |
| eval_samples_per_second | 68.551 |
| eval_steps_per_second | 8.585 |
| epoch | 15.0 |
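
The reported metrics can be turned into a few derived quantities, sketched below. The perplexity calculation assumes that eval_loss is a mean per-token cross-entropy in nats, which is typical for causal language model training but is not stated in the model card. The sample and step counts are estimates reconstructed from the throughput figures, not values reported by the author.

```python
import math

# Values copied from the evaluation table.
eval_loss = 1.5407
eval_runtime = 30.8676        # seconds
samples_per_second = 68.551
steps_per_second = 8.585

# Assumes eval_loss is mean token-level cross-entropy in nats.
perplexity = math.exp(eval_loss)

# Throughput multiplied by runtime recovers approximate totals.
approx_samples = round(eval_runtime * samples_per_second)
approx_steps = round(eval_runtime * steps_per_second)
approx_batch = approx_samples / approx_steps

print(f"perplexity ~ {perplexity:.2f}")
print(f"eval samples ~ {approx_samples}, eval steps ~ {approx_steps}")
print(f"implied eval batch size ~ {approx_batch:.1f}")
```

With the values above, this gives a perplexity of roughly 4.67 and about 2,116 evaluation samples over about 265 steps, which is consistent with an evaluation batch size of 8.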

license: apache-2.0
tags:

  • unsloth
  • trl
  • sft