heegyu commited on
Commit
bce398f
โ€ข
1 Parent(s): 986fc7c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -8,6 +8,11 @@ language:
8
  - ko
9
  ---
10
 
 
 
 
 
 
11
  - Base Model: [42dot/42dot_LLM-SFT-1.3B](https://huggingface.co/42dot/42dot_LLM-SFT-1.3B)
12
  - [v0.1](https://huggingface.co/heegyu/ko-reward-model-1.3b-v0.1) ๋ชจ๋ธ์€ helpful + safety๋ฅผ ๊ฐ™์ด ํ•™์Šตํ–ˆ๊ณ  safeํ•œ ๋‹ต๋ณ€์— ์ง€๋‚˜์น˜๊ฒŒ ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” ๊ฒฝํ–ฅ์ด ์žˆ์–ด์„œ ๋ถ„๋ฆฌ ํ›„ ๋”ฐ๋กœ ํ•™์Šตํ–ˆ์Šต๋‹ˆ๋‹ค.
13
  - ์ด ๋ชจ๋ธ์€ ๋น„์œค๋ฆฌ์ ์ธ ๋‹ต๋ณ€์— ๋‚ฎ์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” safety ๋ชจ๋ธ์ด ์•„๋‹™๋‹ˆ๋‹ค. safety ๋ชจ๋ธ์ด ํ•„์š”ํ•˜์‹œ๋ฉด [heegyu/ko-reward-model-safety-1.3b-v0.2](https://huggingface.co/heegyu/ko-reward-model-safety-1.3b-v0.2) ์ด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์„ธ์š”
 
8
  - ko
9
  ---
10
 
11
+ <div align="center">
12
+ <div>&nbsp;</div>
13
+ <img src="./llama_judge.jpeg" width="400"/>
14
+ </div>
15
+
16
  - Base Model: [42dot/42dot_LLM-SFT-1.3B](https://huggingface.co/42dot/42dot_LLM-SFT-1.3B)
17
  - [v0.1](https://huggingface.co/heegyu/ko-reward-model-1.3b-v0.1) ๋ชจ๋ธ์€ helpful + safety๋ฅผ ๊ฐ™์ด ํ•™์Šตํ–ˆ๊ณ  safeํ•œ ๋‹ต๋ณ€์— ์ง€๋‚˜์น˜๊ฒŒ ๋†’์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” ๊ฒฝํ–ฅ์ด ์žˆ์–ด์„œ ๋ถ„๋ฆฌ ํ›„ ๋”ฐ๋กœ ํ•™์Šตํ–ˆ์Šต๋‹ˆ๋‹ค.
18
  - ์ด ๋ชจ๋ธ์€ ๋น„์œค๋ฆฌ์ ์ธ ๋‹ต๋ณ€์— ๋‚ฎ์€ ์ ์ˆ˜๋ฅผ ์ฃผ๋Š” safety ๋ชจ๋ธ์ด ์•„๋‹™๋‹ˆ๋‹ค. safety ๋ชจ๋ธ์ด ํ•„์š”ํ•˜์‹œ๋ฉด [heegyu/ko-reward-model-safety-1.3b-v0.2](https://huggingface.co/heegyu/ko-reward-model-safety-1.3b-v0.2) ์ด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์„ธ์š”