|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- nayohan/math-gpt-4o-200k-ko |
|
language: |
|
- ko |
|
- en |
|
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct |
|
--- |
|
Licensed under the Apache License, Version 2.0 (the "License"); |
|
you may not use this file except in compliance with the License. |
|
You may obtain a copy of the License at |
|
http://www.apache.org/licenses/LICENSE-2.0 |
|
|
|
Unless required by applicable law or agreed to in writing, software |
|
distributed under the License is distributed on an "AS IS" BASIS, |
|
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
|
See the License for the specific language governing permissions and |
|
limitations under the License. |
|
unsloth๋ฅผ ์ฌ์ฉํ์ฌ meta-llama/Meta-Llama-3.1-8B-Instruct ๋ชจ๋ธ์ LORA ํ์ธํ๋์ ์๋ฃํ์ต๋๋ค. |
|
|
|
v01๊ณผ์ ์ฐจ์ด์ ์ per_device_train_batch_size, gradient_accumulation_steps ํ๋ผ๋ฏธํฐ์ ๊ฐ์ ๋ณ๊ฒฝํด ํ์ตํ์ต๋๋ค. |
|
|
|
Contact : cgh@tnap.co.kr |