emre570 committed on
Commit
3416abf
1 Parent(s): cdf7bea

Update README.md

  1. README.md +2 -0
README.md CHANGED
@@ -9,6 +9,8 @@ datasets:
   This model is fine-tuned from [Unsloth's Gemma 1.1 7B Instruct quantized model](https://huggingface.co/unsloth/gemma-1.1-7b-it-bnb-4bit) with [naklecha's Minecraft Question-Answer dataset](https://huggingface.co/datasets/naklecha/minecraft-question-answer-700k).
   Fine-tuned with first 100k rows from dataset with 1 epoch, it took around 2 hours 20 minutes with NVIDIA RTX 4090.
 
+ Model can now generate some good answers. But sometimes it can generate inappropriate answers. I think this problem is based on lack of data.
+
   ## Important Notes
   - Model sometimes generates answers with no meanings. I am currently investigating this. This process can be long since I am a beginner in this field. If you have any suggestions, feel free to say it on model's Community page.
   - Model is using bitsandbytes so use it with a CUDA supported GPU.
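
The README text in this diff mentions training on the first 100k rows of the dataset and needing a CUDA-capable GPU for bitsandbytes. Below is a minimal Python sketch of both points. It is not the author's training script. The helper names `first_rows_split`, `cuda_available`, and `load_training_subset` are hypothetical, and the `train` split name is an assumption about how the dataset is laid out.

```python
import importlib.util

# Dataset ID taken from the README link; the "train" split name is an assumption.
DATASET_ID = "naklecha/minecraft-question-answer-700k"


def first_rows_split(n_rows: int, split: str = "train") -> str:
    """Build a `datasets` split expression selecting the first n_rows rows."""
    if n_rows <= 0:
        raise ValueError("n_rows must be positive")
    return f"{split}[:{n_rows}]"


def cuda_available() -> bool:
    """Return True only if PyTorch is installed and can see a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch  # imported lazily so the check works without PyTorch installed

    return torch.cuda.is_available()


def load_training_subset(n_rows: int = 100_000):
    """Hypothetical helper: download only the first n_rows rows of the dataset."""
    from datasets import load_dataset  # requires the `datasets` package and network access

    return load_dataset(DATASET_ID, split=first_rows_split(n_rows))


if __name__ == "__main__":
    print("Split expression:", first_rows_split(100_000))
    if not cuda_available():
        print("No CUDA device detected; the bitsandbytes 4-bit model is unlikely to load here.")
```

Calling `cuda_available()` before loading the model gives a clearer message than a bitsandbytes import error would on a machine without a supported GPU.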