kil3r
/

gptj6b-lora-owca

Text Generation

Model card Files Files and versions Community

gptj6b-lora-owca / README.md

kil3r's picture

Update README.md

a2d18da over 1 year ago

|

364 Bytes

	# This repo contains EleutherAI/gpt-j-6B fine tuned on OWCA (https://github.com/Emplocity/owca) using LoRa

	Training params:<br/>
	```
	MICRO_BATCH_SIZE = 64
	BATCH_SIZE = 128
	GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE
	EPOCHS = 3
	LEARNING_RATE = 2e-5
	CUTOFF_LEN = 256
	LORA_R = 4
	LORA_ALPHA = 16
	LORA_DROPOUT = 0.05
	warmup_steps=100
	fp16=True
	```