Trained 19k data points of the above mentioned dataset, with 512 max sequence length, along with QLORA config, for 3 epochs.
Model tree for nikJ13/math_solver_qlora_llama
Base model
meta-llama/Llama-3.2-3B-InstructTrained 19k data points of the above mentioned dataset, with 512 max sequence length, along with QLORA config, for 3 epochs.
Base model
meta-llama/Llama-3.2-3B-Instruct