metadata

base_model: unsloth/gemma-2-2b-it-bnb-4bit
language:
  - en
license: gemma
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - gemma2
  - trl

ActionGemma2 v2

Developed by: dushj98
License: apache-2.0
Finetuned from model : unsloth/gemma-2-2b-it-bnb-4bit

This gemma2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Note

This version of the model was fine-tuned for a full epoch whereas ActionGemma2 Test v1 was only fine-tuned for 60 steps.

This is a fine-tuned model for testing purposes, fine-tuned for a full single epoch in Kaggle. The dataset used for SFT has been changed since and this model is no longer valid.