metadata

base_model: llm-jp/llm-jp-3-13b
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - trl
license: cc-by-nc-sa-2.0
language:
  - ja

Uploaded model

Developed by: MMMio
License: apache-2.0
Finetuned from model : llm-jp/llm-jp-3-13b

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

README

モデル概要

本モデルは、日本語事前学習済みモデル llm-jp/llm-jp-3-13bに、ichikara-instruction: LLMのための日本語インストラクションデータを用いて Fine-Tuning したモデルである。

ライセンス

本モデルは、CC-BY-NC-SA ライセンスの下で公開されています。

LLMでの出力方法

1. 学習方法

サンプルコードを使用してFine-Tuningをおこなった。学習データも同様

2. 推論方法

T4 GPU環境で、同じくサンプルコード (unsloth) を使用して推論。