update readme
README.md (CHANGED)

@@ -18,8 +18,27 @@ inference: false
![rinna-icon](./rinna.png)

# Overview

This repository provides a Japanese GPT-NeoX model of 3.6 billion parameters. The model is based on [`rinna/japanese-gpt-neox-3.6b`](https://huggingface.co/rinna/japanese-gpt-neox-3.6b) and has been finetuned to serve as an instruction-following conversational agent.
* **Model architecture**

  A 36-layer, 2816-hidden-size transformer-based language model. (A verification sketch follows this list.)

* **Finetuning**

  The finetuning data is a subset of the following datasets, translated into Japanese:
  * [Anthropic HH RLHF data](https://huggingface.co/datasets/Anthropic/hh-rlhf)
  * [FLAN Instruction Tuning data](https://github.com/google-research/FLAN)
  * [Stanford Human Preferences Dataset](https://huggingface.co/datasets/stanfordnlp/SHP)

  The data will **not** be released.

* **Authors**

  [Tianyu Zhao](https://huggingface.co/tianyuz) and [Kei Sawada](https://huggingface.co/keisawada)
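
The architecture figures above can be checked against the published model configuration. A hedged sketch, assuming this repository's model id (`rinna/japanese-gpt-neox-3.6b-instruction-sft`) and the standard GPT-NeoX config field names:

~~~~
# Hedged sketch: inspect the config to confirm the 36-layer,
# 2816-hidden-size spec. The model id and the config field names are
# assumptions, not taken from this card.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("rinna/japanese-gpt-neox-3.6b-instruction-sft")
print(config.num_hidden_layers)  # expected: 36
print(config.hidden_size)        # expected: 2816
~~~~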

# I/O Format

A special format has been adopted to construct inputs (a sketch in code follows the list).
* An input prompt is formatted as a conversation between `ユーザー` ("user") and `システム` ("system").
* Each input utterance consists of (1) its speaker (`"ユーザー"` or `"システム"`), (2) a colon (`":"`), (3) a whitespace (`" "`), and (4) utterance text (e.g. `"世界で一番高い山は?"`, "What is the highest mountain in the world?").
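
A minimal sketch of this prompt construction, assuming the conversation is given as a list of speaker/text pairs; the `<NL>` turn separator and the trailing empty `システム: ` turn that cues the model's reply are assumptions here, since the diff truncates the format description:

~~~~
# Hedged sketch of the I/O format described above. The "<NL>" separator
# and the trailing empty "システム: " turn are assumptions; consult the
# full format description in the card.
conversation = [
    {"speaker": "ユーザー", "text": "世界で一番高い山は?"},
]

# speaker + colon + whitespace + utterance text, per the bullets above
turns = [f"{t['speaker']}: {t['text']}" for t in conversation]
prompt = "<NL>".join(turns) + "<NL>" + "システム: "
print(prompt)
# ユーザー: 世界で一番高い山は?<NL>システム: 
~~~~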

@@ -93,17 +112,6 @@ print(output)

4. 道玄坂です。道玄坂は、日本の商業地区である坂道です。</s>"""
~~~~

# Tokenization

The model uses a [sentencepiece](https://github.com/google/sentencepiece)-based tokenizer.
* The tokenizer has a vocabulary size of 32,000.
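
A hedged sketch of loading this tokenizer with `transformers`; the model id and the `use_fast=False` flag are assumptions (sentencepiece-based tokenizers are typically served by the slow tokenizer class):

~~~~
# Hedged sketch: load the sentencepiece-based tokenizer and check the
# stated vocabulary size. Model id and use_fast=False are assumptions.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    "rinna/japanese-gpt-neox-3.6b-instruction-sft", use_fast=False
)
print(tokenizer.vocab_size)  # expected: 32000
~~~~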