---
base_model: SanjiWatsuki/Kunoichi-DPO-v2-7B
inference: false
language:
- en
license: cc-by-nc-4.0
model_creator: SanjiWatsuki
model_name: Kunoichi-DPO-v2-7B
model_type: mistral
pipeline_tag: text-generation
prompt_template: "{{system_message}}



### Instruction:

{{prompt}}



### Response:

"
quantized_by: brittlewis12
---

# Kunoichi-DPO-v2-7B GGUF

![Kunoichi-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-7B/resolve/main/assets/kunoichi.png)

Original model: [Kunoichi-DPO-v2-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-DPO-v2-7B)
Model creator: [SanjiWatsuki](https://huggingface.co/SanjiWatsuki)

This repo contains GGUF format model files for SanjiWatsuki’s Kunoichi-DPO-v2-7B.

### What is GGUF?

GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st, 2023, as a replacement for GGML, which is no longer supported by llama.cpp.

Converted using llama.cpp build 1879 (revision [3e5ca79](https://github.com/ggerganov/llama.cpp/commit/3e5ca7931c68152e4ec18d126e9c832dd84914c8)).

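If you want to run these files directly, the snippet below is a minimal sketch that downloads one quantization with `huggingface_hub` and generates a single completion with [llama-cpp-python](https://github.com/abetlen/llama-cpp-python). The repo id and filename shown are illustrative assumptions; substitute the actual quantization file listed in this repo.

```python
# Minimal sketch: download one GGUF quant and run a single completion locally.
# Assumptions: the repo_id and filename below are illustrative placeholders;
# use the actual quantization file listed in this repository.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

model_path = hf_hub_download(
    repo_id="brittlewis12/Kunoichi-DPO-v2-7B-GGUF",  # assumed repo id
    filename="kunoichi-dpo-v2-7b.Q4_K_M.gguf",       # assumed quant filename
)

llm = Llama(model_path=model_path, n_ctx=4096)

# Alpaca-style prompt, matching the template documented below.
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nWrite a haiku about a ninja.\n\n"
    "### Response:\n"
)

output = llm(prompt, max_tokens=128, stop=["### Instruction:"])
print(output["choices"][0]["text"])
```

Any quantization file in this repo can be swapped in; smaller quants trade some output quality for a smaller memory footprint.
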
### Prompt template: Unknown (Alpaca)

The [Alpaca-style](https://huggingface.co/SanjiWatsuki/Kunoichi-7B#prompt-template-custom-format-or-alpaca) prompt format was used for the original [Kunoichi-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-7B):

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{{prompt}}

### Response:

```

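In code, the same template can be assembled with plain string formatting. Below is a minimal sketch; `build_prompt` is a hypothetical helper (not part of llama.cpp or any library), and the default system message is the Alpaca boilerplate shown above.

```python
# Minimal sketch of filling the {{system_message}} and {{prompt}} slots of the
# Alpaca-style template above. build_prompt is a hypothetical helper.
DEFAULT_SYSTEM = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)

def build_prompt(prompt: str, system_message: str = DEFAULT_SYSTEM) -> str:
    """Return a fully formatted Alpaca-style prompt string."""
    return f"{system_message}\n\n### Instruction:\n{prompt}\n\n### Response:\n"

if __name__ == "__main__":
    print(build_prompt("Summarize the plot of Hamlet in two sentences."))
```
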
---

## Download & run with [cnvrs](https://testflight.apple.com/join/sFWReS7K) on iPhone, iPad, and Mac!

[cnvrs](https://testflight.apple.com/join/sFWReS7K) is the best app for private, local AI on your device:
- create & save **Characters** with custom system prompts & temperature settings
- download and experiment with any **GGUF model** you can [find on HuggingFace](https://huggingface.co/models?library=gguf)!
- make it your own with custom **Theme colors**
- powered by Metal ⚡️ & [Llama.cpp](https://github.com/ggerganov/llama.cpp), with **haptics** during response streaming!
- **try it out** yourself today, on [Testflight](https://testflight.apple.com/join/sFWReS7K)!

---

## Original Model Evaluations

| Model | MT Bench | EQ Bench | MMLU | Logic Test |
|------------------------|----------|-----------|-----------|------------|
| GPT-4-Turbo            | 9.32     | -         | -         | -          |
| GPT-4                  | 8.99     | 62.52     | 86.4      | 0.86       |
| **Kunoichi-DPO-v2-7B** | **8.51** | **42.18** | -         | **0.58**   |
| Mixtral-8x7B-Instruct  | 8.30     | 44.81     | 70.6      | 0.75       |
| **Kunoichi-DPO-7B**    | **8.29** | **41.60** | **64.83** | **0.59**   |
| **Kunoichi-7B**        | **8.14** | **44.32** | **64.9**  | **0.58**   |
| Starling-7B            | 8.09     | -         | 63.9      | 0.51       |
| Claude-2               | 8.06     | 52.14     | 78.5      | -          |
| Silicon-Maid-7B        | 7.96     | 40.44     | 64.7      | 0.54       |
| Loyal-Macaroni-Maid-7B | 7.95     | 38.66     | 64.9      | 0.57       |
| GPT-3.5-Turbo          | 7.94     | 50.28     | 70        | 0.57       |
| Claude-1               | 7.9      | -         | 77        | -          |
| Openchat-3.5           | 7.81     | 37.08     | 64.3      | 0.39       |
| Dolphin-2.6-DPO        | 7.74     | 42.88     | 61.9      | 0.53       |
| Zephyr-7B-beta         | 7.34     | 38.71     | 61.4      | 0.30       |
| Llama-2-70b-chat-hf    | 6.86     | 51.56     | 63        | -          |
| Neural-chat-7b-v3-1    | 6.84     | 43.61     | 62.4      | 0.30       |

| Model | Average | AGIEval | GPT4All | TruthfulQA | Bigbench |
|---|---:|---:|---:|---:|---:|
| **Kunoichi-DPO-7B** | **58.4** | 45.08 | 74 | 66.99 | 47.52 |
| **Kunoichi-DPO-v2-7B** | **58.31** | 44.85 | 75.05 | 65.69 | 47.65 |
| [Kunoichi-7B](https://huggingface.co/SanjiWatsuki/Kunoichi-7B) | 57.54 | 44.99 | 74.86 | 63.72 | 46.58 |
| [OpenPipe/mistral-ft-optimized-1218](https://huggingface.co/OpenPipe/mistral-ft-optimized-1218) | 56.85 | 44.74 | 75.6 | 59.89 | 47.17 |
| [Silicon-Maid-7B](https://huggingface.co/SanjiWatsuki/Silicon-Maid-7B) | 56.45 | 44.74 | 74.26 | 61.5 | 45.32 |
| [mlabonne/NeuralHermes-2.5-Mistral-7B](https://huggingface.co/mlabonne/NeuralHermes-2.5-Mistral-7B) | 53.51 | 43.67 | 73.24 | 55.37 | 41.76 |
| [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B) | 52.42 | 42.75 | 72.99 | 52.99 | 40.94 |
| [openchat/openchat_3.5](https://huggingface.co/openchat/openchat_3.5) | 51.34 | 42.67 | 72.92 | 47.27 | 42.51 |
| [berkeley-nest/Starling-LM-7B-alpha](https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha) | 51.16 | 42.06 | 72.72 | 47.33 | 42.53 |
| [HuggingFaceH4/zephyr-7b-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta) | 50.99 | 37.33 | 71.83 | 55.1 | 39.7 |

| Model | AlpacaEval2 | Length |
| --------------------------- | ----------- | ------ |
| GPT-4                       | 23.58%      | 1365   |
| GPT-4 0314                  | 22.07%      | 1371   |
| Mistral Medium              | 21.86%      | 1500   |
| Mixtral 8x7B v0.1           | 18.26%      | 1465   |
| **Kunoichi-DPO-v2**         | **17.19%**  | 1785   |
| Claude 2                    | 17.19%      | 1069   |
| Claude                      | 16.99%      | 1082   |
| Gemini Pro                  | 16.85%      | 1315   |
| GPT-4 0613                  | 15.76%      | 1140   |
| Claude 2.1                  | 15.73%      | 1096   |
| Mistral 7B v0.2             | 14.72%      | 1676   |
| GPT 3.5 Turbo 0613          | 14.13%      | 1328   |
| LLaMA2 Chat 70B             | 13.87%      | 1790   |
| LMCocktail-10.7B-v1         | 13.15%      | 1203   |
| WizardLM 13B V1.1           | 11.23%      | 1525   |
| Zephyr 7B Beta              | 10.99%      | 1444   |
| OpenHermes-2.5-Mistral (7B) | 10.34%      | 1107   |
| GPT 3.5 Turbo 0301          | 9.62%       | 827    |
| **Kunoichi-7B**             | **9.38%**   | 1492   |
| GPT 3.5 Turbo 1106          | 9.18%       | 796    |
| GPT-3.5                     | 8.56%       | 1018   |
| Phi-2 DPO                   | 7.76%       | 1687   |
| LLaMA2 Chat 13B             | 7.70%       | 1513   |