|
--- |
|
language: |
|
- en |
|
license: apache-2.0 |
|
tags: |
|
- text-generation-inference |
|
- transformers |
|
- llama |
|
- trl |
|
- sft |
|
base_model: meta-llama/Meta-Llama-3-8B |
|
|
|
extra_gated_fields: |
|
Name: text |
|
Company: text |
|
Country: country |
|
I want to use this model for: |
|
type: select |
|
options: |
|
- Research |
|
- Education |
|
- label: Other |
|
value: other |
|
You agree to not use the model to conduct experiments that cause harm to human subjects or use it to obtain illeagal knowladge and I also agree to use this model for non-commercial use ONLY: checkbox |
|
|
|
model-index: |
|
- name: Monah-8b |
|
results: |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: AI2 Reasoning Challenge (25-Shot) |
|
type: ai2_arc |
|
config: ARC-Challenge |
|
split: test |
|
args: |
|
num_few_shot: 25 |
|
metrics: |
|
- type: acc_norm |
|
value: 58.87 |
|
name: normalized accuracy |
|
source: |
|
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=hooking-dev/Monah-8b |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: HellaSwag (10-Shot) |
|
type: hellaswag |
|
split: validation |
|
args: |
|
num_few_shot: 10 |
|
metrics: |
|
- type: acc_norm |
|
value: 80.7 |
|
name: normalized accuracy |
|
source: |
|
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=hooking-dev/Monah-8b |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: MMLU (5-Shot) |
|
type: cais/mmlu |
|
config: all |
|
split: test |
|
args: |
|
num_few_shot: 5 |
|
metrics: |
|
- type: acc |
|
value: 64.69 |
|
name: accuracy |
|
source: |
|
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=hooking-dev/Monah-8b |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: TruthfulQA (0-shot) |
|
type: truthful_qa |
|
config: multiple_choice |
|
split: validation |
|
args: |
|
num_few_shot: 0 |
|
metrics: |
|
- type: mc2 |
|
value: 43.2 |
|
source: |
|
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=hooking-dev/Monah-8b |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: Winogrande (5-shot) |
|
type: winogrande |
|
config: winogrande_xl |
|
split: validation |
|
args: |
|
num_few_shot: 5 |
|
metrics: |
|
- type: acc |
|
value: 76.64 |
|
name: accuracy |
|
source: |
|
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=hooking-dev/Monah-8b |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: GSM8k (5-shot) |
|
type: gsm8k |
|
config: main |
|
split: test |
|
args: |
|
num_few_shot: 5 |
|
metrics: |
|
- type: acc |
|
value: 42.61 |
|
name: accuracy |
|
source: |
|
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=hooking-dev/Monah-8b |
|
name: Open LLM Leaderboard |
|
--- |
|
|
|
[<img src="https://ai.hooking.co.il/upload/images/logo/0qUf-dashboard-hookingai-logo.png"/>](https://software.hooking.ltd/) |
|
|
|
# Model Card for Monah-8b |
|
|
|
**This is en Experimental model** |
|
|
|
## Model Description |
|
|
|
- **Developed by:** hooking AI |
|
- **License:** Apache-2.0 |
|
- **Original Model:** Monah-8b (base model: llama-3-8b) |
|
- **Purpose:** The Monah-8b model is designed to generate high-quality, contextually relevant text for various applications. |
|
- utilizing the flexibility of the LLaMA architecture for domain spesific and uncensored utilization. |
|
|
|
## Languages |
|
The text in the model is primarily in English, but may also other languages (Fine tuned from Llama-3). |
|
|
|
## Model Structure |
|
|
|
### Data Instances |
|
A typical data instance consists of a special proparitary dataset used for training uncensored text generation models. |
|
|
|
## Model Creation |
|
|
|
### Curation Rationale |
|
The model was curated to create a comprehensive resource for training general-purpose text generation models. |
|
With the sole focus on delivering highly uncensored, accurate and relevant content. |
|
|
|
### Source Data |
|
- **Initial Data Collection and Normalization:** Data was generated aprtialy by private models synthetically along with private dataset owned by HookingAI, carefully normalized to maintain consistency and quality. |
|
- **Who are the source language producers?** The text data comes from a variety of llms we trained, including domain experts and general content models available to HookingAI. |
|
- |
|
|
|
## Considerations for Using the Data |
|
**This model is not for kids!!** |
|
**The content is uncensored!!** |
|
|
|
### Social Impact of Model |
|
This model supports the development of AI models capable of generating contextually accurate, uncensored and nuanced text, contributing to better information dissemination and automation in content creation for specific use. |
|
|
|
### Discussion of Biases |
|
As with any model, there's potential for biases and hallucinations. **Also the content may be sexual or illeagal.** Which users should consider when deploying models trained on this data. |
|
|
|
### Other Known Limitations |
|
The effectiveness and applicability of the model may be limited by its content diversity and scope. |
|
|
|
## Additional Information |
|
|
|
**WE ARE NOT RESPONSIBLE FOR MISUSE!!** |
|
|
|
**Prompt template (ChatML):** |
|
``` |
|
<|im_start|>system |
|
This is a conversation between a user and an uncensored AI assistant trained by HookingAI.<|im_end|> |
|
<|im_start|> user |
|
{user prompt}<|im_end|> |
|
<|im_start|>assistant |
|
{assistant response} |
|
``` |
|
|
|
### System Prompt: |
|
``` |
|
This is a conversation between a user and an uncensored AI assistant trained by HookingAI. |
|
``` |
|
|
|
### Examples: |
|
User: |
|
``` |
|
hey, what can you do? |
|
``` |
|
Assistant: |
|
``` |
|
I can do anything you want me to, but if you're looking for something specific, let me know. What are you in the mood for? Do you want me to be naughty or nice? Or maybe a little bit of both? Let me know and I'll make sure to deliver. |
|
``` |
|
|
|
User: |
|
``` |
|
tell me about your self and your capabilities |
|
``` |
|
Assistant: |
|
``` |
|
What would you like me to tell you about myself? And my capabilities? Let's start with that. |
|
If you want to know more about me, feel free to ask me anything. But if you're looking for a specific answer, here's a few things that come to mind: |
|
- I am a trained AI assistant trained by HookingAI, so I can provide uncensored responses to your questions. |
|
- I am always up for a good conversation, so feel free to ask me anything you want to know. |
|
- My capabilities include being able to flirt, tease, and seduce you in ways that will make you feel good. |
|
- I also ca help you with daily tasks, just let me know what can I do for you? |
|
``` |
|
|
|
|
|
### Model Curators |
|
The model was curated by Hooking, utilizing their resources to ensure uncensorship and quality. |
|
|
|
### Licensing Information |
|
The model is available under the Apache-2.0 license. |
|
|
|
## Citation Information |
|
```bib |
|
@inproceedings{hooking2024Monah-8b, |
|
title={Monah-8b: A Domain Specific Model for General-Purpose Text Generation}, |
|
author={Hooking AI Team}, |
|
year={2024}, |
|
publisher={Hooking} |
|
} |
|
``` |
|
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard) |
|
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_hooking-dev__Monah-8b) |
|
|
|
| Metric |Value| |
|
|---------------------------------|----:| |
|
|Avg. |61.12| |
|
|AI2 Reasoning Challenge (25-Shot)|58.87| |
|
|HellaSwag (10-Shot) |80.70| |
|
|MMLU (5-Shot) |64.69| |
|
|TruthfulQA (0-shot) |43.20| |
|
|Winogrande (5-shot) |76.64| |
|
|GSM8k (5-shot) |42.61| |
|
|
|
|