Quantization made by Richard Erkhov.

calme-2.3-llama3.1-70b - GGUF

Model creator: https://huggingface.co/MaziyarPanahi/
Original model: https://huggingface.co/MaziyarPanahi/calme-2.3-llama3.1-70b/

Name	Quant method	Size
calme-2.3-llama3.1-70b.Q2_K.gguf	Q2_K	24.56GB
calme-2.3-llama3.1-70b.IQ3_XS.gguf	IQ3_XS	27.29GB
calme-2.3-llama3.1-70b.IQ3_S.gguf	IQ3_S	28.79GB
calme-2.3-llama3.1-70b.Q3_K_S.gguf	Q3_K_S	28.79GB
calme-2.3-llama3.1-70b.IQ3_M.gguf	IQ3_M	29.74GB
calme-2.3-llama3.1-70b.Q3_K.gguf	Q3_K	31.91GB
calme-2.3-llama3.1-70b.Q3_K_M.gguf	Q3_K_M	31.91GB
calme-2.3-llama3.1-70b.Q3_K_L.gguf	Q3_K_L	34.59GB
calme-2.3-llama3.1-70b.IQ4_XS.gguf	IQ4_XS	35.64GB
calme-2.3-llama3.1-70b.Q4_0.gguf	Q4_0	37.22GB
calme-2.3-llama3.1-70b.IQ4_NL.gguf	IQ4_NL	37.58GB
calme-2.3-llama3.1-70b.Q4_K_S.gguf	Q4_K_S	37.58GB
calme-2.3-llama3.1-70b.Q4_K.gguf	Q4_K	39.6GB
calme-2.3-llama3.1-70b.Q4_K_M.gguf	Q4_K_M	39.6GB
calme-2.3-llama3.1-70b.Q4_1.gguf	Q4_1	41.27GB
calme-2.3-llama3.1-70b.Q5_0.gguf	Q5_0	45.32GB
calme-2.3-llama3.1-70b.Q5_K_S.gguf	Q5_K_S	45.32GB
calme-2.3-llama3.1-70b.Q5_K.gguf	Q5_K	46.52GB
calme-2.3-llama3.1-70b.Q5_K_M.gguf	Q5_K_M	46.52GB
calme-2.3-llama3.1-70b.Q5_1.gguf	Q5_1	49.36GB
calme-2.3-llama3.1-70b.Q6_K.gguf	Q6_K	53.91GB
calme-2.3-llama3.1-70b.Q8_0.gguf	Q8_0	69.83GB
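As a rough aid for choosing among the files above, the following sketch picks the largest quantization that fits a given memory budget. The sizes are copied from the table. The function name `largest_quant_within` and the `headroom_gb` allowance are hypothetical, and treating file size as a stand-in for memory use is an assumption: actual usage also depends on context length and the runtime.

```python
# Sizes in GB copied from the table above (quant method -> file size).
QUANT_SIZES_GB = {
    "Q2_K": 24.56, "IQ3_XS": 27.29, "IQ3_S": 28.79, "Q3_K_S": 28.79,
    "IQ3_M": 29.74, "Q3_K": 31.91, "Q3_K_M": 31.91, "Q3_K_L": 34.59,
    "IQ4_XS": 35.64, "Q4_0": 37.22, "IQ4_NL": 37.58, "Q4_K_S": 37.58,
    "Q4_K": 39.6, "Q4_K_M": 39.6, "Q4_1": 41.27, "Q5_0": 45.32,
    "Q5_K_S": 45.32, "Q5_K": 46.52, "Q5_K_M": 46.52, "Q5_1": 49.36,
    "Q6_K": 53.91, "Q8_0": 69.83,
}

MODEL_NAME = "calme-2.3-llama3.1-70b"


def largest_quant_within(budget_gb, headroom_gb=2.0):
    """Return (filename, size_gb) for the largest file that fits, or None.

    headroom_gb is a hypothetical allowance for anything beyond the weights
    (for example the KV cache); it is an assumption, not a measured value.
    """
    limit = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items() if size <= limit]
    if not fitting:
        return None
    # Equal sizes are tie-broken by quant name (alphabetically last wins).
    size, name = max(fitting)
    return f"{MODEL_NAME}.{name}.gguf", size


if __name__ == "__main__":
    for budget in (24, 42, 48, 80):
        print(budget, largest_quant_within(budget))
```

With a 48 GB budget and the default 2 GB headroom this selects the Q5_K_S file; with 24 GB nothing in the table fits and the function returns `None`.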

Original model description:

language:
- en
library_name: transformers
tags:
- chat
- llama
- facebook
- llaam3
- finetune
- chatml
base_model: meta-llama/Meta-Llama-3.1-70B-Instruct
datasets:
- MaziyarPanahi/truthy-dpo-v0.1-axolotl
model_name: calme-2.3-llama3.1-70b
pipeline_tag: text-generation
inference: false
model_creator: MaziyarPanahi
quantized_by: MaziyarPanahi
model-index:
- name: calme-2.3-llama3.1-70b
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: IFEval (0-Shot)
      type: HuggingFaceH4/ifeval
      args:
        num_few_shot: 0
    metrics:
    - type: inst_level_strict_acc and prompt_level_strict_acc
      value: 86.05
      name: strict accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.3-llama3.1-70b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: BBH (3-Shot)
      type: BBH
      args:
        num_few_shot: 3
    metrics:
    - type: acc_norm
      value: 55.59
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.3-llama3.1-70b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MATH Lvl 5 (4-Shot)
      type: hendrycks/competition_math
      args:
        num_few_shot: 4
    metrics:
    - type: exact_match
      value: 21.45
      name: exact match
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.3-llama3.1-70b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GPQA (0-shot)
      type: Idavidrein/gpqa
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 12.53
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.3-llama3.1-70b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MuSR (0-shot)
      type: TAUR-Lab/MuSR
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 17.74
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.3-llama3.1-70b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU-PRO (5-shot)
      type: TIGER-Lab/MMLU-Pro
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 48.48
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.3-llama3.1-70b
      name: Open LLM Leaderboard

MaziyarPanahi/calme-2.3-llama3.1-70b

This model is a fine-tuned version of the powerful meta-llama/Meta-Llama-3.1-70B-Instruct, pushing the boundaries of natural language understanding and generation even further. My goal was to create a versatile and robust model that excels across a wide range of benchmarks and real-world applications.

Use Cases

This model is suitable for a wide range of applications, including but not limited to:

Advanced question-answering systems
Intelligent chatbots and virtual assistants
Content generation and summarization
Code generation and analysis
Complex problem-solving and decision support

⚡ Quantized GGUF

coming soon!

🏆 Open LLM Leaderboard Evaluation Results

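Detailed results can be found on the Open LLM Leaderboard: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.3-llama3.1-70b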

Metric	Value
Avg.	40.30
IFEval (0-Shot)	86.05
BBH (3-Shot)	55.59
MATH Lvl 5 (4-Shot)	21.45
GPQA (0-shot)	12.53
MuSR (0-shot)	17.74
MMLU-PRO (5-shot)	48.48
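The "Avg." row can be checked against the six benchmark scores. This is a small sketch assuming the average is a plain arithmetic mean. The scores are copied from the table, and the variable names are illustrative.

```python
# Benchmark scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 86.05,
    "BBH (3-Shot)": 55.59,
    "MATH Lvl 5 (4-Shot)": 21.45,
    "GPQA (0-shot)": 12.53,
    "MuSR (0-shot)": 17.74,
    "MMLU-PRO (5-shot)": 48.48,
}

# Assumption: "Avg." is the unweighted mean of the six scores.
mean_score = sum(scores.values()) / len(scores)
print(f"{mean_score:.2f}")
```

The mean of the rounded values comes out within 0.01 of the reported 40.30. The small gap is presumably because the leaderboard averages unrounded scores, though the source does not say so.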

This model uses the Llama 3 prompt template:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{prompt}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
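The template above can be filled in with plain string formatting. The helper below is a minimal sketch: `build_prompt` is a hypothetical name, and the trailing blank line after the assistant header is an assumption based on the blank lines that follow the other headers. In practice the tokenizer's own chat template will usually produce this string for you.

```python
def build_prompt(system_prompt: str, prompt: str) -> str:
    """Fill the prompt template shown above with a system and a user message."""
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
        f"{prompt}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
        # Assumption: the assistant header is followed by a blank line, like
        # the system and user headers, and the model's reply starts after it.
    )


if __name__ == "__main__":
    print(build_prompt("You are a helpful assistant.", "Who are you?"))
```

If the runtime or tokenizer already prepends `<|begin_of_text|>`, that token may need to be dropped from the string to avoid duplicating it.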

How to use


# Use a pipeline as a high-level helper

from transformers import pipeline

messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe = pipeline("text-generation", model="MaziyarPanahi/calme-2.3-llama3.1-70b")
print(pipe(messages))  # print the generation result


# Load model directly

from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("MaziyarPanahi/calme-2.3-llama3.1-70b")
model = AutoModelForCausalLM.from_pretrained("MaziyarPanahi/calme-2.3-llama3.1-70b")

Ethical Considerations

As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.