hansmueller464's picture
Upload README.md with huggingface_hub
f44eca1 verified
metadata
language:
  - en
license: cc-by-nc-4.0
library_name: transformers
tags:
  - biology
  - medical
  - llama-cpp
  - gguf-my-repo
datasets:
  - argilla/dpo-mix-7k
  - nvidia/HelpSteer
  - jondurbin/airoboros-3.2
  - hkust-nlp/deita-10k-v0
  - LDJnr/Capybara
  - HPAI-BSC/CareQA
  - GBaker/MedQA-USMLE-4-options
  - lukaemon/mmlu
  - bigbio/pubmed_qa
  - openlifescienceai/medmcqa
  - bigbio/med_qa
  - HPAI-BSC/better-safe-than-sorry
  - HPAI-BSC/pubmedqa-cot
  - HPAI-BSC/medmcqa-cot
  - HPAI-BSC/medqa-cot
pipeline_tag: question-answering

hansmueller464/Llama3-Aloe-8B-Alpha-Q6_K-GGUF

This model was converted to GGUF format from HPAI-BSC/Llama3-Aloe-8B-Alpha using llama.cpp via the ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Use with llama.cpp

Install llama.cpp through brew.

brew install ggerganov/ggerganov/llama.cpp

Invoke the llama.cpp server or the CLI.

CLI:

llama-cli --hf-repo hansmueller464/Llama3-Aloe-8B-Alpha-Q6_K-GGUF --model llama3-aloe-8b-alpha.Q6_K.gguf -p "The meaning to life and the universe is"

Server:

llama-server --hf-repo hansmueller464/Llama3-Aloe-8B-Alpha-Q6_K-GGUF --model llama3-aloe-8b-alpha.Q6_K.gguf -c 2048

Note: You can also use this checkpoint directly through the usage steps listed in the Llama.cpp repo as well.

git clone https://github.com/ggerganov/llama.cpp &&             cd llama.cpp &&             make &&             ./main -m llama3-aloe-8b-alpha.Q6_K.gguf -n 128