jartine committed on
Commit 072f53a
1 Parent(s): ce86e9e

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -49,7 +49,7 @@ It uses Cosmopolitan Libc to turn LLM weights into runnable llama.cpp
 binaries that run on the stock installs of six OSes for both ARM64 and
 AMD64.
 
-## About Quantization Formats
+## About Quantization Formats (General Advice)
 
 Your choice of quantization format depends on three things:
 
@@ -67,7 +67,7 @@ computation speed (flops) so simpler quants help.
 
 Note: BF16 is currently only supported on CPU.
 
-## Hardware Choices
+## Hardware Choices (LLaMA3 70B Specific)
 
 Any Macbook with 32GB should be able to run
 Meta-Llama-3-70B-Instruct.Q2\_K.llamafile reasonably well. At this
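For orientation, the renamed "Hardware Choices (LLaMA3 70B Specific)" section refers to running the weights directly as a llamafile. A minimal sketch of that on macOS/Linux/BSD follows; the filename is taken from the diff context above, while the default server behavior and accepted flags depend on the llamafile release bundled with these weights, so treat it as illustrative rather than authoritative:

```sh
# Mark the downloaded weights file as executable, then launch it.
# By default llamafile starts the built-in llama.cpp chat server locally
# and opens it in your browser (exact behavior varies by release).
chmod +x Meta-Llama-3-70B-Instruct.Q2_K.llamafile
./Meta-Llama-3-70B-Instruct.Q2_K.llamafile
```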