Commit be900b8 by andrijdavid (parent: c900bba)

Upload folder using huggingface_hub

Files changed (1):
  1. README.md +12 -9
README.md CHANGED

@@ -29,6 +29,7 @@ license: cc-by-nc-4.0
 library_name: transformers
 tags:
 - GGUF
+inference: false
 quantized_by: andrijdavid
 ---
 # aya-23-8B-GGUF
@@ -228,6 +229,10 @@ Here are guides on using llama-cpp-python and ctransformers with LangChain:
 
 # Model Card for Aya-23-8B
 
+**Try Aya 23**
+
+You can try out Aya 23 (35B) before downloading the weights in our hosted Hugging Face Space [here](https://huggingface.co/spaces/CohereForAI/aya-23).
+
 ## Model Summary
 
 Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. Aya 23 focuses on pairing a highly performant pre-trained [Command family](https://huggingface.co/CohereForAI/c4ai-command-r-plus) of models with the recently released [Aya Collection](https://huggingface.co/datasets/CohereForAI/aya_collection). The result is a powerful multilingual large language model serving 23 languages.
@@ -243,10 +248,6 @@ Developed by: [Cohere For AI](https://cohere.for.ai) and [Cohere](https://cohere
 - Model: aya-23-8B
 - Model Size: 8 billion parameters
 
-**Try Aya 23**
-
-You can try out Aya 23 (35B) before downloading the weights in our hosted Hugging Face Space [here](https://huggingface.co/spaces/CohereForAI/aya-23).
-
 ### Usage
 
 Please install transformers from the source repository that includes the necessary changes for this model
@@ -312,11 +313,13 @@ You can try Aya 23 in the Cohere [playground](https://dashboard.cohere.com/playg
 
 ### Citation info
 ```bibtex
-@misc{aya23technicalreport,
-title={Aya 23: Open Weight Releases to Further Multilingual Progress},
-author={Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Kelly Marchisio, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, and Sara Hooker},
-url={https://cohere.com/research/papers/aya-command-23-8b-and-35b-technical-report-2024-05-23},
-year={2024}
+@misc{aryabumi2024aya,
+title={Aya 23: Open Weight Releases to Further Multilingual Progress},
+author={Viraat Aryabumi and John Dang and Dwarak Talupuru and Saurabh Dash and David Cairuz and Hangyu Lin and Bharat Venkitesh and Madeline Smith and Kelly Marchisio and Sebastian Ruder and Acyr Locatelli and Julia Kreutzer and Nick Frosst and Phil Blunsom and Marzieh Fadaee and Ahmet Üstün and Sara Hooker},
+year={2024},
+eprint={2405.15032},
+archivePrefix={arXiv},
+primaryClass={cs.CL}
 }
 
 ```
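As context for the `inference: false` frontmatter change in this diff: on the Hugging Face Hub, that YAML key controls whether the hosted inference widget is shown for the repo. A minimal sketch (stdlib only; the sample README string and the `frontmatter_flags` helper are hypothetical, and a real project would use a YAML parser) that extracts the frontmatter block from a model card and reads top-level flags like this one:

```python
# Extract the YAML frontmatter block from a model-card README and read
# simple top-level `key: value` pairs. Stdlib only; nested YAML is ignored.
import re

# Hypothetical README reflecting the post-commit frontmatter from the diff.
SAMPLE_README = """---
library_name: transformers
tags:
- GGUF
inference: false
quantized_by: andrijdavid
---
# aya-23-8B-GGUF
"""

def frontmatter_flags(readme: str) -> dict:
    """Return top-level `key: value` pairs from the `---`-delimited block."""
    match = re.match(r"---\n(.*?)\n---\n", readme, re.DOTALL)
    if not match:
        return {}
    flags = {}
    for line in match.group(1).splitlines():
        # Skip list items and indented continuations; keep scalar keys only.
        if ":" in line and not line.startswith(("-", " ")):
            key, _, value = line.partition(":")
            flags[key.strip()] = value.strip()
    return flags

flags = frontmatter_flags(SAMPLE_README)
print(flags["inference"])  # -> false
```

Note that a plain-text parse like this treats `false` as a string; a YAML-aware parser would return a boolean.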