|
--- |
|
license: llama3 |
|
datasets: |
|
- mims-harvard/ProCyon-Instruct |
|
language: |
|
- en |
|
base_model: |
|
- meta-llama/Llama-3.1-8B |
|
tags: |
|
- biology |
|
- protein |
|
--- |
|
# ProCyon-Full |
|
|
|
ProCyon-Full is a multimodal foundation model for protein phenotypes, which combines a large language model with protein encoders to support inputs of interleaved free text and proteins. |
|
This model is instruction-tuned using the full [ProCyon-Instruct](https://huggingface.co/datasets/mims-harvard/ProCyon-Instruct) dataset. |
|
|
|
For more information on the model design, training, and validation, please see the [overview page](https://zitniklab.hms.harvard.edu/ProCyon/) or the [paper](https://www.biorxiv.org/content/10.1101/2024.12.10.627665v1). |
|
|
|
Additional versions of the model are available as [ProCyon-Split](https://huggingface.co/mims-harvard/ProCyon-Split) and [ProCyon-Bind](https://huggingface.co/mims-harvard/ProCyon-Bind). |
|
|
|
|
|
Built with Meta Llama 3 |
|
|
|
|