---
license: llama3
datasets:
- mims-harvard/ProCyon-Instruct
language:
- en
base_model:
- meta-llama/Llama-3.1-8B
tags:
- biology
- protein
---
ProCyon-Full
ProCyon-Full is a multimodal foundation model for protein phenotypes. It combines a large language model with protein encoders, so a single input can interleave free text and proteins. The model is instruction-tuned on the full ProCyon-Instruct dataset.
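To make "interleaved free text and proteins" concrete, here is a minimal, illustrative sketch of how a prompt might be split into ordered text and protein segments before each segment is routed to the language model or to a protein encoder. It is not ProCyon's actual input format or API: the `<|protein|>` placeholder, the `TextSegment` and `ProteinSegment` classes, and the `interleave` function are all hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import List, Union

# Hypothetical placeholder marking where a protein appears in the prompt.
# This is NOT ProCyon's real token; it is an assumption for illustration.
PLACEHOLDER = "<|protein|>"


@dataclass
class TextSegment:
    text: str  # free text, which would go to the language model's tokenizer


@dataclass
class ProteinSegment:
    sequence: str  # amino-acid sequence, which would go to a protein encoder


Segment = Union[TextSegment, ProteinSegment]


def interleave(prompt: str, proteins: List[str]) -> List[Segment]:
    """Split a prompt on PLACEHOLDER and slot one protein into each gap."""
    parts = prompt.split(PLACEHOLDER)
    if len(parts) - 1 != len(proteins):
        raise ValueError(
            f"prompt has {len(parts) - 1} placeholders "
            f"but {len(proteins)} proteins were given"
        )
    segments: List[Segment] = []
    for i, part in enumerate(parts):
        if part:  # skip empty text between adjacent placeholders
            segments.append(TextSegment(part))
        if i < len(proteins):
            segments.append(ProteinSegment(proteins[i]))
    return segments


# Example: one protein embedded in a free-text question.
segments = interleave(
    "Describe the function of <|protein|> in humans.",
    ["MKTAYIAKQR"],  # made-up sequence fragment, for illustration only
)
```

The resulting list preserves the order of the original prompt, which is the property an interleaved multimodal input needs: each protein stays at the position where the text refers to it.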
For more information on the model design, training, and validation, please see the overview page or the paper.
Additional versions of the model are available as ProCyon-Split and ProCyon-Bind.
Built with Meta Llama 3