English
biology
protein
File size: 922 Bytes
6bfc263
 
2c16fed
 
 
 
 
 
 
 
 
6bfc263
2c16fed
 
 
 
 
caf27d0
2c16fed
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
license: llama3
datasets:
- mims-harvard/ProCyon-Instruct
language:
- en
base_model:
- meta-llama/Llama-3.1-8B
tags:
- biology
- protein
---
# ProCyon-Full

ProCyon-Full is a multimodal foundation model for protein phenotypes, which combines a large language model with protein encoders to support inputs of interleaved free text and proteins.
This model is instruction-tuned using the full [ProCyon-Instruct](https://huggingface.co/datasets/mims-harvard/ProCyon-Instruct) dataset.

For more information on the model design, training, and validation, please see the [overview page](https://zitniklab.hms.harvard.edu/ProCyon/) or the [paper](https://www.biorxiv.org/content/10.1101/2024.12.10.627665v1).

Additional versions of the model are available as [ProCyon-Split](https://huggingface.co/mims-harvard/ProCyon-Split) and [ProCyon-Bind](https://huggingface.co/mims-harvard/ProCyon-Bind).


Built with Meta Llama 3