RaphaelMourad
commited on
Commit
•
2e58141
1
Parent(s):
790be30
Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ tags:
|
|
13 |
|
14 |
The Mistral-DNA-v1-138M-virus Large Language Model (LLM) is a pretrained generative DNA text model with 17.31M parameters x 8 experts = 138.5M parameters.
|
15 |
It is derived from Mistral-7B-v0.1 model, which was simplified for DNA: the number of layers and the hidden size were reduced.
|
16 |
-
The model was pretrained using around 15071 viruses > 1kb.
|
17 |
|
18 |
Virus genome database was downloaded from https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/virus?SeqType_s=Genome&VirusLineage_ss=taxid:10239&SourceDB_s=RefSeq.
|
19 |
NB: the DNA sequence was used, not the RNA sequence.
|
|
|
13 |
|
14 |
The Mistral-DNA-v1-138M-virus Large Language Model (LLM) is a pretrained generative DNA text model with 17.31M parameters x 8 experts = 138.5M parameters.
|
15 |
It is derived from Mistral-7B-v0.1 model, which was simplified for DNA: the number of layers and the hidden size were reduced.
|
16 |
+
The model was pretrained using around 15071 viruses > 1kb. Virus genomes were split into 1kb sequences.
|
17 |
|
18 |
Virus genome database was downloaded from https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/virus?SeqType_s=Genome&VirusLineage_ss=taxid:10239&SourceDB_s=RefSeq.
|
19 |
NB: the DNA sequence was used, not the RNA sequence.
|