RaphaelMourad commited on
Commit
2e58141
1 Parent(s): 790be30

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ tags:
13
 
14
  The Mistral-DNA-v1-138M-virus Large Language Model (LLM) is a pretrained generative DNA text model with 17.31M parameters x 8 experts = 138.5M parameters.
15
  It is derived from Mistral-7B-v0.1 model, which was simplified for DNA: the number of layers and the hidden size were reduced.
16
- The model was pretrained using around 15071 viruses > 1kb.
17
 
18
  Virus genome database was downloaded from https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/virus?SeqType_s=Genome&VirusLineage_ss=taxid:10239&SourceDB_s=RefSeq.
19
  NB: the DNA sequence was used, not the RNA sequence.
 
13
 
14
  The Mistral-DNA-v1-138M-virus Large Language Model (LLM) is a pretrained generative DNA text model with 17.31M parameters x 8 experts = 138.5M parameters.
15
  It is derived from Mistral-7B-v0.1 model, which was simplified for DNA: the number of layers and the hidden size were reduced.
16
+ The model was pretrained using around 15071 viruses > 1kb. Virus genomes were split into 1kb sequences.
17
 
18
  Virus genome database was downloaded from https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/virus?SeqType_s=Genome&VirusLineage_ss=taxid:10239&SourceDB_s=RefSeq.
19
  NB: the DNA sequence was used, not the RNA sequence.