{
  "emb_size": 128,
  "hidden_size": 807,
  "layers_num": 9,
  "kernel_size": 4,
  "block_size": 2,
  "max_seq_length": 1024,
  "dropout": 0.1,
  "data_processor": "lm",
  "embedding": ["word"],
  "remove_embedding_layernorm": true,
  "encoder": "gatedcnn",
  "target": ["lm"]
}