9 1

ali issa

ali-issa

AI & ML interests

None yet

Recent Activity

published a dataset 1 day ago

ali-issa/eng_opus_dataset

updated a dataset 3 days ago

ali-issa/arb_tokenized_filtered_dataset_with_eng-bpe-tokenizer-32768

published a dataset 3 days ago

ali-issa/arb_tokenized_filtered_dataset_with_eng-bpe-tokenizer-32768

View all activity

Organizations

ali-issa's activity

published a dataset 1 day ago

ali-issa/eng_opus_dataset

Viewer • Updated Nov 6, 2024 • 175M • 99

updated a dataset 3 days ago

ali-issa/arb_tokenized_filtered_dataset_with_eng-bpe-tokenizer-32768

Viewer • Updated 3 days ago • 142M • 3

published a dataset 3 days ago

ali-issa/arb_tokenized_filtered_dataset_with_eng-bpe-tokenizer-32768

Viewer • Updated 3 days ago • 142M • 3

published a model 3 days ago

ali-issa/arb-bpe-tokenizer-32768

Updated Dec 13, 2024

updated a dataset 10 days ago

ali-issa/arb_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli

Updated 10 days ago • 47

updated a dataset 13 days ago

ali-issa/eng_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli

Updated 13 days ago • 9

updated a model about 2 months ago

ali-issa/arb-bpe-tokenizer-32768

Updated Dec 13, 2024

upvoted an article about 2 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 14

New activity in riotu-lab/Aranizer-SP-32k 2 months ago

LLM Evaluation technique

#1 opened 2 months ago by

ali-issa

New activity in CohereForAI/aya_collection 3 months ago

The number of rows varies across languages for the same translated dataset, and some translations are inconsistent between languages.

#11 opened 3 months ago by

ali-issa