some one

some1nostr
ยท

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago
some1nostr/Nostr-Llama-3.1-8B
updated a model about 1 month ago
some1nostr/Ostrich-Llama-3-70B
updated a model about 1 month ago
some1nostr/Ostrich-70B
View all activity

Organizations

None yet

some1nostr's activity

upvoted an article 2 months ago
New activity in some1nostr/Ostrich-70B 3 months ago

BlackSheep

3
#1 opened 3 months ago by
TroyDoesAI
upvoted an article 7 months ago
view article
Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

By leonardlin โ€ข
โ€ข 50
reacted to singhsidhukuldeep's post with ๐Ÿš€ 7 months ago
view post
Post
1370
๐Ÿ“ˆ One of the biggest changes in Llama 3 was the training dataset, which grew by 7X over Llama 2 (2T to 15T tokens) ๐Ÿš€

While Meta did not open source the dataset, it sparked a thought... what would happen if everyone had access to a big, high-quality dataset? ๐Ÿค”

To address that, in April this year, @huggingface released FineWeb, a 15T token open-source dataset ๐ŸŒ

And now they are releasing FineWeb Technical Report and FineWeb Edu ๐Ÿ“š

๐Ÿ† 15T tokens in FineWeb outperforming other open datasets
๐ŸŽ“ 1.3T highest-quality educational dataset FineWeb-Edu
๐Ÿ“˜ 5.4T high-quality educational tokens in FineWeb-Edu-2

FineWeb Edu outperforms other datasets on MMLU, ARC, OpenBookQA ๐Ÿ“ˆ

ODC-By 1.0 license ๐Ÿ“œ

Report: HuggingFaceFW/blogpost-fineweb-v1