SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 1 day ago • 58
Learning to Generate Unit Tests for Automated Debugging Paper • 2502.01619 • Published 3 days ago • 3
Post: 🚀 Introducing @huggingface Open Deep-Research 💥 In just 24 hours, we built an open-source agent that:
✅ Autonomously browses the web
✅ Searches, scrolls & extracts info
✅ Downloads & manipulates files
✅ Runs calculations on data
55% on the GAIA validation set! Help us improve it! 💡 https://huggingface.co/blog/open-deep-research
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper • 2501.12380 • Published 16 days ago • 81
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 23 days ago • 53
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published 23 days ago • 6
ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning Paper • 2501.06590 • Published 26 days ago • 9
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 28 days ago • 87
Post: Discover all the improvements in the new version of Lighteval: https://huggingface.co/docs/lighteval/
Multitask Prompted Training Enables Zero-Shot Task Generalization Paper • 2110.08207 • Published Oct 15, 2021 • 2
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 28
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 126
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Paper • 2412.04144 • Published Dec 5, 2024 • 4