---
datasets:
- abisee/cnn_dailymail
language:
- en
---

# Model Card for Model ID

This model card aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).

## Model Details

### Model Description

- **Developed by:** A. Britez
- **Model type:** RoBERTa
- **Language(s) (NLP):** English (EN)

### Model Sources [optional]

- **Repository:** https://github.com/abrtz/baby-lm

## Uses

### Direct Use

[More Information Needed]

## Bias, Risks, and Limitations

[More Information Needed]

### Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information is needed for further recommendations.

## How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

## Training Details

### Training Data

Datasets:

- [abisee/cnn_dailymail](https://huggingface.co/datasets/abisee/cnn_dailymail)

### Training Procedure

#### Preprocessing [optional]

[More Information Needed]

#### Training Hyperparameters

- **Training regime:** [More Information Needed]

#### Speeds, Sizes, Times [optional]

[More Information Needed]

## Evaluation

### Testing Data, Factors & Metrics

#### Testing Data

[ewok-core/ewok-core-1.0](https://huggingface.co/datasets/ewok-core/ewok-core-1.0)

#### Factors

[More Information Needed]

#### Metrics

[More Information Needed]

### Results

[More Information Needed]

#### Summary

## Model Examination [optional]

[More Information Needed]

## Environmental Impact

- **Hardware Type:** A100-SXM4-40GB
- **Hours used:** 1.0 hours
- **Cloud Provider:** Google Cloud Platform
- **Compute Region:** europe-west4
- **Carbon Emitted:** Total emissions are estimated to be 0.14 kgCO$_2$eq, of which 100 percent was directly offset by the cloud provider.
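
As a rough cross-check of the figure above, emissions of this kind are commonly estimated as energy used multiplied by the carbon intensity of the local grid. The sketch below is illustrative only: the 250 W power draw and the 0.57 kgCO$_2$eq/kWh grid intensity are assumptions that do not appear in this card, and `estimate_emissions_kg` is a hypothetical helper, not part of any library.

```python
def estimate_emissions_kg(power_watts: float, hours: float,
                          intensity_kg_per_kwh: float) -> float:
    """Estimate emissions as energy (kWh) times grid carbon intensity."""
    energy_kwh = power_watts / 1000.0 * hours
    return energy_kwh * intensity_kg_per_kwh


# Hours used comes from this card; the other two values are ASSUMED
# for illustration and are not stated anywhere in the card.
HOURS_USED = 1.0
ASSUMED_POWER_WATTS = 250.0   # assumed average GPU power draw
ASSUMED_INTENSITY = 0.57      # assumed kgCO2eq per kWh for the region

estimate = estimate_emissions_kg(ASSUMED_POWER_WATTS, HOURS_USED, ASSUMED_INTENSITY)
print(f"{estimate:.2f} kgCO2eq")
```

Under these assumed inputs the estimate is of the same order as the reported value; with a different power draw or grid intensity the result scales linearly.
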
Estimations were conducted using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).

## Technical Specifications [optional]

### Model Architecture and Objective

RoBERTa for masked language modeling (MLM)

## Glossary [optional]

BabyLM