Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zaydzuhri 's Collections
Token Order Prediction
Softpick

Token Order Prediction

updated Sep 1

Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"

Upvote
-

  • Predicting the Order of Upcoming Tokens Improves Language Modeling

    Paper • 2508.19228 • Published Aug 26 • 22

  • zaydzuhri/vanilla-340M-4096-model

    0.4B • Updated Sep 1 • 9

  • zaydzuhri/mtp-340M-4096-model

    0.4B • Updated Sep 1 • 5

  • zaydzuhri/top-340M-4096-model

    0.4B • Updated Sep 1 • 10 • 1

  • zaydzuhri/vanilla-1.8B-4096-model

    2B • Updated Sep 1 • 11

  • zaydzuhri/mtp-1.8B-4096-model

    2B • Updated Sep 1 • 9

  • zaydzuhri/top-1.8B-4096-model

    2B • Updated Sep 1 • 17

  • zaydzuhri/vanilla-7B-4096-model

    7B • Updated Sep 1 • 2

  • zaydzuhri/mtp-7B-4096-model

    7B • Updated Sep 1 • 13

  • zaydzuhri/top-7B-4096-model

    7B • Updated Sep 1 • 17
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs