Spaces:

THUDM
/

LongWriter

Runtime error

Benchmarking concerns

by Tech-Meld - opened Aug 19

Aug 19

Dear Authors,
Have you ever considered benchmarking this model ? Do you consider doing that ? Are there anything that the community can work on to improve benchmarking long sequences of AI generated text ?

bys0318

Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University org Aug 19

Good question! We release two benchmarks along our work: LongBench-Write and LongWrite-Ruler. Please see our github Repo for more details: https://github.com/THUDM/LongWriter?tab=readme-ov-file#evaluation

Tech-Meld

Aug 19

That's really helpful, thanks!

zRzRzRzRzRzRzR changed discussion status to closed Aug 22

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment