ITS NOT REAL

#11

by rombodawg - opened May 2, 2024

Discussion

rombodawg

May 2, 2024

I hope you know, I only criticize you because I know you can do better. Prove that the model can remember every single token across the entire context and explain all of them at once, and then the extended context is worth it. Otherwise it's just useless.
If the model can summarize an entire book that's worth 1048k tokens and rewrite it into an essay, then you did your job right. If it doesn't remember anything but one idea, then the model is a failure and needs further work.

michaelfeil

Gradient AI org May 2, 2024

Thanks for the meme, and the motivating words.

Just published a blog: https://gradient.ai/blog/the-haystack-matters-for-niah-evals
We are looking into ways to improve alignment for long context w.r.t. special tokens.
"entire book that's worth 1048k tokens": there might be only a few books of that size. That's around the size of the Bible (Old and New Testament combined) or other books of 1.5k pages. What we would need is a copyleft book of 1500 pages, published after the Llama 3 training data cutoff, in plain .txt. Since you are asking for it, do you happen to have such data, @rombodawg?

rombodawg

May 2, 2024

edited May 2, 2024

@michaelfeil You can also copy multiple books into an LLM, either a series of novels or different books, and ask the model to summarize all of the books in separate paragraphs, or write an essay describing all the books in detail and what impact they made. You can easily think of multiple benchmarks based on an LLM reading through a large number of books and writing something detailed about them, just by doing some critical thinking. 😉
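
For anyone who wants to try the multi-book idea above, here is a rough sketch of how such a prompt could be assembled. It is not something either poster shared: the function names, the `=== BOOK: ... ===` delimiter, and the 4-characters-per-token estimate are all assumptions made for illustration, and a real run should count tokens with the model's own tokenizer.

```python
from typing import Dict, List

# Assumption: ~4 characters per token is only a crude heuristic,
# not a real tokenizer count.
CHARS_PER_TOKEN = 4
# Context size mentioned in the thread (1048k tokens).
CONTEXT_BUDGET_TOKENS = 1_048_000


def estimate_tokens(text: str) -> int:
    """Crude token estimate based on character count."""
    return len(text) // CHARS_PER_TOKEN + 1


def build_multibook_prompt(
    books: Dict[str, str], budget_tokens: int = CONTEXT_BUDGET_TOKENS
) -> str:
    """Concatenate books (title -> plain text) until the budget is reached,
    then append an instruction asking for one summary paragraph per book."""
    parts: List[str] = []
    included: List[str] = []
    used = 0
    for title, text in books.items():
        cost = estimate_tokens(text)
        if used + cost > budget_tokens:
            break  # stop at the first book that no longer fits
        parts.append(f"=== BOOK: {title} ===\n{text}\n")
        included.append(title)
        used += cost
    if not included:
        raise ValueError("No book fits into the given token budget.")
    instruction = (
        "Summarize each of the books above in its own paragraph, "
        "in this order: " + ", ".join(included) + "."
    )
    return "\n".join(parts) + "\n" + instruction


def books_mentioned(response: str, titles: List[str]) -> List[str]:
    """Very rough check: which titles appear anywhere in the model's answer."""
    lowered = response.lower()
    return [t for t in titles if t.lower() in lowered]
```

Checking whether each title shows up in the answer only tells you the model did not skip a book entirely. Judging whether the summaries are actually detailed and correct would still need a human or a stronger grader.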