etemiz 
posted an update 2 days ago
Having bad LLMs is OK, and they can be put to good use. They can help us find ideas that work more quickly.

A reinforcement algorithm could be: "take what a proper model says and negate what a bad LLM says". Or, in a mixture-of-agents setup, we could refute the bad LLM's output and combine that refutation with the good LLM's output.
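
One possible way to read the two ideas above, as a rough sketch rather than a definitive method: treat the good model's answer as the "chosen" example and the bad model's answer as the "rejected" example in a preference pair, and, for the mixture-of-agents variant, build a prompt that asks an aggregator to refute the bad answer while keeping the good one. Everything named here (`PreferencePair`, `build_preference_pairs`, `build_refutation_prompt`, and the toy models) is hypothetical and only for illustration; real models would replace the toy callables.

```python
from dataclasses import dataclass
from typing import Callable, List

# Assumption: a "model" is any callable that maps a prompt to a text answer.
Model = Callable[[str], str]


@dataclass
class PreferencePair:
    prompt: str
    chosen: str    # answer from the good ("proper") model, to be reinforced
    rejected: str  # answer from the bad model, to be pushed away from


def build_preference_pairs(
    prompts: List[str], good_model: Model, bad_model: Model
) -> List[PreferencePair]:
    """Pair each good answer with the bad answer for the same prompt."""
    pairs = []
    for prompt in prompts:
        good = good_model(prompt)
        bad = bad_model(prompt)
        # If both models agree, the pair carries no contrast, so skip it.
        if good.strip() == bad.strip():
            continue
        pairs.append(PreferencePair(prompt=prompt, chosen=good, rejected=bad))
    return pairs


def build_refutation_prompt(question: str, good_answer: str, bad_answer: str) -> str:
    """Mixture-of-agents variant: ask an aggregator to refute the bad answer
    and combine that refutation with the good answer."""
    return (
        f"Question: {question}\n\n"
        f"Trusted answer:\n{good_answer}\n\n"
        f"Unreliable answer (explain why it is wrong):\n{bad_answer}\n\n"
        "Write a final answer that keeps the trusted answer and "
        "explicitly refutes the unreliable one."
    )


# Toy stand-ins for real LLMs, purely hypothetical.
def toy_good_model(prompt: str) -> str:
    return "Drink water regularly."


def toy_bad_model(prompt: str) -> str:
    return "Never drink water."


if __name__ == "__main__":
    question = "How do I stay hydrated?"
    for pair in build_preference_pairs([question], toy_good_model, toy_bad_model):
        print(pair)
    print(build_refutation_prompt(
        question, toy_good_model(question), toy_bad_model(question)
    ))
```

The resulting pairs have the shape that preference-based fine-tuning methods typically expect (prompt, chosen, rejected), so the bad LLM acts as a cheap source of negative examples.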

This could mean having two wings (or more) in search of "ideas that work for most people most of the time".

The Open Mic
OpenAI hosts an open mic night.
Comedian: "Why did OpenAI cross the road?"
Audience: "Why?"
Comedian: "To close the door on the other side."
