Support for multiple image/text pairs and/or in-context learning

#17
by kushinm - opened

Is there a transparent way to interleave text and image tokens to test in-context learning abilities for the model?

Sign up or log in to comment