Fix wrong tensor shape leading to word duplication

#11

There are 3 initial context tokens, but the tensor shape is set to [1, 2]. After each inference step the next token is written to the correct position, yet the shape stays one element shorter than it should be.

This leads to duplicated words and apparent hallucinations, even with the provided audio sample.
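For illustration, here is a minimal sketch of the off-by-one in NumPy; the variable names and token values are assumptions for the example, not the repository's actual code:

```python
import numpy as np

# Assume 3 initial context tokens (placeholder IDs, not real ones).
context_tokens = [50258, 50259, 50359]

# Buggy allocation: shape [1, 2] is one column short, so the last
# context token overwrites an earlier position.
# tokens = np.zeros((1, len(context_tokens) - 1), dtype=np.int64)

# Fixed allocation: the shape matches the number of context tokens.
tokens = np.zeros((1, len(context_tokens)), dtype=np.int64)
tokens[0, :] = context_tokens

# Each decode step should grow the sequence by exactly one column.
# If the shape stays one short, the new token lands on top of the
# previous one, which is what produces the word duplication.
next_token = 123  # placeholder for the argmax of the step's logits
tokens = np.concatenate([tokens, [[next_token]]], axis=1)
```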
