Extract attention from model
#15
by
kaustabanv
- opened
- Is there a way to extract the attention values at every BertLayer? It would be useful for interpretability if the attention values could be accessed.
- I'm new to transformers. Can the hidden layer outputs be used for explainability the same way attention values are?
Thanks for your time!
https://huggingface.co/zhihan1996/DNABERT-2-117M/discussions/24
After reading this, I was finally able to extract attention. I hope it helps all of us :D
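For anyone landing here later, a minimal sketch of the standard transformers approach: passing `output_attentions=True` (and `output_hidden_states=True`) to the forward call returns per-layer attention weights and hidden states. This uses a small randomly initialized `BertModel` so it runs without a download; for a real checkpoint you would use `from_pretrained` instead, and note that DNABERT-2's custom attention implementation may need the workaround from the linked thread.

```python
import torch
from transformers import BertConfig, BertModel

# Tiny random BERT just for illustration; replace with
# BertModel.from_pretrained(...) for an actual checkpoint.
config = BertConfig(hidden_size=64, num_hidden_layers=2,
                    num_attention_heads=4, intermediate_size=128)
model = BertModel(config)
model.eval()

input_ids = torch.tensor([[101, 2009, 2003, 102]])  # toy token ids
with torch.no_grad():
    outputs = model(input_ids,
                    output_attentions=True,      # attention from every BertLayer
                    output_hidden_states=True)   # hidden states for each layer

# outputs.attentions: tuple with one tensor per layer,
# each of shape (batch, num_heads, seq_len, seq_len)
print(len(outputs.attentions))        # one entry per BertLayer
print(outputs.attentions[0].shape)
# outputs.hidden_states: embeddings output plus one tensor per layer
print(len(outputs.hidden_states))
```

The attention tensors are softmax-normalized weights per head, so they can be averaged or visualized per layer; the hidden states give the token representations that explainability methods like probing operate on.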