Add TF weights

#1
by amyeroberts HF staff - opened

Model converted by the transformers' pt_to_tf CLI.

All converted model outputs and hidden layers were validated against its Pytorch counterpart. Maximum crossload output difference=1.547e-04; Maximum converted output difference=1.547e-04.

All crossload differences

logits: 1.025e-05
hidden_states[0]: 9.060e-06
hidden_states[1]: 7.853e-05
hidden_states[2]: 1.547e-04
hidden_states[3]: 8.881e-05
hidden_states[4]: 1.428e-04

amyeroberts changed pull request status to merged

Sign up or log in to comment