AhmedSSabir
/

BERT-CNN-Visual-Semantic

Model card Files Files and versions Community

AhmedSSabir commited on Jun 9, 2022

Commit

3946810

•

1 Parent(s): 19be54f

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -1,6 +1,8 @@
 # Visual semantic with BERT-CNN
  To take advantage of the overlapping between the visual context and the caption, and to extract global information from each visual, we use BERT  as an embedding layer followed by a shallow CNN (tri-gram kernel) (Kim,204).
 This model can be used to assign an object-to-caption relatedness score, which is valuable for

 # Visual semantic with BERT-CNN
  To take advantage of the overlapping between the visual context and the caption, and to extract global information from each visual, we use BERT  as an embedding layer followed by a shallow CNN (tri-gram kernel) (Kim,204).
+ For datasets that are less than 100K please have look at our [shallow model](https://github.com/ahmedssabir/Semantic-Relatedness-Based-Reranker-for-Text-Spotting)
 This model can be used to assign an object-to-caption relatedness score, which is valuable for