Word2Pix: Word to Pixel Cross Attention Transformer in Visual Grounding

Word2Pix: Word to Pixel Cross Attention Transformer in Visual Grounding

    ObjD

Papers citing "Word2Pix: Word to Pixel Cross Attention Transformer in Visual Grounding"

28 / 28 papers shown
Title
Layer Normalization
Layer Normalization
437
10,548
0
21 Jul 2016