ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.12090
  4. Cited By
Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text
  Recognition

Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition

26 July 2021
A. Bhunia
Aneeshan Sain
Amandeep Kumar
S. Ghose
Pinaki Nath Chowdhury
Yi-Zhe Song
ArXivPDFHTML

Papers citing "Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition"

11 / 11 papers shown
Title
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
26
0
0
07 May 2025
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
49
35
0
30 Aug 2023
Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings
Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings
A. Bhunia
Subhadeep Koley
Amandeep Kumar
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
52
19
0
20 Mar 2023
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for
  Scene Text Spotting
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
Shancheng Fang
Zhendong Mao
Hongtao Xie
Yuxin Wang
C. Yan
Yongdong Zhang
32
53
0
19 Nov 2022
Masked Vision-Language Transformers for Scene Text Recognition
Masked Vision-Language Transformers for Scene Text Recognition
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Zhang
32
3
0
09 Nov 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
26
169
0
14 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for
  Self-Supervised Text Recognition
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
33
55
0
01 Jul 2022
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
Xiaojie Chu
Yongtao Wang
27
2
0
06 Apr 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
32
25
0
07 Mar 2022
Visual-Semantic Transformer for Scene Text Recognition
Visual-Semantic Transformer for Scene Text Recognition
Xin Tang
Yongquan Lai
Ying Liu
Yuanyuan Fu
Rui Fang
ViT
26
8
0
02 Dec 2021
Iterative Visual Reasoning Beyond Convolutions
Iterative Visual Reasoning Beyond Convolutions
Xinlei Chen
Li-Jia Li
Li Fei-Fei
Abhinav Gupta
LRM
GNN
40
213
0
29 Mar 2018
1