ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.08218
  4. Cited By
VizWiz Grand Challenge: Answering Visual Questions from Blind People
v1v2v3v4 (latest)

VizWiz Grand Challenge: Answering Visual Questions from Blind People

22 February 2018
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
    CoGe
ArXiv (abs)PDFHTML

Papers citing "VizWiz Grand Challenge: Answering Visual Questions from Blind People"

50 / 573 papers shown
Title
InfographicVQA
InfographicVQA
Minesh Mathew
Viraj Bagal
Rubèn Pérez Tito
Dimosthenis Karatzas
Ernest Valveny
C. V. Jawahar
112
242
0
26 Apr 2021
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task
  Feasibility in Interactive Visual Environments
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments
Andrea Burns
Deniz Arsan
Sanjna Agrawal
Ranjitha Kumar
Kate Saenko
Bryan A. Plummer
LRM
123
23
0
17 Apr 2021
Dealing with Missing Modalities in the Visual Question Answer-Difference
  Prediction Task through Knowledge Distillation
Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation
Jae-Won Cho
Dong-Jin Kim
Jinsoo Choi
Yunjae Jung
In So Kweon
VLM
57
17
0
13 Apr 2021
Towards a Collective Agenda on AI for Earth Science Data Analysis
Towards a Collective Agenda on AI for Earth Science Data Analysis
D. Tuia
R. Roscher
Jan Dirk Wegner
Nathan Jacobs
Xiaoxiang Zhu
Gustau Camps-Valls
AI4CE
82
70
0
11 Apr 2021
ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition
ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition
Daniela Massiceti
L. Zintgraf
J. Bronskill
Lida Theodorou
Matthew Tobias Harris
Edward Cutrell
C. Morrison
Katja Hofmann
Simone Stumpf
191
45
0
08 Apr 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Domain-robust VQA with diverse datasets and methods but no target labels
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
129
23
0
29 Mar 2021
Visual Question Answering: which investigated applications?
Visual Question Answering: which investigated applications?
Silvio Barra
Carmen Bisogni
M. De Marsico
S. Ricciardi
80
38
0
04 Mar 2021
Less is More: ClipBERT for Video-and-Language Learning via Sparse
  Sampling
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Jie Lei
Linjie Li
Luowei Zhou
Zhe Gan
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
CLIP
179
666
0
11 Feb 2021
VisualMRC: Machine Reading Comprehension on Document Images
VisualMRC: Machine Reading Comprehension on Document Images
Ryota Tanaka
Kyosuke Nishida
Sen Yoshida
101
146
0
27 Jan 2021
Unanswerable Questions about Images and Texts
Unanswerable Questions about Images and Texts
E. Davis
79
12
0
25 Jan 2021
Understanding the Effect of Out-of-distribution Examples and Interactive
  Explanations on Human-AI Decision Making
Understanding the Effect of Out-of-distribution Examples and Interactive Explanations on Human-AI Decision Making
Han Liu
Vivian Lai
Chenhao Tan
171
121
0
13 Jan 2021
MSD: Saliency-aware Knowledge Distillation for Multimodal Understanding
MSD: Saliency-aware Knowledge Distillation for Multimodal Understanding
Woojeong Jin
Maziar Sanjabi
Shaoliang Nie
L Tan
Xiang Ren
Hamed Firooz
30
6
0
06 Jan 2021
Detecting Hate Speech in Memes Using Multimodal Deep Learning
  Approaches: Prize-winning solution to Hateful Memes Challenge
Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Riza Velioglu
J. Rose
VLM
50
87
0
23 Dec 2020
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Zhengyuan Yang
Yijuan Lu
Jianfeng Wang
Xi Yin
D. Florêncio
Lijuan Wang
Cha Zhang
Lei Zhang
Jiebo Luo
VLM
107
144
0
08 Dec 2020
CapWAP: Captioning with a Purpose
CapWAP: Captioning with a Purpose
Adam Fisch
Kenton Lee
Ming-Wei Chang
J. Clark
Regina Barzilay
53
11
0
09 Nov 2020
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question
  Answering
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
Zanxia Jin
Heran Wu
Chun Yang
Fang Zhou
Jingyan Qin
Lei Xiao
Xu-Cheng Yin
88
31
0
24 Oct 2020
Literature Review of Computer Tools for the Visually Impaired: a focus
  on Search Engines
Literature Review of Computer Tools for the Visually Impaired: a focus on Search Engines
Guy Meyer
Alan Wassyng
M. Lawford
Kourosh Sabri
S. Shirani
8
2
0
21 Oct 2020
Answer-checking in Context: A Multi-modal FullyAttention Network for
  Visual Question Answering
Answer-checking in Context: A Multi-modal FullyAttention Network for Visual Question Answering
Hantao Huang
Tao Han
Wei Han
D. Yap
Cheng-Ming Chiang
28
4
0
17 Oct 2020
Vision Skills Needed to Answer Visual Questions
Vision Skills Needed to Answer Visual Questions
Xiaoyu Zeng
Yanan Wang
Tai-Yin Chiu
Nilavra Bhattacharya
Danna Gurari
66
18
0
07 Oct 2020
Regularizing Attention Networks for Anomaly Detection in Visual Question
  Answering
Regularizing Attention Networks for Anomaly Detection in Visual Question Answering
Doyup Lee
Yeongjae Cheon
Wook-Shin Han
AAMLOOD
44
16
0
21 Sep 2020
Ground-truth or DAER: Selective Re-query of Secondary Information
Ground-truth or DAER: Selective Re-query of Secondary Information
Stephan J. Lemmer
Jason J. Corso
55
4
0
16 Sep 2020
Visual Question Answering on Image Sets
Visual Question Answering on Image Sets
Ankan Bansal
Yuting Zhang
Rama Chellappa
CoGe
158
44
0
27 Aug 2020
Document Visual Question Answering Challenge 2020
Document Visual Question Answering Challenge 2020
Minesh Mathew
Rubèn Pérez Tito
Dimosthenis Karatzas
R. Manmatha
C. V. Jawahar
59
16
0
20 Aug 2020
Towards Ecologically Valid Research on Language User Interfaces
Towards Ecologically Valid Research on Language User Interfaces
H. D. Vries
Dzmitry Bahdanau
Christopher D. Manning
291
52
0
28 Jul 2020
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data
Michael Cogswell
Jiasen Lu
Rishabh Jain
Stefan Lee
Devi Parikh
Dhruv Batra
VLMEgoV
78
15
0
24 Jul 2020
Reducing Language Biases in Visual Question Answering with
  Visually-Grounded Question Encoder
Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder
K. Gouthaman
Anurag Mittal
98
79
0
13 Jul 2020
Visual Question Answering as a Multi-Task Problem
Visual Question Answering as a Multi-Task Problem
A. E. Pollard
J. Shapiro
19
7
0
03 Jul 2020
Exploring Weaknesses of VQA Models through Attribution Driven Insights
Exploring Weaknesses of VQA Models through Attribution Driven Insights
Shaunak Halbe
42
2
0
11 Jun 2020
Structured Multimodal Attentions for TextVQA
Structured Multimodal Attentions for TextVQA
Chenyu Gao
Qi Zhu
Peng Wang
Hui Li
Yuliang Liu
Anton Van Den Hengel
Qi Wu
99
60
0
01 Jun 2020
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
109
612
0
10 May 2020
Are we pretraining it right? Digging deeper into visio-linguistic
  pretraining
Are we pretraining it right? Digging deeper into visio-linguistic pretraining
Amanpreet Singh
Vedanuj Goswami
Devi Parikh
VLM
78
48
0
19 Apr 2020
An Entropy Clustering Approach for Assessing Visual Question Difficulty
An Entropy Clustering Approach for Assessing Visual Question Difficulty
K. Terao
Toru Tamaki
B. Raytchev
K. Kaneda
Shuníchi Satoh
OODAAML
58
1
0
12 Apr 2020
Rephrasing visual questions by specifying the entropy of the answer
  distribution
Rephrasing visual questions by specifying the entropy of the answer distribution
K. Terao
Toru Tamaki
B. Raytchev
K. Kaneda
S. Satoh
OOD
44
2
0
10 Apr 2020
TuringAdvice: A Generative and Dynamic Evaluation of Language Use
TuringAdvice: A Generative and Dynamic Evaluation of Language Use
Rowan Zellers
Ari Holtzman
Elizabeth Clark
Lianhui Qin
Ali Farhadi
Yejin Choi
ELMLRM
65
14
0
07 Apr 2020
SHOP-VRB: A Visual Reasoning Benchmark for Object Perception
SHOP-VRB: A Visual Reasoning Benchmark for Object Perception
Michal Nazarczuk
K. Mikolajczyk
72
21
0
06 Apr 2020
Assessing Image Quality Issues for Real-World Problems
Assessing Image Quality Issues for Real-World Problems
Tai-Yin Chiu
Yinan Zhao
Danna Gurari
137
54
0
27 Mar 2020
Egoshots, an ego-vision life-logging dataset and semantic fidelity
  metric to evaluate diversity in image captioning models
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models
Pranav Agarwal
Alejandro Betancourt
V. Panagiotou
Natalia Díaz Rodríguez
EGVM
82
10
0
26 Mar 2020
RSVQA: Visual Question Answering for Remote Sensing Data
RSVQA: Visual Question Answering for Remote Sensing Data
Sylvain Lobry
Diego Marcos
J. Murray
D. Tuia
126
223
0
16 Mar 2020
Hand-Priming in Object Localization for Assistive Egocentric Vision
Hand-Priming in Object Localization for Assistive Egocentric Vision
Kyungjun Lee
Abhinav Shrivastava
Hernisa Kacorri
EgoV
63
16
0
28 Feb 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual
  Question Answering
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
Xinyu Wang
Yuliang Liu
Chunhua Shen
Chun Chet Ng
Canjie Luo
Lianwen Jin
C. Chan
Anton Van Den Hengel
Liangwei Wang
101
97
0
24 Feb 2020
Captioning Images Taken by People Who Are Blind
Captioning Images Taken by People Who Are Blind
Danna Gurari
Yinan Zhao
Meng Zhang
Nilavra Bhattacharya
105
184
0
20 Feb 2020
In Defense of Grid Features for Visual Question Answering
In Defense of Grid Features for Visual Question Answering
Huaizu Jiang
Ishan Misra
Marcus Rohrbach
Erik Learned-Miller
Xinlei Chen
OODObjD
85
320
0
10 Jan 2020
VizWiz Dataset Browser: A Tool for Visualizing Machine Learning Datasets
VizWiz Dataset Browser: A Tool for Visualizing Machine Learning Datasets
Nilavra Bhattacharya
Danna Gurari
31
6
0
19 Dec 2019
Deep Bayesian Active Learning for Multiple Correct Outputs
Deep Bayesian Active Learning for Multiple Correct Outputs
Khaled Jedoui
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
BDLOODUQCV
93
14
0
02 Dec 2019
Temporal Reasoning via Audio Question Answering
Temporal Reasoning via Audio Question Answering
Haytham M. Fayek
Justin Johnson
65
54
0
21 Nov 2019
Iterative Answer Prediction with Pointer-Augmented Multimodal
  Transformers for TextVQA
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA
Ronghang Hu
Amanpreet Singh
Trevor Darrell
Marcus Rohrbach
94
197
0
14 Nov 2019
Open-Ended Visual Question Answering by Multi-Modal Domain Adaptation
Open-Ended Visual Question Answering by Multi-Modal Domain Adaptation
Yiming Xu
Lin Chen
Zhongwei Cheng
Lixin Duan
Jiebo Luo
OOD
86
24
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAIAI4TS
122
338
0
10 Nov 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic
  Labels Improve Image Captioning and Visual Question Answering
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Soravit Changpinyo
Bo Pang
Piyush Sharma
Radu Soricut
ObjD
60
20
0
04 Sep 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Iro Laina
Christian Rupprecht
Nassir Navab
SSL
76
103
0
25 Aug 2019
Previous
123...101112
Next