ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.06147
  4. Cited By
NAAQA: A Neural Architecture for Acoustic Question Answering

NAAQA: A Neural Architecture for Acoustic Question Answering

11 June 2021
Jerome Abdelnour
Jean Rouat
G. Salvi
ArXivPDFHTML

Papers citing "NAAQA: A Neural Architecture for Acoustic Question Answering"

15 / 15 papers shown
Title
Audio Retrieval with Natural Language Queries: A Benchmark Study
Audio Retrieval with Natural Language Queries: A Benchmark Study
A. Sophia Koepke
Andreea-Maria Oncescu
João F. Henriques
Zeynep Akata
Samuel Albanie
57
99
0
17 Dec 2021
Visualizing Data using GTSNE
Visualizing Data using GTSNE
Songting Shi
48
155
0
03 Aug 2021
Receptive-field-regularized CNN variants for acoustic scene
  classification
Receptive-field-regularized CNN variants for acoustic scene classification
Khaled Koutini
Hamid Eghbalzadeh
Gerhard Widmer
97
30
0
05 Sep 2019
End-to-End Environmental Sound Classification using a 1D Convolutional
  Neural Network
End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network
Sajjad Abdoli
P. Cardinal
Alessandro Lameiras Koerich
54
269
0
18 Apr 2019
From Recognition to Cognition: Visual Commonsense Reasoning
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
151
877
0
27 Nov 2018
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language
  Understanding
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
Kexin Yi
Jiajun Wu
Chuang Gan
Antonio Torralba
Pushmeet Kohli
J. Tenenbaum
NAI
82
608
0
04 Oct 2018
Don't Just Assume; Look and Answer: Overcoming Priors for Visual
  Question Answering
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
136
585
0
01 Dec 2017
Timbre Analysis of Music Audio Signals with Convolutional Neural
  Networks
Timbre Analysis of Music Audio Signals with Convolutional Neural Networks
Jordi Pons
Olga Slizovskaia
Rong Gong
E. Gómez
Xavier Serra
40
113
0
20 Mar 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary
  Visual Reasoning
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
287
2,367
0
20 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in
  Visual Question Answering
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
320
3,224
0
02 Dec 2016
CNN Architectures for Large-Scale Audio Classification
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
111
2,494
0
29 Sep 2016
Analyzing the Behavior of Visual Question Answering Models
Analyzing the Behavior of Visual Question Answering Models
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
73
312
0
23 Jun 2016
FVQA: Fact-based Visual Question Answering
FVQA: Fact-based Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
77
456
0
17 Jun 2016
Yin and Yang: Balancing and Answering Binary Visual Questions
Yin and Yang: Balancing and Answering Binary Visual Questions
Peng Zhang
Yash Goyal
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
87
352
0
16 Nov 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image
  Question Answering
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wenyuan Xu
78
498
0
21 May 2015
1