ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1605.09782
  4. Cited By
Adversarial Feature Learning

Adversarial Feature Learning

31 May 2016
Jiasen Lu
Philipp Krahenbuhl
Trevor Darrell
    GAN
ArXivPDFHTML

Papers citing "Adversarial Feature Learning"

50 / 642 papers shown
Title
Object Attribute Matters in Visual Question Answering
Object Attribute Matters in Visual Question Answering
Peize Li
Q. Si
Peng Fu
Zheng Lin
Yan Wang
35
0
0
20 Dec 2023
Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual
  Question Answering
Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering
Chengxiang Yin
Zhengping Che
Kun Wu
Zhiyuan Xu
Jian Tang
36
0
0
20 Dec 2023
A Dual-way Enhanced Framework from Text Matching Point of View for
  Multimodal Entity Linking
A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking
Shezheng Song
Shan Zhao
Chengyu Wang
Tianwei Yan
Shasha Li
Xiaoguang Mao
Meng Wang
18
6
0
19 Dec 2023
JPIS: A Joint Model for Profile-based Intent Detection and Slot Filling
  with Slot-to-Intent Attention
JPIS: A Joint Model for Profile-based Intent Detection and Slot Filling with Slot-to-Intent Attention
Thinh-Le-Gia Pham
Dat Quoc Nguyen
35
1
0
14 Dec 2023
NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous
  Driving Datasets using Markup Annotations
NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations
Yuichi Inoue
Yuki Yada
Kotaro Tanahashi
Yu Yamaguchi
29
17
0
11 Dec 2023
MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with
  Intent-Slot Co-Attention
MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention
Thinh-Le-Gia Pham
Tran Chi
Dat Quoc Nguyen
VLM
13
5
0
10 Dec 2023
GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging
  Cross-Modal Attention with Large Language Models
GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models
Haicheng Liao
Huanming Shen
Zhenning Li
Chengyue Wang
Guofa Li
Yiming Bie
Chengzhong Xu
39
50
0
06 Dec 2023
Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain
  Adaptation
Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation
Linzi Xing
Quan Tran
Fabian Caba
Franck Dernoncourt
Seunghyun Yoon
Zhaowen Wang
Trung Bui
Giuseppe Carenini
46
1
0
30 Nov 2023
Towards A Unified Neural Architecture for Visual Recognition and
  Reasoning
Towards A Unified Neural Architecture for Visual Recognition and Reasoning
Calvin Luo
Boqing Gong
Ting Chen
Chen Sun
OCL
ObjD
19
1
0
10 Nov 2023
From Image to Language: A Critical Analysis of Visual Question Answering
  (VQA) Approaches, Challenges, and Opportunities
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Md Farhan Ishmam
Md Sakib Hossain Shovon
M. F. Mridha
Nilanjan Dey
40
36
0
01 Nov 2023
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
CAD -- Contextual Multi-modal Alignment for Dynamic AVQA
Asmar Nadeem
Adrian Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
27
9
0
25 Oct 2023
Emergent Communication in Interactive Sketch Question Answering
Emergent Communication in Interactive Sketch Question Answering
Zixing Lei
Yiming Zhang
Yuxin Xiong
Siheng Chen
34
2
0
24 Oct 2023
Exploiting User Comments for Early Detection of Fake News Prior to
  Users' Commenting
Exploiting User Comments for Early Detection of Fake News Prior to Users' Commenting
Qiong Nan
Qiang Sheng
Juan Cao
Yongchun Zhu
Danding Wang
Guang Yang
Jintao Li
Kai Shu
38
8
0
16 Oct 2023
DRIN: Dynamic Relation Interactive Network for Multimodal Entity Linking
DRIN: Dynamic Relation Interactive Network for Multimodal Entity Linking
Shangyu Xing
Fei Zhao
Zhen Wu
Chunhui Li
Jianbing Zhang
Xinyu Dai
33
8
0
09 Oct 2023
Dense Object Grounding in 3D Scenes
Dense Object Grounding in 3D Scenes
Wencan Huang
Daizong Liu
Wei Hu
13
17
0
05 Sep 2023
S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical
  Learning
S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning
Wei Suo
Mengyang Sun
Weisong Liu
Yi-Meng Gao
Peifeng Wang
Yanning Zhang
Qi Wu
LRM
38
7
0
05 Sep 2023
Explaining Vision and Language through Graphs of Events in Space and
  Time
Explaining Vision and Language through Graphs of Events in Space and Time
Mihai Masala
Nicolae Cudlenco
Traian Rebedea
Marius Leordeanu
VLM
54
2
0
29 Aug 2023
Simple Baselines for Interactive Video Retrieval with Questions and
  Answers
Simple Baselines for Interactive Video Retrieval with Questions and Answers
Kaiqu Liang
Samuel Albanie
24
2
0
21 Aug 2023
Diagnosing Human-object Interaction Detectors
Diagnosing Human-object Interaction Detectors
Fangrui Zhu
Yiming Xie
Weidi Xie
Huaizu Jiang
30
7
0
16 Aug 2023
Progressive Spatio-temporal Perception for Audio-Visual Question
  Answering
Progressive Spatio-temporal Perception for Audio-Visual Question Answering
Guangyao Li
Wenxuan Hou
Di Hu
31
26
0
10 Aug 2023
Learning to Model the World with Language
Learning to Model the World with Language
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&Ro
SyDa
35
51
0
31 Jul 2023
BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers
  Models for Vietnamese Visual Question Answering
BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering
Khiem Vinh Tran
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
ViT
31
2
0
28 Jul 2023
FedMEKT: Distillation-based Embedding Knowledge Transfer for Multimodal
  Federated Learning
FedMEKT: Distillation-based Embedding Knowledge Transfer for Multimodal Federated Learning
Huy Q. Le
Minh N. H. Nguyen
Chu Myaet Thwal
Yu Qiao
Chao Zhang
Choong Seon Hong
16
13
0
25 Jul 2023
PAT: Parallel Attention Transformer for Visual Question Answering in
  Vietnamese
PAT: Parallel Attention Transformer for Visual Question Answering in Vietnamese
Nghia Hieu Nguyen
Kiet Van Nguyen
13
2
0
17 Jul 2023
UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image
  Enhancement for Gastrointestinal Visual Question Answering
UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering
T. M. Thai
A. T. Vo
Hao K. Tieu
Linh Bui
T. Nguyen
16
2
0
06 Jul 2023
Multimodal Prompt Retrieval for Generative Visual Question Answering
Multimodal Prompt Retrieval for Generative Visual Question Answering
Timothy Ossowski
Junjie Hu
21
1
0
30 Jun 2023
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual
  Question Answering
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering
A. S. Penamakuri
Manish Gupta
Mithun Das Gupta
Anand Mishra
37
7
0
29 Jun 2023
Emulating Reader Behaviors for Fake News Detection
Emulating Reader Behaviors for Fake News Detection
Junwei Yin
Min Gao
Kai Shu
Zehua Zhao
Yinqiu Huang
Jia Wang
19
2
0
27 Jun 2023
Cross-Language Speech Emotion Recognition Using Multimodal Dual
  Attention Transformers
Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers
Syed Muhammad talha Zaidi
S. Latif
Junaid Qadir
34
8
0
23 Jun 2023
FedMultimodal: A Benchmark For Multimodal Federated Learning
FedMultimodal: A Benchmark For Multimodal Federated Learning
Tiantian Feng
Digbalay Bose
Tuo Zhang
Rajat Hebbar
Anil Ramakrishna
Rahul Gupta
Mi Zhang
Salman Avestimehr
Shrikanth Narayanan
32
48
0
15 Jun 2023
Multimodal Explainable Artificial Intelligence: A Comprehensive Review
  of Methodological Advances and Future Research Directions
Multimodal Explainable Artificial Intelligence: A Comprehensive Review of Methodological Advances and Future Research Directions
N. Rodis
Christos Sardianos
Panagiotis I. Radoglou-Grammatikis
Panagiotis G. Sarigiannidis
Iraklis Varlamis
Georgios Th. Papadopoulos
25
22
0
09 Jun 2023
Improving Protein-peptide Interface Predictions in the Low Data Regime
Improving Protein-peptide Interface Predictions in the Low Data Regime
Justin S. Diamond
M. Lill
ViT
18
0
0
31 May 2023
Generate then Select: Open-ended Visual Question Answering Guided by
  World Knowledge
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Xingyu Fu
Shenmin Zhang
Gukyeong Kwon
Pramuditha Perera
Henghui Zhu
...
Zhiguo Wang
Vittorio Castelli
Patrick K. L. Ng
Dan Roth
Bing Xiang
27
19
0
30 May 2023
Multi-Scale Attention for Audio Question Answering
Multi-Scale Attention for Audio Question Answering
Guangyao Li
Yixin Xu
Di Hu
22
16
0
29 May 2023
Context-aware attention layers coupled with optimal transport domain
  adaptation and multimodal fusion methods for recognizing dementia from
  spontaneous speech
Context-aware attention layers coupled with optimal transport domain adaptation and multimodal fusion methods for recognizing dementia from spontaneous speech
Loukas Ilias
D. Askounis
31
9
0
25 May 2023
MEMEX: Detecting Explanatory Evidence for Memes via Knowledge-Enriched
  Contextualization
MEMEX: Detecting Explanatory Evidence for Memes via Knowledge-Enriched Contextualization
Shivam Sharma
S Ramaneswaran
Udit Arora
Md. Shad Akhtar
Tanmoy Chakraborty
30
9
0
25 May 2023
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for
  Autonomous Driving Scenario
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario
Tianwen Qian
Jingjing Chen
Linhai Zhuo
Yang Jiao
Yueping Jiang
29
134
0
24 May 2023
GEST: the Graph of Events in Space and Time as a Common Representation
  between Vision and Language
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language
Mihai Masala
Nicolae Cudlenco
Traian Rebedea
Marius Leordeanu
14
0
0
22 May 2023
EAML: Ensemble Self-Attention-based Mutual Learning Network for Document
  Image Classification
EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification
Souhail Bakkali
Zuheng Ming
Mickael Coustaty
Marçal Rusiñol
10
6
0
11 May 2023
OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual
  Question Answering in Vietnamese
OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese
Nghia Hieu Nguyen
Duong T.D. Vo
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
24
18
0
07 May 2023
Transform-Equivariant Consistency Learning for Temporal Sentence
  Grounding
Transform-Equivariant Consistency Learning for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
Zichuan Xu
Yining Qi
Xing Di
Weining Lu
Yu Cheng
46
8
0
06 May 2023
Multimodal Graph Transformer for Multimodal Question Answering
Multimodal Graph Transformer for Multimodal Question Answering
Xuehai He
Xin Eric Wang
36
7
0
30 Apr 2023
Improving Visual Question Answering Models through Robustness Analysis
  and In-Context Learning with a Chain of Basic Questions
Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions
Jia-Hong Huang
Modar Alfadly
Guohao Li
M. Worring
OOD
AAML
38
5
0
06 Apr 2023
No Place to Hide: Dual Deep Interaction Channel Network for Fake News
  Detection based on Data Augmentation
No Place to Hide: Dual Deep Interaction Channel Network for Fake News Detection based on Data Augmentation
Biwei Cao
Lulu Hua
Jiuxin Cao
Jie Gui
Bo Liu
James T. Kwok
21
1
0
31 Mar 2023
CoRe-Sleep: A Multimodal Fusion Framework for Time Series Robust to
  Imperfect Modalities
CoRe-Sleep: A Multimodal Fusion Framework for Time Series Robust to Imperfect Modalities
Konstantinos Kontras
Christos Chatzichristos
Huy P Phan
Johan A. K. Suykens
Marina De Vos
AI4TS
24
11
0
27 Mar 2023
Top-Down Visual Attention from Analysis by Synthesis
Top-Down Visual Attention from Analysis by Synthesis
Baifeng Shi
Trevor Darrell
Xin Eric Wang
25
28
0
23 Mar 2023
SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel
  Storage
SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
Song Park
Sanghyuk Chun
Byeongho Heo
Wonjae Kim
Sangdoo Yun
VLM
ViT
12
8
0
20 Mar 2023
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
Bo He
Jun Wang
Jielin Qiu
Trung Bui
Abhinav Shrivastava
Zhaowen Wang
22
65
0
13 Mar 2023
Knowledge-Based Counterfactual Queries for Visual Question Answering
Knowledge-Based Counterfactual Queries for Visual Question Answering
Theodoti Stoikou
Maria Lymperaiou
Giorgos Stamou
AAML
26
1
0
05 Mar 2023
VQA with Cascade of Self- and Co-Attention Blocks
VQA with Cascade of Self- and Co-Attention Blocks
Aakansha Mishra
Ashish Anand
Prithwijit Guha
33
0
0
28 Feb 2023
Previous
12345...111213
Next