Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01933
Cited By
A Decomposable Attention Model for Natural Language Inference
6 June 2016
Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Decomposable Attention Model for Natural Language Inference"
50 / 166 papers shown
Title
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
41
2
0
02 Mar 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
102
18
0
17 Jan 2025
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Kaixin Wu
Yixin Ji
Z. Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELM
CLL
92
0
0
02 Dec 2024
KNOWCOMP POKEMON Team at DialAM-2024: A Two-Stage Pipeline for Detecting Relations in Dialogical Argument Mining
Zihao Zheng
Zhaowei Wang
Qing Zong
Yangqiu Song
LRM
40
1
0
29 Jul 2024
CARL: A Framework for Equivariant Image Registration
Hastings Greer
Lin Tian
François-Xavier Vialard
Roland Kwitt
R. Estépar
Marc Niethammer
3DPC
MedIm
35
0
0
27 May 2024
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
Yanxi Chen
Xuchen Pan
Yaliang Li
Bolin Ding
Jingren Zhou
LRM
33
31
0
08 Dec 2023
Beyond Semantics: Learning a Behavior Augmented Relevance Model with Self-supervised Learning
Ze-jie Chen
Wei-Neng Chen
Jia Xu
Zhongyi Liu
Wei Zhang
RALM
23
4
0
10 Aug 2023
CageViT: Convolutional Activation Guided Efficient Vision Transformer
Hao Zheng
Jinbao Wang
Xiantong Zhen
H. Chen
Jingkuan Song
Feng Zheng
ViT
10
0
0
17 May 2023
Hierarchical Video-Moment Retrieval and Step-Captioning
Abhaysinh Zala
Jaemin Cho
Satwik Kottur
Xilun Chen
Barlas Ouguz
Yasher Mehdad
Mohit Bansal
3DV
18
51
0
29 Mar 2023
Recent advances in artificial intelligence for retrosynthesis
Zipeng Zhong
Jie Song
Zunlei Feng
Tiantao Liu
Lingxiang Jia
Shaolun Yao
Tingjun Hou
Mingli Song
29
5
0
14 Jan 2023
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
26
25
0
13 Dec 2022
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks
Xiaowei Chi
Jiaming Liu
Ming Lu
Rongyu Zhang
Zhaoqing Wang
Yandong Guo
Shanghang Zhang
3DPC
38
19
0
02 Dec 2022
HeatViT: Hardware-Efficient Adaptive Token Pruning for Vision Transformers
Peiyan Dong
Mengshu Sun
Alec Lu
Yanyue Xie
Li-Yu Daisy Liu
...
Xin Meng
Z. Li
Xue Lin
Zhenman Fang
Yanzhi Wang
ViT
26
58
0
15 Nov 2022
Entity Matching by Pool-based Active Learning
Youfang Han
Chunping Li
26
2
0
01 Nov 2022
M
3
^3
3
ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
29
81
0
26 Oct 2022
The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks
Nikil Selvam
Sunipa Dev
Daniel Khashabi
Tushar Khot
Kai-Wei Chang
ALM
13
25
0
18 Oct 2022
Contrastive Representation Learning for Conversational Question Answering over Knowledge Graphs
Endri Kacupaj
Kuldeep Singh
M. Maleshkova
Jens Lehmann
24
13
0
09 Oct 2022
Interpreting the Mechanism of Synergism for Drug Combinations Using Attention-Based Hierarchical Graph Pooling
Zehao Dong
Heming Zhang
Yixin Chen
Philip R. O. Payne
Fuhai Li
GNN
38
16
0
19 Sep 2022
Sequence Learning Using Equilibrium Propagation
Malyaban Bal
Abhronil Sengupta
32
9
0
14 Sep 2022
INTERACTION: A Generative XAI Framework for Natural Language Inference Explanations
Jialin Yu
Alexandra I. Cristea
Anoushka Harit
Zhongtian Sun
O. Aduragba
Lei Shi
Noura Al Moubayed
18
10
0
02 Sep 2022
Don't Pay Attention to the Noise: Learning Self-supervised Representations of Light Curves with a Denoising Time Series Transformer
M. Morvan
N. Nikolaou
K. H. Yip
Ingo P. Waldmann
AI4TS
99
9
0
06 Jul 2022
MulT: An End-to-End Multitask Learning Transformer
Deblina Bhattacharjee
Tong Zhang
Sabine Süsstrunk
Mathieu Salzmann
ViT
36
62
0
17 May 2022
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims
Miguel Arana Catania
E. Kochkina
A. Zubiaga
M. Liakata
Rob Procter
Yulan He
25
10
0
05 May 2022
QRelScore: Better Evaluating Generated Questions with Deeper Understanding of Context-aware Relevance
Xiaoqiang Wang
Bang Liu
Siliang Tang
Lingfei Wu
16
9
0
29 Apr 2022
A Probabilistic Interpretation of Transformers
Alexander Shim
33
1
0
28 Apr 2022
MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification
Jianhai Zhang
M. Maimaiti
Xing Gao
Yuanhang Zheng
Ji Zhang
11
9
0
11 Apr 2022
Fact Checking with Insufficient Evidence
Pepa Atanasova
J. Simonsen
Christina Lioma
Isabelle Augenstein
29
14
0
05 Apr 2022
Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents
Yicheng Zou
Hongwei Liu
Tao Gui
Junzhe Wang
Qi Zhang
M. Tang
Haixiang Li
Dan Wang
DRL
35
29
0
06 Mar 2022
Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference
Bangzheng Li
Wenpeng Yin
Muhao Chen
22
29
0
12 Feb 2022
CsFEVER and CTKFacts: Acquiring Czech data for fact verification
Herbert Ullrich
Jan Drchal
Martin Rýpar
Hana Vincourová
Václav Moravec
HILM
17
9
0
26 Jan 2022
Unified Question Generation with Continual Lifelong Learning
Wei Yuan
Hongzhi Yin
Tieke He
Tong Chen
Qiufeng Wang
Li-zhen Cui
28
10
0
24 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
22
103
0
16 Jan 2022
Self-attention Multi-view Representation Learning with Diversity-promoting Complementarity
Jian-wei Liu
Xi-hao Ding
Run-kun Lu
Xiong-lin Luo
8
1
0
01 Jan 2022
Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Wenchi Ma
Tianxiao Zhang
Guanghui Wang
ViT
28
14
0
26 Dec 2021
Automated Evidence Collection for Fake News Detection
Mrinal Rawat
Diptesh Kanojia
27
3
0
13 Dec 2021
Mixed Precision of Quantization of Transformer Language Models for Speech Recognition
Junhao Xu
Shoukang Hu
Jianwei Yu
Xunying Liu
Helen M. Meng
MQ
32
15
0
29 Nov 2021
WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia
Cheng-Mao Hsu
Cheng-Te Li
Diego Sáez-Trumper
Yi-Zhan Hsu
SSL
16
13
0
16 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
33
5
0
03 Nov 2021
A Simple Approach to Image Tilt Correction with Self-Attention MobileNet for Smartphones
Siddhant Garg
D. Mohanty
S. Thota
Sukumar Moharana
ViT
11
2
0
31 Oct 2021
Exploiting Inter-pixel Correlations in Unsupervised Domain Adaptation for Semantic Segmentation
Inseop Chung
Jayeon Yoo
Nojun Kwak
21
4
0
21 Oct 2021
Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic
Zijun Wu
Zi Xuan Zhang
Atharva Naik
Zhijian Mei
Mauajama Firdaus
Lili Mou
LRM
NAI
36
14
0
18 Sep 2021
A Strong Baseline for Query Efficient Attacks in a Black Box Setting
Rishabh Maheshwary
Saket Maheshwary
Vikram Pudi
AAML
16
30
0
10 Sep 2021
Fastformer: Additive Attention Can Be All You Need
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
35
117
0
20 Aug 2021
Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition
Ziwei Xu
Guangzhi Wang
Yongkang Wong
Mohan S. Kankanhalli
41
26
0
10 Aug 2021
RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting
Jiachen Li
Fan Yang
Hengbo Ma
Srikanth Malla
M. Tomizuka
Chiho Choi
24
42
0
03 Aug 2021
Towards Robustness Against Natural Language Word Substitutions
Xinshuai Dong
A. Luu
Rongrong Ji
Hong Liu
SILM
AAML
25
112
0
28 Jul 2021
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
28
334
0
07 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
52
94
0
01 Jul 2021
DocNLI: A Large-scale Dataset for Document-level Natural Language Inference
Wenpeng Yin
Dragomir R. Radev
Caiming Xiong
HILM
19
97
0
17 Jun 2021
GraphiT: Encoding Graph Structure in Transformers
Grégoire Mialon
Dexiong Chen
Margot Selosse
Julien Mairal
20
163
0
10 Jun 2021
1
2
3
4
Next