Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01933
Cited By
A Decomposable Attention Model for Natural Language Inference
6 June 2016
Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Decomposable Attention Model for Natural Language Inference"
50 / 185 papers shown
Title
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
41
2
0
02 Mar 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
18
0
17 Jan 2025
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Kaixin Wu
Yixin Ji
Z. Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELM
CLL
92
0
0
02 Dec 2024
KNOWCOMP POKEMON Team at DialAM-2024: A Two-Stage Pipeline for Detecting Relations in Dialogical Argument Mining
Zihao Zheng
Zhaowei Wang
Qing Zong
Yangqiu Song
LRM
48
1
0
29 Jul 2024
Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks
Mahdi Sabbaghi
George Pappas
Hamed Hassani
Surbhi Goel
41
4
0
04 Jun 2024
CARL: A Framework for Equivariant Image Registration
Hastings Greer
Lin Tian
François-Xavier Vialard
Roland Kwitt
R. Estépar
Marc Niethammer
3DPC
MedIm
35
0
0
27 May 2024
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
Yanxi Chen
Xuchen Pan
Yaliang Li
Bolin Ding
Jingren Zhou
LRM
41
31
0
08 Dec 2023
Beyond Semantics: Learning a Behavior Augmented Relevance Model with Self-supervised Learning
Ze-jie Chen
Wei Chen
Jia Xu
Zhongyi Liu
Wei Zhang
RALM
23
4
0
10 Aug 2023
CageViT: Convolutional Activation Guided Efficient Vision Transformer
Hao Zheng
Jinbao Wang
Xiantong Zhen
H. Chen
Jingkuan Song
Feng Zheng
ViT
20
0
0
17 May 2023
Hierarchical Video-Moment Retrieval and Step-Captioning
Abhaysinh Zala
Jaemin Cho
Satwik Kottur
Xilun Chen
Barlas Ouguz
Yasher Mehdad
Joey Tianyi Zhou
3DV
20
51
0
29 Mar 2023
Recent advances in artificial intelligence for retrosynthesis
Zipeng Zhong
Jie Song
Zunlei Feng
Tiantao Liu
Lingxiang Jia
Shaolun Yao
Tingjun Hou
Mingli Song
29
5
0
14 Jan 2023
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
32
25
0
13 Dec 2022
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks
Xiaowei Chi
Jiaming Liu
Ming Lu
Rongyu Zhang
Zhaoqing Wang
Yandong Guo
Shanghang Zhang
3DPC
43
19
0
02 Dec 2022
HeatViT: Hardware-Efficient Adaptive Token Pruning for Vision Transformers
Peiyan Dong
Mengshu Sun
Alec Lu
Yanyue Xie
Li-Yu Daisy Liu
...
Xin Meng
ZeLin Li
Xue Lin
Zhenman Fang
Yanzhi Wang
ViT
31
58
0
15 Nov 2022
Entity Matching by Pool-based Active Learning
Youfang Han
Chunping Li
34
2
0
01 Nov 2022
M
3
^3
3
ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
31
81
0
26 Oct 2022
The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks
Nikil Selvam
Sunipa Dev
Daniel Khashabi
Tushar Khot
Kai-Wei Chang
ALM
24
25
0
18 Oct 2022
Contrastive Representation Learning for Conversational Question Answering over Knowledge Graphs
Endri Kacupaj
Kuldeep Singh
M. Maleshkova
Jens Lehmann
24
13
0
09 Oct 2022
Interpreting the Mechanism of Synergism for Drug Combinations Using Attention-Based Hierarchical Graph Pooling
Zehao Dong
Heming Zhang
Yixin Chen
Philip R. O. Payne
Fuhai Li
GNN
43
16
0
19 Sep 2022
Sequence Learning Using Equilibrium Propagation
Malyaban Bal
Abhronil Sengupta
32
9
0
14 Sep 2022
INTERACTION: A Generative XAI Framework for Natural Language Inference Explanations
Jialin Yu
Alexandra I. Cristea
Anoushka Harit
Zhongtian Sun
O. Aduragba
Lei Shi
Noura Al Moubayed
26
10
0
02 Sep 2022
Bayesian Neural Network Language Modeling for Speech Recognition
Boyang Xue
Shoukang Hu
Junhao Xu
Mengzhe Geng
Xunying Liu
Helen M. Meng
UQCV
BDL
44
14
0
28 Aug 2022
Don't Pay Attention to the Noise: Learning Self-supervised Representations of Light Curves with a Denoising Time Series Transformer
M. Morvan
N. Nikolaou
K. H. Yip
Ingo P. Waldmann
AI4TS
104
9
0
06 Jul 2022
Deformable Graph Transformer
Jinyoung Park
Seongjun Yun
Hyeon-ju Park
Jaewoo Kang
Jisu Jeong
KyungHyun Kim
Jung-Woo Ha
Hyunwoo J. Kim
90
7
0
29 Jun 2022
Persian Natural Language Inference: A Meta-learning approach
Heydar Soudani
Mohammadreza Mojab
H. Beigy
32
1
0
18 May 2022
MulT: An End-to-End Multitask Learning Transformer
Deblina Bhattacharjee
Tong Zhang
Sabine Süsstrunk
Mathieu Salzmann
ViT
36
62
0
17 May 2022
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims
Miguel Arana Catania
E. Kochkina
A. Zubiaga
M. Liakata
Rob Procter
Yulan He
27
10
0
05 May 2022
QRelScore: Better Evaluating Generated Questions with Deeper Understanding of Context-aware Relevance
Xiaoqiang Wang
Bang Liu
Siliang Tang
Lingfei Wu
30
9
0
29 Apr 2022
A Probabilistic Interpretation of Transformers
Alexander Shim
35
1
0
28 Apr 2022
MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification
Jianhai Zhang
M. Maimaiti
Xing Gao
Yuanhang Zheng
Ji Zhang
16
9
0
11 Apr 2022
Fact Checking with Insufficient Evidence
Pepa Atanasova
J. Simonsen
Christina Lioma
Isabelle Augenstein
37
14
0
05 Apr 2022
Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents
Yicheng Zou
Hongwei Liu
Tao Gui
Junzhe Wang
Qi Zhang
M. Tang
Haixiang Li
Dan Wang
DRL
35
29
0
06 Mar 2022
Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference
Bangzheng Li
Wenpeng Yin
Muhao Chen
30
29
0
12 Feb 2022
CsFEVER and CTKFacts: Acquiring Czech data for fact verification
Herbert Ullrich
Jan Drchal
Martin Rýpar
Hana Vincourová
Václav Moravec
HILM
25
9
0
26 Jan 2022
Unified Question Generation with Continual Lifelong Learning
Wei Yuan
Hongzhi Yin
Tieke He
Tong Chen
Qiufeng Wang
Li-zhen Cui
36
10
0
24 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
22
103
0
16 Jan 2022
Self-attention Multi-view Representation Learning with Diversity-promoting Complementarity
Jian-wei Liu
Xi-hao Ding
Run-kun Lu
Xiong-lin Luo
8
1
0
01 Jan 2022
Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Wenchi Ma
Tianxiao Zhang
Guanghui Wang
ViT
33
14
0
26 Dec 2021
Automated Evidence Collection for Fake News Detection
Mrinal Rawat
Diptesh Kanojia
32
3
0
13 Dec 2021
Mixed Precision of Quantization of Transformer Language Models for Speech Recognition
Junhao Xu
Shoukang Hu
Jianwei Yu
Xunying Liu
Helen M. Meng
MQ
40
15
0
29 Nov 2021
WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia
Cheng-Mao Hsu
Cheng-Te Li
Diego Sáez-Trumper
Yi-Zhan Hsu
SSL
24
13
0
16 Nov 2021
BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan
A. Rahmati
M. Najafi
Saeed Bibak
Zahed Rahmati
41
5
0
03 Nov 2021
A Simple Approach to Image Tilt Correction with Self-Attention MobileNet for Smartphones
Siddhant Garg
D. Mohanty
S. Thota
Sukumar Moharana
ViT
13
2
0
31 Oct 2021
Exploiting Inter-pixel Correlations in Unsupervised Domain Adaptation for Semantic Segmentation
Inseop Chung
Jayeon Yoo
Nojun Kwak
29
4
0
21 Oct 2021
Structural Characterization for Dialogue Disentanglement
Xinbei Ma
Zhuosheng Zhang
Hai Zhao
16
16
0
15 Oct 2021
Multiplicative Position-aware Transformer Models for Language Understanding
Zhiheng Huang
Davis Liang
Peng-Tao Xu
Bing Xiang
9
1
0
27 Sep 2021
MINIMAL: Mining Models for Data Free Universal Adversarial Triggers
Swapnil Parekh
Yaman Kumar Singla
Somesh Singh
Changyou Chen
Balaji Krishnamurthy
R. Shah
AAML
16
3
0
25 Sep 2021
Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic
Zijun Wu
Zi Xuan Zhang
Atharva Naik
Zhijian Mei
Mauajama Firdaus
Lili Mou
LRM
NAI
44
14
0
18 Sep 2021
A Strong Baseline for Query Efficient Attacks in a Black Box Setting
Rishabh Maheshwary
Saket Maheshwary
Vikram Pudi
AAML
27
30
0
10 Sep 2021
Is Attention Better Than Matrix Decomposition?
Zhengyang Geng
Meng-Hao Guo
Hongxu Chen
Xia Li
Ke Wei
Zhouchen Lin
59
137
0
09 Sep 2021
1
2
3
4
Next