Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.03122
Cited By
v1
v2
v3 (latest)
Convolutional Sequence to Sequence Learning
8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Convolutional Sequence to Sequence Learning"
50 / 1,328 papers shown
Title
Augmented Neural Fine-Tuning for Efficient Backdoor Purification
Nazmul Karim
Abdullah Al Arafat
Umar Khalid
Zhishan Guo
Nazanin Rahnavard
AAML
94
1
0
14 Jul 2024
DAHRS: Divergence-Aware Hallucination-Remediated SRL Projection
Sangpil Youm
Brodie Mather
Chathuri Jayaweera
Juliana Prada
Bonnie J. Dorr
VLM
115
0
0
12 Jul 2024
The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers
Abhi Kamboj
79
0
0
24 Jun 2024
Multilingual Large Language Models and Curse of Multilinguality
Daniil Gurgurov
Tanja Bäumel
Tatiana Anikina
137
7
0
15 Jun 2024
Semi-Supervised Spoken Language Glossification
Huijie Yao
Wengang Zhou
Hao Zhou
Houqiang Li
70
0
0
12 Jun 2024
Oscillations enhance time-series prediction in reservoir computing with feedback
Yuji Kawai
Takashi Morita
Jihoon Park
Minoru Asada
AI4TS
42
1
0
05 Jun 2024
Meta-Designing Quantum Experiments with Language Models
Sören Arlt
Haonan Duan
Felix Li
Sang Michael Xie
Yuhuai Wu
Mario Krenn
AI4CE
123
5
0
04 Jun 2024
Using Explainable AI for EEG-based Reduced Montage Neonatal Seizure Detection
Dinuka Sandun Udayantha
Kavindu Weerasinghe
Nima Wickramasinghe
Akila Abeyratne
Kithmin Wickremasinghe
J. Wanigasinghe
Anjula De Silva
Chamira U. S. Edussooriya
69
2
0
04 Jun 2024
Anomaly Detection in Dynamic Graphs: A Comprehensive Survey
Ocheme Anthony Ekle
William Eberle
AI4TS
100
11
0
31 May 2024
Contextual Position Encoding: Learning to Count What's Important
O. Yu. Golovneva
Tianlu Wang
Jason Weston
Sainbayar Sukhbaatar
112
35
0
29 May 2024
MEGA: Masked Generative Autoencoder for Human Mesh Recovery
Guénolé Fiche
Simon Leglaive
Xavier Alameda-Pineda
Francesc Moreno-Noguer
3DH
139
1
0
29 May 2024
Building Vision Models upon Heat Conduction
Zhaozhi Wang
Yue Liu
Yunfan Liu
Hongtian Yu
Yaowei Wang
QiXiang Ye
ViT
VLM
102
0
0
26 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
184
27
0
20 May 2024
Context-Aware Machine Translation with Source Coreference Explanation
Huy Hien Vu
Hidetaka Kamigaito
Taro Watanabe
LRM
66
3
0
30 Apr 2024
BotDGT: Dynamicity-aware Social Bot Detection with Dynamic Graph Transformers
Buyun He
Yingguang Yang
Qi Wu
Hao Liu
Renyu Yang
Hao Peng
Xiang Wang
Yong Liao
P. Zhou
57
9
0
23 Apr 2024
Multi-Cell Decoder and Mutual Learning for Table Structure and Character Recognition
T. Kawakatsu
LMTD
49
3
0
20 Apr 2024
HybriMap: Hybrid Clues Utilization for Effective Vectorized HD Map Construction
Chi Zhang
Qi Song
Feifei Li
Yongquan Chen
Rui Huang
75
2
0
17 Apr 2024
Optimal Kernel Tuning Parameter Prediction using Deep Sequence Models
Khawir Mahmood
Jehandad Khan
Hammad Afzal
47
0
0
15 Apr 2024
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
75
2
0
06 Apr 2024
Vision Transformers in Domain Adaptation and Generalization: A Study of Robustness
Shadi Alijani
Jamil Fayyad
Homayoun Najjaran
OOD
116
1
0
05 Apr 2024
Generative weather for improved crop model simulations
Yuji Saikai
107
1
0
31 Mar 2024
Equipping Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch Representation
Sicong Zang
Zhijun Fang
115
0
0
26 Mar 2024
Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
Kamal Kumar
Yinhan Liu
Parth Patwa
Tanmoy
Mihir Adam Roberts
95
2
0
25 Mar 2024
A Stochastic Quasi-Newton Method for Non-convex Optimization with Non-uniform Smoothness
Zhenyu Sun
Ermin Wei
113
0
0
22 Mar 2024
KeyPoint Relative Position Encoding for Face Recognition
Minchul Kim
Yiyang Su
Feng Liu
Anil Jain
Xiaoming Liu
CVBM
88
10
0
21 Mar 2024
Prediction of Translation Techniques for the Translation Process
Fan Zhou
Vincent Vandeghinste
77
0
0
21 Mar 2024
Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models
Esmaeil Seraj
Walter Talamonti
54
0
0
10 Mar 2024
Spatio-Temporal Field Neural Networks for Air Quality Inference
Yutong Feng
Qiongyan Wang
Yutong Xia
Junlin Huang
Siru Zhong
Yuxuan Liang
125
1
0
02 Mar 2024
Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching
Rui Gong
Weide Liu
Zaiwang Gu
Xulei Yang
Jun Cheng
3DV
119
9
0
29 Feb 2024
Can Transformers Predict Vibrations?
Fusataka Kuniyoshi
Yoshihide Sawada
54
0
0
16 Feb 2024
Grandmaster-Level Chess Without Search
Anian Ruoss
Grégoire Delétang
Sourabh Medapati
Jordi Grau-Moya
Wenliang Kevin Li
Elliot Catt
John Reid
Tim Genewein
LRM
125
7
0
07 Feb 2024
Revisiting the Markov Property for Machine Translation
Cunxiao Du
Hao Zhou
Zhaopeng Tu
Jing Jiang
110
2
0
03 Feb 2024
Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary
Takashi Morita
94
3
0
31 Jan 2024
Engineering A Large Language Model From Scratch
Abiodun Finbarrs Oketunji
46
0
0
30 Jan 2024
Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
T. Golling
Lukas Heinrich
Michael Kagan
Samuel Klein
Matthew Leigh
Margarita Osadchy
J. A. Raine
87
27
0
24 Jan 2024
Full Bayesian Significance Testing for Neural Networks
Zehua Liu
Zimeng Li
Jingyuan Wang
Yue He
BDL
59
4
0
24 Jan 2024
Unsupervised Learning of Graph from Recipes
Aissatou Diallo
Antonis Bikakis
Luke Dickens
Anthony Hunter
Rob Miller
SSL
60
0
0
22 Jan 2024
M3BUNet: Mobile Mean Max UNet for Pancreas Segmentation on CT-Scans
Juwita Juwita
Ghulam Mubashar Hassan
Naveed Akhtar
Amitava Datta
65
1
0
18 Jan 2024
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
Jinzhi Zheng
Ruyi Ji
Libo Zhang
Yanjun Wu
Chen Zhao
69
4
0
18 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Aman Chadha
Amitava Das
68
29
0
15 Jan 2024
Extending LLMs' Context Window with 100 Samples
Yikai Zhang
Junlong Li
Pengfei Liu
89
12
0
13 Jan 2024
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
67
0
0
06 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Yiran Song
Qianyu Zhou
Hefei Ling
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
141
15
0
04 Jan 2024
Algebraic Positional Encodings
Konstantinos Kogkalidis
Jean-Philippe Bernardy
Vikas Garg
42
3
0
26 Dec 2023
Heterogeneous Encoders Scaling In The Transformer For Neural Machine Translation
J. Hu
Roberto Cavicchioli
Giulia Berardinelli
Alessandro Capotondi
75
2
0
26 Dec 2023
Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers
James Gunn
Zygmunt Lenyk
Anuj Sharma
Andrea Donati
Alexandru Buburuzan
John Redford
Romain Mueller
MDE
98
9
0
22 Dec 2023
Who is leading in AI? An analysis of industry AI research
Ben Cottier
T. Besiroglu
David Owen
126
8
0
24 Nov 2023
Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis
Honglin Li
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Sunyi Zheng
Lin Yang
VLM
89
4
0
21 Nov 2023
Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability
Jishnu Ray Chowdhury
Cornelia Caragea
78
5
0
08 Nov 2023
The Expressibility of Polynomial based Attention Scheme
Zhao Song
Guangyi Xu
Junze Yin
95
5
0
30 Oct 2023
Previous
1
2
3
4
5
...
25
26
27
Next