1705.03122
Convolutional Sequence to Sequence Learning
8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
Papers citing
"Convolutional Sequence to Sequence Learning"
50 / 1,321 papers shown
Title
MEGA: Masked Generative Autoencoder for Human Mesh Recovery
Guénolé Fiche
Simon Leglaive
Xavier Alameda-Pineda
Francesc Moreno-Noguer
3DH
60
1
0
29 May 2024
Building Vision Models upon Heat Conduction
Zhaozhi Wang
Yue Liu
Yunfan Liu
Hongtian Yu
Yaowei Wang
QiXiang Ye
ViT
VLM
58
0
0
26 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
119
22
0
20 May 2024
Context-Aware Machine Translation with Source Coreference Explanation
Huy Hien Vu
Hidetaka Kamigaito
Taro Watanabe
LRM
44
2
0
30 Apr 2024
BotDGT: Dynamicity-aware Social Bot Detection with Dynamic Graph Transformers
Buyun He
Yingguang Yang
Qi Wu
Hao Liu
Renyu Yang
Hao Peng
Xiang Wang
Yong Liao
P. Zhou
29
6
0
23 Apr 2024
Multi-Cell Decoder and Mutual Learning for Table Structure and Character Recognition
T. Kawakatsu
LMTD
35
2
0
20 Apr 2024
HybriMap: Hybrid Clues Utilization for Effective Vectorized HD Map Construction
Chi Zhang
Qi Song
Feifei Li
Yongquan Chen
Rui Huang
35
2
0
17 Apr 2024
Optimal Kernel Tuning Parameter Prediction using Deep Sequence Models
Khawir Mahmood
Jehandad Khan
Hammad Afzal
26
0
0
15 Apr 2024
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
34
1
0
06 Apr 2024
Vision Transformers in Domain Adaptation and Generalization: A Study of Robustness
Shadi Alijani
Jamil Fayyad
H. Najjaran
OOD
35
10
0
05 Apr 2024
Generative weather for improved crop model simulations
Yuji Saikai
27
1
0
31 Mar 2024
Equipping Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch Representation
Sicong Zang
Zhijun Fang
42
0
0
26 Mar 2024
Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
Kamal Kumar
Yinhan Liu
Parth Patwa
Tanmoy
Mihir Adam Roberts
27
1
0
25 Mar 2024
A Stochastic Quasi-Newton Method for Non-convex Optimization with Non-uniform Smoothness
Zhenyu Sun
Ermin Wei
39
0
0
22 Mar 2024
KeyPoint Relative Position Encoding for Face Recognition
Minchul Kim
Yiyang Su
Feng Liu
Anil Jain
Xiaoming Liu
CVBM
49
7
0
21 Mar 2024
Prediction of Translation Techniques for the Translation Process
Fan Zhou
Vincent Vandeghinste
37
0
0
21 Mar 2024
Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models
Esmaeil Seraj
Walter Talamonti
30
0
0
10 Mar 2024
Spatio-Temporal Field Neural Networks for Air Quality Inference
Yutong Feng
Qiongyan Wang
Yutong Xia
Junlin Huang
Siru Zhong
Keli Zhang
37
0
0
02 Mar 2024
Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching
Rui Gong
Weide Liu
Zaiwang Gu
Xulei Yang
Jun Cheng
3DV
24
9
0
29 Feb 2024
Can Transformers Predict Vibrations?
Fusataka Kuniyoshi
Yoshihide Sawada
27
0
0
16 Feb 2024
Grandmaster-Level Chess Without Search
Anian Ruoss
Grégoire Delétang
Sourabh Medapati
Jordi Grau-Moya
Wenliang Kevin Li
Elliot Catt
John Reid
Tim Genewein
LRM
72
14
0
07 Feb 2024
Revisiting the Markov Property for Machine Translation
Cunxiao Du
Hao Zhou
Zhaopeng Tu
Jing Jiang
21
1
0
03 Feb 2024
Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary
Takashi Morita
16
3
0
31 Jan 2024
Engineering A Large Language Model From Scratch
Abiodun Finbarrs Oketunji
32
0
0
30 Jan 2024
Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
T. Golling
Lukas Heinrich
Michael Kagan
Samuel Klein
Matthew Leigh
Margarita Osadchy
J. A. Raine
28
24
0
24 Jan 2024
Full Bayesian Significance Testing for Neural Networks
Zehua Liu
Zimeng Li
Jingyuan Wang
Yue He
BDL
13
3
0
24 Jan 2024
Unsupervised Learning of Graph from Recipes
Aissatou Diallo
Antonis Bikakis
Luke Dickens
Anthony Hunter
Rob Miller
SSL
17
0
0
22 Jan 2024
M3BUNet: Mobile Mean Max UNet for Pancreas Segmentation on CT-Scans
Juwita Juwita
Ghulam Mubashar Hassan
Naveed Akhtar
Amitava Datta
26
1
0
18 Jan 2024
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
Jinzhi Zheng
Ruyi Ji
Libo Zhang
Yanjun Wu
Chen Zhao
32
4
0
18 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Aman Chadha
Amitava Das
37
28
0
15 Jan 2024
Extending LLMs' Context Window with 100 Samples
Yikai Zhang
Junlong Li
Pengfei Liu
37
11
0
13 Jan 2024
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
26
0
0
06 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Yiran Song
Qianyu Zhou
Hefei Ling
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
38
14
0
04 Jan 2024
Algebraic Positional Encodings
Konstantinos Kogkalidis
Jean-Philippe Bernardy
Vikas K. Garg
16
1
0
26 Dec 2023
Heterogeneous Encoders Scaling In The Transformer For Neural Machine Translation
J. Hu
Roberto Cavicchioli
Giulia Berardinelli
Alessandro Capotondi
44
2
0
26 Dec 2023
Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers
James Gunn
Zygmunt Lenyk
Anuj Sharma
Andrea Donati
Alexandru Buburuzan
John Redford
Romain Mueller
MDE
38
8
0
22 Dec 2023
Who is leading in AI? An analysis of industry AI research
Ben Cottier
T. Besiroglu
David Owen
36
7
0
24 Nov 2023
Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis
Honglin Li
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Sunyi Zheng
Lin Yang
VLM
37
4
0
21 Nov 2023
Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability
Jishnu Ray Chowdhury
Cornelia Caragea
37
5
0
08 Nov 2023
The Expressibility of Polynomial based Attention Scheme
Zhao Song
Guangyi Xu
Junze Yin
32
5
0
30 Oct 2023
PartialFormer: Modeling Part Instead of Whole for Machine Translation
Tong Zheng
Bei Li
Huiwen Bao
Jiale Wang
Weiqiao Shan
Tong Xiao
Jingbo Zhu
MoE
AI4CE
16
0
0
23 Oct 2023
Rethinking SIGN Training: Provable Nonconvex Acceleration without First- and Second-Order Gradient Lipschitz
Tao Sun
Congliang Chen
Peng Qiao
Li Shen
Xinwang Liu
Dongsheng Li
36
3
0
23 Oct 2023
The Locality and Symmetry of Positional Encodings
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
44
0
0
19 Oct 2023
Enhancing Neural Machine Translation with Semantic Units
Langlin Huang
Shuhao Gu
Zhuocheng Zhang
Yang Feng
44
4
0
17 Oct 2023
Distilling Efficient Vision Transformers from CNNs for Semantic Segmentation
Xueye Zheng
Yunhao Luo
Pengyuan Zhou
Lin Wang
35
13
0
11 Oct 2023
Enhancing Pre-Trained Language Models with Sentence Position Embeddings for Rhetorical Roles Recognition in Legal Opinions
Anas Belfathi
Nicolas Hernandez
Laura Monceaux
AILaw
31
3
0
08 Oct 2023
Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts
Emad A. Alghamdi
Jezia Zakraoui
Fares A. Abanmy
29
1
0
22 Sep 2023
AI Foundation Models for Weather and Climate: Applications, Design, and Implementation
S. K. Mukkavilli
Daniel Salles Civitarese
J. Schmude
Johannes Jakubik
Anne Jones
...
R. Ganti
Hendrik Hamann
U. Nair
Rahul Ramachandran
Kommy Weldemariam
AI4Cl
AI4CE
32
18
0
19 Sep 2023
Nebula: Self-Attention for Dynamic Malware Analysis
Dmitrijs Trizna
Luca Demetrio
Battista Biggio
Fabio Roli
24
13
0
19 Sep 2023
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair
Weishi Wang
Yue Wang
Chenyu You
Steven C. H. Hoi
29
57
0
12 Sep 2023