Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 27,180 papers shown
Title
Transformer Encoder and Multi-features Time2Vec for Financial Prediction
Nguyen Kim Hai Bui
Nguyen Duy Chien
P. Kovács
Gergő Bognár
AI4TS
AIFin
138
0
0
01 Jul 2025
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
Yutao Zhu
Xiaosong Jia
Xinyu Yang
Junchi Yan
ViT
78
6
0
01 Jul 2025
Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser
Qingyuan Cai
Xuecai Hu
Saihui Hou
Li Yao
Yongzhen Huang
DiffM
63
14
0
01 Jul 2025
Aircraft Trajectory Dataset Augmentation in Latent Space
Seokbin Yoon
Keumjin Lee
15
0
0
01 Jul 2025
BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports
Jing-Yuan Chang
90
0
0
01 Jul 2025
EXPRTS: Exploring and Probing the Robustness of Time Series Forecasting Models
Haakon Hanisch Kjaernli
Lluis Mas-Ribas
Hans Jakob Håland
Gleb Sizov
Aida Ashrafi
Helge Langseth
Odd Erik Gundersen
AI4TS
125
0
0
01 Jul 2025
Neural Canonical Polyadic Factorization for Traffic Analysis
Yikai Hou
Peng Tang
10
0
0
01 Jul 2025
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning
Lizhen Xu
Xiuxiu Bai
Xiaojun Jia
Jianwu Fang
Shanmin Pang
127
0
0
01 Jul 2025
Open World Object Detection: A Survey
Yiming Li
Yi Wang
Wenqian Wang
Dan Lin
Bingbing Li
Kim-Hui Yap
ObjD
84
1
0
01 Jul 2025
BFA: Best-Feature-Aware Fusion for Multi-View Fine-grained Manipulation
Zihan Lan
Weixin Mao
Haoyang Li
Le Wang
Tiancai Wang
Haoqiang Fan
Osamu Yoshie
EgoV
134
2
0
01 Jul 2025
Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles
Xing Shen
Hengguan Huang
Brennan Nichyporuk
Tal Arbel
MedIm
133
4
0
01 Jul 2025
Challenging Gradient Boosted Decision Trees with Tabular Transformers for Fraud Detection at Booking.com
Sergei Krutikov
Bulat Khaertdinov
Rodion Kiriukhin
Shubham Agrawal
Mozhdeh Ariannezhad
Kees Jan de Vries
LMTD
94
0
0
01 Jul 2025
CarGait: Cross-Attention based Re-ranking for Gait recognition
Gavriel Habib
Noa Barzilay
O. Shimshi
Rami Ben-Ari
N. Darshan
CVBM
134
1
0
01 Jul 2025
Visual Re-Ranking with Non-Visual Side Information
Gustav Hanning
Gabrielle Flood
Viktor Larsson
66
0
0
01 Jul 2025
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference
Yushen Chen
Jiawei Zhang
Baotong Lu
Qianxi Zhang
Chengruidong Zhang
...
Chen Chen
Mingxing Zhang
Yuqing Yang
Fan Yang
Mao Yang
85
1
0
01 Jul 2025
A Pre-trained Sequential Recommendation Framework: Popularity Dynamics for Zero-shot Transfer
Junting Wang
Praneet Rathi
Hari Sundaram
HAI
VLM
34
5
0
01 Jul 2025
CBAGAN-RRT: Convolutional Block Attention Generative Adversarial Network for Sampling-Based Path Planning
Abhinav Sagar
Sai Teja Gilukara
65
0
0
01 Jul 2025
Vision-QRWKV: Exploring Quantum-Enhanced RWKV Models for Image Classification
Chi-Sheng Chen
18
0
0
01 Jul 2025
FreeCodec: A disentangled neural speech codec with fewer tokens
Youqiang Zheng
Weiping Tu
Yueteng Kang
Jie Chen
Yike Zhang
Li Xiao
Yuhong Yang
Long Ma
132
4
0
01 Jul 2025
MedSegNet10: A Publicly Accessible Network Repository for Split Federated Medical Image Segmentation
C. Shiranthika
Zahra Hafezi Kafshgari
Hadi Hadizadeh
Parvaneh Saeedi
FedML
96
0
0
01 Jul 2025
Video-Guided Text-to-Music Generation Using Public Domain Movie Collections
Haven Kim
Cheng-i Wang
Weihan Xu
Julian McAuley
Hao-Wen Dong
VGen
36
0
0
01 Jul 2025
Efficient Online Inference of Vision Transformers by Training-Free Tokenization
Leonidas Gee
Wing Yan Li
V. Sharmanska
Novi Quadrianto
ViT
197
0
0
01 Jul 2025
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
Mingshu Zhao
Yi Luo
Yong Ouyang
100
0
0
01 Jul 2025
Progressive Binarization with Semi-Structured Pruning for LLMs
Xinyu Yan
Tianao Zhang
Zhiteng Li
Yulun Zhang
MQ
136
1
0
01 Jul 2025
Incomplete Multi-view Clustering via Diffusion Contrastive Generation
Yuanyang Zhang
Yijie Lin
Weiqing Yan
Li Yao
Xinhang Wan
Guangyuan Li
Chao Zhang
Guanzhou Ke
Jie Xu
DiffM
119
0
0
01 Jul 2025
TransDreamerV3: Implanting Transformer In DreamerV3
Shruti Sadanand Dongare
Amun Kharel
Jonathan Samuel
Xiaona Zhou
10
0
0
20 Jun 2025
Relaxed syntax modeling in Transformers for future-proof license plate recognition
Florent Meyer
Laurent Guichard
Denis Coquenet
Guillaume Gravier
Yann Soullard
Bertrand Coüasnon
15
0
0
20 Jun 2025
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
Adithya Bhaskar
Alexander Wettig
Tianyu Gao
Yihe Dong
Danqi Chen
10
0
0
20 Jun 2025
The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation
Giulia Bertazzini
Chiara Albisani
Daniele Baracchi
Dasara Shullani
Roberto Verdecchia
14
0
0
20 Jun 2025
PPTP: Performance-Guided Physiological Signal-Based Trust Prediction in Human-Robot Collaboration
Hao Guo
Wei Fan
Shaohui Liu
Feng Jiang
Chunzhi Yi
12
0
0
20 Jun 2025
ParkFormer: A Transformer-Based Parking Policy with Goal Embedding and Pedestrian-Aware Control
Jun Fu
Bin Tian
Haonan Chen
Shi Meng
Tingting Yao
ViT
10
0
0
20 Jun 2025
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
Jingtong Su
Julia Kempe
Karen Ullrich
10
0
0
20 Jun 2025
LaVi: Efficient Large Vision-Language Models via Internal Feature Modulation
Tongtian Yue
Longteng Guo
Yepeng Tang
Zijia Zhao
Xinxin Zhu
Hua Huang
Jing Liu
MLLM
VLM
16
0
0
20 Jun 2025
Emergent Temporal Correspondences from Video Diffusion Transformers
Jisu Nam
Soowon Son
Dahyun Chung
Jiyoung Kim
Siyoon Jin
Junhwa Hur
Seungryong Kim
VGen
16
0
0
20 Jun 2025
When Can Model-Free Reinforcement Learning be Enough for Thinking?
Josiah P. Hanna
Nicholas Corrado
OffRL
LM&Ro
ReLM
LRM
AI4CE
31
0
0
20 Jun 2025
RocketStack: A level-aware deep recursive ensemble learning framework with exploratory feature fusion and model pruning dynamics
Çağatay Demirel
5
0
0
20 Jun 2025
Mesh-Informed Neural Operator : A Transformer Generative Approach
Yaozhong Shi
Zachary E. Ross
D. Asimaki
Kamyar Azizzadenesheli
AI4CE
19
0
0
20 Jun 2025
Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs
Thomas Marwitz
Alexander Colsmann
Ben Breitung
Christoph Brabec
Christoph Kirchlechner
...
Michael Hirtz
Pavel A. Levkin
Yolita M. Eggeler
Tobias Schlöder
Pascal Friederich
AI4CE
28
0
0
20 Jun 2025
Optimal Depth of Neural Networks
Qian Qi
10
0
0
20 Jun 2025
Universal Music Representations? Evaluating Foundation Models on World Music Corpora
Charilaos Papaioannou
Emmanouil Benetos
Alexandros Potamianos
14
0
0
20 Jun 2025
A Quantile Regression Approach for Remaining Useful Life Estimation with State Space Models
Davide Frizzo
Francesco Borsatti
Gian Antonio Susto
5
0
0
20 Jun 2025
Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation
Chenxu Wang
Yonggang Jin
Cheng Hu
Youpeng Zhao
Zipeng Dai
Jian Zhao
Shiyu Huang
Liuyu Xiang
Junge Zhang
Zhaofeng He
12
0
0
20 Jun 2025
Neural Polar Decoders for DNA Data Storage
Ziv Aharoni
Henry D. Pfister
5
0
0
20 Jun 2025
A Simple Contrastive Framework Of Item Tokenization For Generative Recommendation
Penglong Zhai
Yifang Yuan
Fanyi Di
Jie Li
Y. Liu
Chen Li
Jie Huang
S. Wang
Yao Xu
X. Li
10
0
0
20 Jun 2025
EFormer: An Effective Edge-based Transformer for Vehicle Routing Problems
Dian Meng
Zhiguang Cao
Yaoxin Wu
Yaqing Hou
Hongwei Ge
Qiang Zhang
20
0
0
19 Jun 2025
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View
Fenghua Cheng
Jinxiang Wang
Sen Wang
Zi Huang
Xue Li
LRM
17
0
0
19 Jun 2025
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Léo Gagnon
Eric Elmoznino
Sarthak Mittal
Tom Marty
Tejas Kasetty
Dhanya Sridhar
Guillaume Lajoie
10
0
0
19 Jun 2025
Goal-conditioned Hierarchical Reinforcement Learning for Sample-efficient and Safe Autonomous Driving at Intersections
Yiou Huang
9
0
0
19 Jun 2025
Advanced Sign Language Video Generation with Compressed and Quantized Multi-Condition Tokenization
Cong Wang
Zexuan Deng
Zhiwei Jiang
Fei Shen
Yafeng Yin
Shiwei Gan
Zifeng Cheng
Shiping Ge
Qing Gu
DiffM
SLR
VGen
31
0
0
19 Jun 2025
Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse
Paulina DeVito
Akhil Vallala
Sean Mcmahon
Yaroslav Hinda
Benjamin Thaw
Hanqi Zhuang
Hari Kalva
10
0
0
19 Jun 2025
1
2
3
4
...
542
543
544
Next