Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 27,180 papers shown
Title
POCO: Scalable Neural Forecasting through Population Conditioning
Yu Duan
Hamza Tahir Chaudhry
Misha B. Ahrens
Christopher D Harvey
Matthew G Perich
Karl Deisseroth
Kanaka Rajan
AI4CE
17
0
0
17 Jun 2025
Adaptive Accompaniment with ReaLchords
Yusong Wu
Tim Cooijmans
Kyle Kastner
Adam Roberts
Ian Simon
...
Shayegan Omidshafiei
Aaron Courville
Pablo Samuel Castro
Natasha Jaques
Cheng-Zhi Anna Huang
19
0
0
17 Jun 2025
Human-Centered Editable Speech-to-Sign-Language Generation via Streaming Conformer-Transformer and Resampling Hook
Yingchao Li
SLR
39
0
0
17 Jun 2025
Refining music sample identification with a self-supervised graph neural network
Aditya Bhattacharjee
Ivan Meresman Higgs
Mark Sandler
Emmanouil Benetos
25
0
0
17 Jun 2025
Multi-Scale Finetuning for Encoder-based Time Series Foundation Models
Zhongzheng Qiao
Chenghao Liu
Y. Zhang
Ming Jin
Quang Pham
Qingsong Wen
P.N. Suganthan
Xudong Jiang
Savitha Ramasamy
AI4TS
AI4CE
17
0
0
17 Jun 2025
Into the Unknown: Applying Inductive Spatial-Semantic Location Embeddings for Predicting Individuals' Mobility Beyond Visited Places
Xinglei Wang
Tao Cheng
Stephen Law
Zichao Zeng
Ilya Ilyankou
Junyuan Liu
Lu Yin
Weiming Huang
Natchapon Jongwiriyanurak
HAI
22
0
0
17 Jun 2025
Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven Framework for Resilient 3D Place Recognition
Xiaohui Jiang
Haijiang Zhu
Chadei Li
Fulin Tang
Ning An
15
0
0
17 Jun 2025
MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution
Zhiwen Shao
Yifan Cheng
Feiran Li
Yong Zhou
Xuequan Lu
Yuan Xie
Lizhuang Ma
ViT
20
0
0
17 Jun 2025
Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot
Xiang Cheng
Chengyan Pan
Minjun Zhao
Deyang Li
Fangchao Liu
Xinyu Zhang
Xiao Zhang
Yong Liu
ReLM
LRM
39
0
0
17 Jun 2025
SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models
Gyuhak Kim
Sumiran Thakur
Su Min Park
Wei Wei
Yujia Bao
15
0
0
17 Jun 2025
CALM: Contextual Analog Logic with Multimodality
Maxwell J. Jacobson
Corey J. Maley
Yexiang Xue
10
0
0
17 Jun 2025
Unifying Streaming and Non-streaming Zipformer-based ASR
Bidisha Sharma
Karthik Pandia Durai
Shankar Venkatesan
Jeena Prakash
Shashi Kumar
Malolan Chetlur
Andreas Stolcke
23
0
0
17 Jun 2025
Recursive Variational Autoencoders for 3D Blood Vessel Generative Modeling
Paula Feldman
Miguel Fainstein
Viviana Siless
C. Delrieux
Emmanuel Iarussi
MedIm
12
0
0
17 Jun 2025
Early Prediction of Multiple Sclerosis Disability Progression via Multimodal Foundation Model Benchmarks
Maxime Usdin
Lito Kriara
Licinio Craveiro
AI4CE
11
0
0
17 Jun 2025
PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation
Ming Xu
Xu Zhang
3DH
44
0
0
17 Jun 2025
BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models
Bharath Dandala
Michael M. Danziger
Ella Barkan
Tanwi Biswas
Viatcheslav Gurev
...
Akira Koseki
Tal Kozlovski
Michal Rosen-Zvi
Yishai Shimoni
Ching-Huei Tsou
AI4CE
15
0
0
17 Jun 2025
FADPNet: Frequency-Aware Dual-Path Network for Face Super-Resolution
Siyu Xu
W. Li
Guangwei Gao
Jian Yang
Guo-Jun Qi
Chia-Wen Lin
CVBM
32
0
0
17 Jun 2025
Chaining Event Spans for Temporal Relation Grounding
Jongho Kim
Dohyeon Lee
Minsoo Kim
Seung-won Hwang
27
0
0
17 Jun 2025
Unified Representation Space for 3D Visual Grounding
Yinuo Zheng
Lipeng Gu
Honghua Chen
Liangliang Nan
Mingqiang Wei
16
0
0
17 Jun 2025
A multi-stage augmented multimodal interaction network for fish feeding intensity quantification
Shulong Zhang
Mingyuan Yao
Jiayin Zhao
Xiao Liu
Haihua Wang
16
0
0
17 Jun 2025
Vision Transformers for End-to-End Quark-Gluon Jet Classification from Calorimeter Images
Md Abrar Jahin
Shahriar Soudeep
Arian Rahman Aditta
M. F. Mridha
Nafiz Fahad
Md. Jakir Hossen
ViT
15
0
0
17 Jun 2025
Advances in Compliance Detection: Novel Models Using Vision-Based Tactile Sensors
Ziteng Li
Malte Kuhlmann
Ilana Nisky
Nicolás Navarro-Guerrero
10
0
0
17 Jun 2025
Improving LoRA with Variational Learning
Bai Cong
Nico Daheim
Yuesong Shen
Rio Yokota
Mohammad Emtiyaz Khan
Thomas Möllenhoff
19
0
0
17 Jun 2025
Forecasting the spatiotemporal evolution of fluid-induced microearthquakes with deep learning
Jaehong Chung
Michael Manga
Timothy Kneafsey
T. Mukerji
Mengsu Hu
10
0
0
17 Jun 2025
NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving
Ren Xin
Hongji Liu
Xiaodong Mei
Wenru Liu
Maosheng Ye
Zhili Chen
Jun Ma
22
0
0
17 Jun 2025
ResNets Are Deeper Than You Think
Christian H.X. Ali Mehmeti-Göpel
Michael Wand
15
0
0
17 Jun 2025
Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition
Jiamin Xie
Ju Lin
Yiteng Huang
Tyler Vuong
Zhaojiang Lin
...
Peng Su
Prashant Rawat
Sangeeta Srivastava
Ming Sun
Florian Metze
15
0
0
17 Jun 2025
EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition
Bingxi Liu
Hao Chen
Shiyi Guo
Yihong Wu
Jinqiang Cui
Hong Zhang
7
0
0
16 Jun 2025
Action Dubber: Timing Audible Actions via Inflectional Flow
Wenlong Wan
Weiying Zheng
Tianyi Xiang
Guiqing Li
Shengfeng He
22
0
0
16 Jun 2025
Evolution of ReID: From Early Methods to LLM Integration
Amran Bhuiyan
Mizanur Rahman
Md Tahmid Rahman Laskar
Aijun An
Jimmy Xiangji Huang
VLM
12
0
0
16 Jun 2025
Gated Rotary-Enhanced Linear Attention for Long-term Sequential Recommendation
Juntao Hu
Wei Zhou
Huayi Shen
Xiao Du
Jie Liao
Junhao Wen
Min Gao
20
0
0
16 Jun 2025
Sketched Sum-Product Networks for Joins
Brian Tsan
Abylay Amanbayev
Asoke Datta
Florin Rusu
24
0
0
16 Jun 2025
Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding
Nikkie Hooman
Zhongjie Wu
Eric C. Larson
Mehak Gupta
20
0
0
16 Jun 2025
ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
Han Zhu
Wei Kang
Zengwei Yao
Liyong Guo
Fangjun Kuang
Zhaoqing Li
Weiji Zhuang
Long Lin
Daniel Povey
29
0
0
16 Jun 2025
HierVL: Semi-Supervised Segmentation leveraging Hierarchical Vision-Language Synergy with Dynamic Text-Spatial Query Alignment
Numair Nadeem
Saeed Anwar
Muhammad Asad
Abdul Bais
VLM
22
0
0
16 Jun 2025
IKDiffuser: A Generative Inverse Kinematics Solver for Multi-arm Robots via Diffusion Model
Zeyu Zhang
Ziyuan Jiao
18
0
0
16 Jun 2025
Towards Pervasive Distributed Agentic Generative AI -- A State of The Art
Gianni Molinari
Fabio Ciravegna
LLMAG
LM&Ro
AI4CE
29
0
0
16 Jun 2025
SatHealth: A Multimodal Public Health Dataset with Satellite-based Environmental Factors
Yuanlong Wang
Pengqi Wang
Changchang Yin
Ping Zhang
LM&MA
22
0
0
16 Jun 2025
GITO: Graph-Informed Transformer Operator for Learning Complex Partial Differential Equations
Milad Ramezankhani
Janak M. Patel
A. Deodhar
Dagnachew Birru
AI4CE
20
0
0
16 Jun 2025
PhenoKG: Knowledge Graph-Driven Gene Discovery and Patient Insights from Phenotypes Alone
Kamilia Zaripova
Ege Özsoy
Nassir Navab
Azade Farshad
15
0
0
16 Jun 2025
Prefix-Tuning+: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention
Haonan Wang
Brian K Chen
Siquan Li
Xinhe Liang
Hwee Kuan Lee
Kenji Kawaguchi
Tianyang Hu
21
0
0
16 Jun 2025
Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization
Guanghui Song
Dongping Liao
Yiren Zhao
Kejiang Ye
Cheng-zhong Xu
X. Gao
MoE
14
0
0
16 Jun 2025
FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception
Mingyuan Li
Tong Jia
Han Gu
Hui Lu
Hao Wang
Bowen Ma
Shuyang Lin
Shiyi Guo
Shizhuo Deng
Dongyue Chen
29
0
0
16 Jun 2025
Improving Prostate Gland Segmenting Using Transformer based Architectures
Shatha Abudalou
18
0
0
16 Jun 2025
A Survey on Imitation Learning for Contact-Rich Tasks in Robotics
T. Tsuji
Y. Kato
Gokhan Solak
Heng Zhang
Tadej Petrič
Francesco Nori
Arash Ajoudani
16
0
0
16 Jun 2025
KEPLA: A Knowledge-Enhanced Deep Learning Framework for Accurate Protein-Ligand Binding Affinity Prediction
Han Liu
Keyan Ding
Peilin Chen
Yinwei Wei
Liqiang Nie
Dapeng Oliver Wu
Shiqi Wang
14
0
0
16 Jun 2025
Symmetry in Neural Network Parameter Spaces
Bo Zhao
Robin Walters
Rose Yu
22
0
0
16 Jun 2025
Detecting Hard-Coded Credentials in Software Repositories via LLMs
Chidera Biringa
Gökhan Kul
23
0
0
16 Jun 2025
Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis
Martina Pastorino
Michael Alibani
Nicola Acito
Gabriele Moser
17
0
0
16 Jun 2025
HELENA: High-Efficiency Learning-based channel Estimation using dual Neural Attention
Miguel Camelo Botero
Esra Aycan Beyazit
Nina Slamnik-Kriještorac
Johann M. Marquez-Barja
7
0
0
16 Jun 2025
Previous
1
2
3
4
5
6
...
542
543
544
Next