Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 17,634 papers shown
Title
VIViT: Variable-Input Vision Transformer Framework for 3D MR Image Segmentation
Badhan Kumar Das
Ajay Singh
Gengyan Zhao
Han Liu
Thomas J. Re
Dorin Comaniciu
Eli Gibson
Andreas Maier
ViT
MedIm
29
0
0
13 May 2025
Small but Significant: On the Promise of Small Language Models for Accessible AIED
Yumou Wei
Paulo Carvalho
John Stamper
SyDa
45
0
0
13 May 2025
Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning
Ardian Selmonaj
Oleg Szehr
Giacomo Del Rio
Alessandro Antonucci
Adrian Schneider
Michael Rüegsegger
29
0
0
13 May 2025
Knowledge-Informed Deep Learning for Irrigation Type Mapping from Remote Sensing
Oishee Bintey Hoque
Nibir Chandra Mandal
Abhijin Adiga
Samarth Swarup
S. Nouwakpo
Amanda Wilson
Madhav Marathe
31
0
0
13 May 2025
DHECA-SuperGaze: Dual Head-Eye Cross-Attention and Super-Resolution for Unconstrained Gaze Estimation
Franko Šikić
Donik Vršnak
Sven Lončarić
24
0
0
13 May 2025
Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions
Lata Pangtey
Anukriti Bhatnagar
Shubhi Bansal
Shahid Shafi Dar
Nagendra Kumar
34
0
0
13 May 2025
TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series
Xiaolei Qin
Di Wang
Jingyang Zhang
Fengxiang Wang
Xin Su
Bo Du
Liangpei Zhang
AI4TS
24
0
0
13 May 2025
Big Data and the Computational Social Science of Entrepreneurship and Innovation
Ningzi Li
Shiyang Lai
James Evans
AILaw
29
0
0
13 May 2025
Implet: A Post-hoc Subsequence Explainer for Time Series Models
Fanyu Meng
Ziwen Kan
Shahbaz Rezaei
Z. Kong
Xin Chen
Xin Liu
AI4TS
29
0
0
13 May 2025
ADC-GS: Anchor-Driven Deformable and Compressed Gaussian Splatting for Dynamic Scene Reconstruction
He Huang
Qi Yang
Mufan Liu
Yiling Xu
Zhu Li
3DGS
42
0
0
13 May 2025
Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies
Xiaoliang Luo
Xinyi Xu
Michael Ramscar
Bradley C. Love
30
0
0
13 May 2025
Towards Adaptive Meta-Gradient Adversarial Examples for Visual Tracking
Wei-Long Tian
Peng Gao
Xiao Liu
Long Xu
Hamido Fujita
Hanan Aljuai
Mao-Li Wang
AAML
29
0
0
13 May 2025
Large Language Models for Computer-Aided Design: A Survey
Licheng Zhang
Bach Le
Naveed Akhtar
Siew-Kei Lam
Tuan Ngo
3DV
AI4CE
40
0
0
13 May 2025
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Chiara Manna
Afra Alishahi
Frédéric Blain
Eva Vanmassenhove
27
0
0
13 May 2025
WaLLM -- Insights from an LLM-Powered Chatbot deployment via WhatsApp
Hiba Eltigani
Rukhshan Haroon
Asli Kocak
Abdullah Bin Faisal
Noah Martin
Fahad Dogar
19
0
0
13 May 2025
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Yifu Yuan
Haiqin Cui
Yibin Chen
Zibin Dong
Fei Ni
Longxin Kou
Jinyi Liu
Pengyi Li
Yan Zheng
Jianye Hao
31
0
0
13 May 2025
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Chang Zong
Yueting Zhuang
Jian Shao
Weiming Lu
44
0
0
13 May 2025
Enhancing the Efficiency of Complex Systems Crystal Structure Prediction by Active Learning Guided Machine Learning Potential
Jiaxiang Li
Junwei Feng
Jie Luo
Bowen Jiang
Xiangyu Zheng
...
Keith Butler
Hanyu Liu
Congwei Xie
Yu Xie
Yanming Ma
31
0
0
13 May 2025
When repeats drive the vocabulary: a Byte-Pair Encoding analysis of T2T primate genomes
Marina Popova
Iaroslav Chelombitko
Aleksey Komissarov
25
0
0
13 May 2025
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
Mamba
54
0
0
13 May 2025
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference
Tollef Emil Jørgensen
MQ
54
0
0
13 May 2025
ForeCite: Adapting Pre-Trained Language Models to Predict Future Citation Rates of Academic Papers
Gavin Hull
Alex Bihlo
29
0
0
13 May 2025
Decoupled Multimodal Prototypes for Visual Recognition with Missing Modalities
Jueqing Lu
Yuanyuan Qi
Xiaohao Yang
Shujie Zhou
Lan Du
34
0
0
13 May 2025
FauForensics: Boosting Audio-Visual Deepfake Detection with Facial Action Units
Jian Wang
Baoyuan Wu
Li Liu
Qingshan Liu
AAML
39
0
0
13 May 2025
Reinforcement Learning-based Fault-Tolerant Control for Quadrotor with Online Transformer Adaptation
Dohyun Kim
Jayden Dongwoo Lee
Hyochoong Bang
Jungho Bae
33
0
0
13 May 2025
UMoE: Unifying Attention and FFN with Shared Experts
Yuanhang Yang
Chaozheng Wang
Jing Li
MoE
29
0
0
12 May 2025
AIS Data-Driven Maritime Monitoring Based on Transformer: A Comprehensive Review
Zhiye Xie
Enmei Tu
Xianping Fu
Guoliang Yuan
Yi Han
31
0
0
12 May 2025
A Generative Re-ranking Model for List-level Multi-objective Optimization at Taobao
Yue Meng
Cheng Guo
Yi Cao
Tong Liu
Bo Zheng
26
0
0
12 May 2025
Self-Supervised Transformer-based Contrastive Learning for Intrusion Detection Systems
Ippokratis Koukoulis
Ilias Syrigos
Thanasis Korakis
26
0
0
12 May 2025
LECTOR: Summarizing E-book Reading Content for Personalized Student Support
Erwin Daniel López Zapata
Cheng Tang
Valdemar Švábenský
Fumiya Okubo
Atsushi Shimada
24
0
0
12 May 2025
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain
Wenzhen Yue
Yong-Jin Liu
Haoxuan Li
Hao Wang
Xianghua Ying
Ruohao Guo
Bowei Xing
Ji Shi
AI4TS
OOD
34
0
0
12 May 2025
Diffusion-driven SpatioTemporal Graph KANsformer for Medical Examination Recommendation
Jiacheng Li
Yangtao Zhou
Zhifu Zhao
Qinglan Huang
Jian Qi
Xiao He
Hua Chu
Fu Li
DiffM
MedIm
58
0
0
12 May 2025
Thoughts on Objectives of Sparse and Hierarchical Masked Image Model
Asahi Miyazaki
Tsuyoshi Okita
24
0
0
12 May 2025
From Search To Sampling: Generative Models For Robust Algorithmic Recourse
Prateek Garg
Lokesh Nagalapatti
Sunita Sarawagi
31
0
0
12 May 2025
Anatomical Attention Alignment representation for Radiology Report Generation
Quang Vinh Nguyen
Minh Duc Nguyen
Thanh Hoang Son Vo
Hyung-Jeong Yang
Soo-Hyung Kim
MedIm
28
0
0
12 May 2025
ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning
Hongyin Zhang
Zifeng Zhuang
Han Zhao
Pengxiang Ding
Hongchao Lu
Donglin Wang
OffRL
44
0
0
12 May 2025
Accountability of Generative AI: Exploring a Precautionary Approach for "Artificially Created Nature"
Yuri Nakao
29
0
0
12 May 2025
The Geography of Transportation Cybersecurity: Visitor Flows, Industry Clusters, and Spatial Dynamics
Yanjie Wang
Kailai Wang
Songhua Hu
Yunpeng
Zhang
Gino Lim
Pengyu Zhu
29
0
0
12 May 2025
Fused3S: Fast Sparse Attention on Tensor Cores
Zitong Li
Aparna Chandramowlishwaran
GNN
47
0
0
12 May 2025
ReCDAP: Relation-Based Conditional Diffusion with Attention Pooling for Few-Shot Knowledge Graph Completion
Jeongho Kim
Chanyeong Heo
Jaehee Jung
36
0
0
12 May 2025
DexWild: Dexterous Human Interactions for In-the-Wild Robot Policies
Tony Tao
Mohan Kumar Srirama
Jason Jingzhou Liu
Kenneth Shaw
Deepak Pathak
31
0
0
12 May 2025
Channel Fingerprint Construction for Massive MIMO: A Deep Conditional Generative Approach
Zhenzhou Jin
Li You
Xudong Li
Zhen Gao
Yuanwei Liu
Xiang-Gen Xia
Xiqi Gao
DiffM
33
0
0
12 May 2025
A Comparative Study of Transformer-Based Models for Multi-Horizon Blood Glucose Prediction
Meryem Altin Karagoz
Marc D. Breton
Anas El Fathi
AI4TS
31
0
0
12 May 2025
A Comparative Analysis of Static Word Embeddings for Hungarian
Máté Gedeon
39
0
0
12 May 2025
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
Yi Zhang
Ruihong Qiu
Xuwei Xu
Jiajun Liu
Sen Wang
OffRL
34
0
0
12 May 2025
Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach
Ruikun Hou
B. Bühler
Tim Fütterer
Efe Bozkir
Peter Gerjets
Ulrich Trautwein
Enkelejda Kasneci
31
0
0
12 May 2025
DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
Mingqian Ji
Jian Yang
Shanshan Zhang
3DPC
MDE
45
0
0
12 May 2025
Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data
David de-Fitero-Dominguez
Antonio Garcia-Cabot
Eva García-López
SyDa
71
0
0
12 May 2025
AI in Money Matters
Nadine Sandjo Tchatchoua
Richard Harper
31
0
0
12 May 2025
SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models
Hang Wu
Jianian Zhu
Yong Li
Haojie Wang
Biao Hou
Jidong Zhai
40
0
0
12 May 2025
Previous
1
2
3
4
5
6
...
351
352
353
Next