Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 27,180 papers shown
Title
Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models
Wei Peng
Kang Liu
Jianchen Hu
Meng Zhang
VLM
LM&MA
115
0
0
08 May 2025
A Simple Detector with Frame Dynamics is a Strong Tracker
Chenxu Peng
Changbo Wang
Minrui Zou
Danyang Li
Zhiyong Yang
Yimian Dai
Ming-Ming Cheng
Xiang Li
105
0
0
08 May 2025
Rethinking Invariance in In-context Learning
Lizhe Fang
Yifei Wang
Khashayar Gatmiry
Lei Fang
Yun Wang
104
6
0
08 May 2025
Augmented Deep Contexts for Spatially Embedded Video Coding
Yifan Bian
Chuanbo Tang
L. Li
Dong Liu
80
0
0
08 May 2025
Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration
Andreas Kontogiannis
Konstantinos Papathanasiou
Yi Shen
Giorgos Stamou
Michael M. Zavlanos
G. Vouros
66
0
0
08 May 2025
Unpacking Robustness in Inflectional Languages: Adversarial Evaluation and Mechanistic Insights
Paweł Walkowiak
Marek Klonowski
Marcin Oleksy
Arkadiusz Janz
AAML
104
0
0
08 May 2025
SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation
Yonwoo Choi
3DGS
VGen
115
0
0
08 May 2025
Defending against Indirect Prompt Injection by Instruction Detection
Tongyu Wen
Chenglong Wang
Xiyuan Yang
Haoyu Tang
Yueqi Xie
Lingjuan Lyu
Zhicheng Dou
Fangzhao Wu
AAML
83
1
0
08 May 2025
Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement
Julius Richter
Danilo de Oliveira
Timo Gerkmann
DiffM
124
1
0
08 May 2025
Generative Models for Long Time Series: Approximately Equivariant Recurrent Network Structures for an Adjusted Training Scheme
Ruwen Fulek
Markus Lange-Hegermann
AI4TS
113
0
0
08 May 2025
Scalable LLM Math Reasoning Acceleration with Low-rank Distillation
Harry Dong
Bilge Acun
Beidi Chen
Yuejie Chi
LRM
76
0
0
08 May 2025
The Evolution of Embedding Table Optimization and Multi-Epoch Training in Pinterest Ads Conversion
Andrew Qiu
Shubham Barhate
Hin Wai Lui
Runze Su
Rafael Rios Müller
Kungang Li
Ling Leng
Han Sun
Shayan Ehsani
Zhifang Liu
88
0
0
08 May 2025
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution
Yufei Lin
Chengwei Ye
Jun Wang
Kangsheng Wang
Linuo Xu
Shuyan Liu
Zeyu Zhang
78
1
0
08 May 2025
Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes
Zhuocheng Gong
Jian Guan
Wei Wu
Huishuai Zhang
Dongyan Zhao
100
1
0
08 May 2025
Learning Item Representations Directly from Multimodal Features for Effective Recommendation
Xin Zhou
Xiaoxiong Zhang
Dusit Niyato
Zhiqi Shen
115
0
0
08 May 2025
Learning to Drive Anywhere with Model-Based Reannotation
Noriaki Hirose
Lydia Ignatova
Kyle Stachowicz
Catherine Glossop
Sergey Levine
Dhruv Shah
74
1
0
08 May 2025
InstanceGen: Image Generation with Instance-level Instructions
Etai Sella
Yanir Kleiman
Hadar Averbuch-Elor
93
0
0
08 May 2025
Trading Under Uncertainty: A Distribution-Based Strategy for Futures Markets Using FutureQuant Transformer
Wenhao Guo
Yuda Wang
Zeqiao Huang
Changjiang Zhang
Shumin ma
AIFin
44
0
0
08 May 2025
OWT: A Foundational Organ-Wise Tokenization Framework for Medical Imaging
Sifan Song
Siyeop Yoon
Pengfei Jin
Sekeun Kim
Matthew Tivnan
...
Zhiliang Lyu
Dufan Wu
Ning Guo
Xiang Li
Quanzheng Li
OOD
ViT
97
0
0
08 May 2025
DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion
Qitao Zhao
Amy Lin
Jeff Tan
Jason Y. Zhang
Deva Ramanan
Shubham Tulsiani
VGen
175
1
0
08 May 2025
T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction
Kun Peng
Chaodong Tong
Cong Cao
Hao Peng
Yue Liu
Guanlin Wu
Lei Jiang
Yanbing Liu
Philip S. Yu
LMTD
106
0
0
08 May 2025
Privacy-Preserving Transformers: SwiftKey's Differential Privacy Implementation
Abdelrahman Abouelenin
M. Abdelrehim
Raffy Fahim
Amr Hendy
Mohamed Afify
57
0
0
08 May 2025
Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization
Yuntai Bao
Xuhong Zhang
Tianyu Du
Xinkui Zhao
Jiang Zong
Hao Peng
Yuxiang Cai
TDI
117
0
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
114
0
0
08 May 2025
X-Driver: Explainable Autonomous Driving with Vision-Language Models
Wei Liu
Jingyun Zhang
Binxiong Zheng
Yufeng Hu
Yingzhan Lin
Zengfeng Zeng
VLM
LRM
127
1
0
08 May 2025
FF-PNet: A Pyramid Network Based on Feature and Field for Brain Image Registration
Ying Zhang
Shuai Guo
Chenxi Sun
Yuchen Zhu
Jinhai Xiang
MedIm
217
0
0
08 May 2025
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
Ahmed Abdelreheem
Filippo Aleotti
Jamie Watson
Z. Qureshi
Abdelrahman Eldesokey
Peter Wonka
Gabriel J. Brostow
Sara Vicente
Guillermo Garcia-Hernando
DiffM
135
0
0
08 May 2025
Nonlinear Motion-Guided and Spatio-Temporal Aware Network for Unsupervised Event-Based Optical Flow
Zuntao Liu
Hao Zhuang
Junjie Jiang
Yuhang Song
Zheng Fang
83
0
0
08 May 2025
Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models
Aishwarya Venkataramanan
P. Bodesheim
Joachim Denzler
BDL
VLM
100
0
0
08 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian Sun
Xinhu Zheng
Gang Wang
AI4CE
143
4
0
08 May 2025
VaCDA: Variational Contrastive Alignment-based Scalable Human Activity Recognition
Soham Khisa
Avijoy Chakma
87
0
0
08 May 2025
The Moon's Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction
Tom Sander
Moritz Tenthoff
Kay Wohlfarth
Christian Wöhler
110
0
0
08 May 2025
Diffusion Model Quantization: A Review
Qian Zeng
Chenggong Hu
Mingli Song
Jie Song
MQ
97
0
0
08 May 2025
FRAIN to Train: A Fast-and-Reliable Solution for Decentralized Federated Learning
Sanghyeon Park
Soo-Mook Moon
74
0
0
07 May 2025
Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs
Chetan Pathade
AAML
SILM
221
2
0
07 May 2025
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
Qingyuan Wang
Guoxin Wang
B. Cardiff
Deepu John
92
0
0
07 May 2025
SetONet: A Deep Set-based Operator Network for Solving PDEs with permutation invariant variable input sampling
Stepan Tretiakov
Xingjian Li
Krishna Kumar
72
0
0
07 May 2025
Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer
Sainath Dey
Mitul Goswami
Jashika Sethi
Prasant Kumar Pattnaik
ViT
85
0
0
07 May 2025
DFVO: Learning Darkness-free Visible and Infrared Image Disentanglement and Fusion All at Once
Qi Zhou
Yukai Shi
Xiaojun Yang
Xiaoyu Xian
Lunjia Liao
Ruimao Zhang
Liang Lin
100
0
0
07 May 2025
Fine-Tuning Large Language Models and Evaluating Retrieval Methods for Improved Question Answering on Building Codes
Mohammad Aqib
Mohd Hamza
Qipei Mei
Ying Hei Chui
RALM
ELM
92
0
0
07 May 2025
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
146
0
0
07 May 2025
Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing
Jacob Glenn Ayers
Buvaneswari A. Ramanan
Manzoor A. Khan
58
0
0
07 May 2025
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective
Songsong Duan
Xi Yang
Nannan Wang
Xinbo Gao
132
0
0
07 May 2025
Lossless Compression of Large Language Model-Generated Text via Next-Token Prediction
Yu Mao
Holger Pirk
Chun Jason Xue
54
0
0
07 May 2025
CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion
Yongqian Li
Pencheng Wan
Liang Han
Yaowei Wang
Liqiang Nie
Min Zhang
75
0
0
07 May 2025
Multi-Granular Attention based Heterogeneous Hypergraph Neural Network
Hong Jin
Kaicheng Zhou
Jie Yin
Lan You
Zhifeng Zhou
64
0
0
07 May 2025
AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding
Feng Xiao
Hongbin Xu
Guocan Zhao
Wenxiong Kang
245
0
0
07 May 2025
M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation
Qianru Zhang
Liang Qu
Honggang Wen
Dong Huang
Siu-Ming Yiu
Nguyen Quoc Viet Hung
Hongzhi Yin
Mamba
120
1
0
07 May 2025
In-Context Adaptation to Concept Drift for Learned Database Operations
Jiaqi Zhu
Shaofeng Cai
Yanyan Shen
Gang Chen
Fang Deng
Beng Chin Ooi
VLM
200
0
0
07 May 2025
ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via
α
α
α
-
β
β
β
-Divergence
Guanghui Wang
Zhiyong Yang
Ziyi Wang
Shi Wang
Qianqian Xu
Qingming Huang
287
0
0
07 May 2025
Previous
1
2
3
...
33
34
35
...
542
543
544
Next