Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 17,634 papers shown
Title
Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach
Ruikun Hou
B. Bühler
Tim Fütterer
Efe Bozkir
Peter Gerjets
Ulrich Trautwein
Enkelejda Kasneci
31
0
0
12 May 2025
Latent Behavior Diffusion for Sequential Reaction Generation in Dyadic Setting
Minh-Duc Nguyen
Hyung-Jeong Yang
Soo-Hyung Kim
Ji-Eun Shin
Seung-Won Kim
DiffM
36
0
0
12 May 2025
Anatomical Attention Alignment representation for Radiology Report Generation
Quang Vinh Nguyen
Minh Duc Nguyen
Thanh Hoang Son Vo
Hyung-Jeong Yang
Soo-Hyung Kim
MedIm
28
0
0
12 May 2025
TSLFormer: A Lightweight Transformer Model for Turkish Sign Language Recognition Using Skeletal Landmarks
Kutay Ertürk
Furkan Altınışık
İrem Sarıaltın
Ömer Nezih Gerek
SLR
42
0
0
11 May 2025
NetSight: Graph Attention Based Traffic Forecasting in Computer Networks
Jinming Xing
Guoheng Sun
Hui Sun
Linchao Pan
Shakir Mahmood
Xuanhao Luo
Muhammad Shahzad
31
0
0
11 May 2025
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
34
0
0
11 May 2025
Near-Field Channel Estimation for XL-MIMO: A Deep Generative Model Guided by Side Information
Zhenzhou Jin
Li You
Derrick Wing Kwan Ng
Xiang-Gen Xia
Xiqi Gao
29
2
0
11 May 2025
LLM-Augmented Chemical Synthesis and Design Decision Programs
Haorui Wang
Jeff Guo
Lingkai Kong
R. Ramprasad
Philippe Schwaller
Yuanqi Du
Chao Zhang
31
0
0
11 May 2025
Efficient and Robust Multidimensional Attention in Remote Physiological Sensing through Target Signal Constrained Factorization
Jitesh Joshi
Youngjun Cho
26
0
0
11 May 2025
Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria
Fengming Zhu
Fangzhen Lin
29
0
0
11 May 2025
RefPentester: A Knowledge-Informed Self-Reflective Penetration Testing Framework Based on Large Language Models
Hanzheng Dai
Yuanliang Li
Zhibo Zhang
Jun Yan
26
0
0
11 May 2025
Learning curves theory for hierarchically compositional data with power-law distributed features
Francesco Cagnetta
Hyunmo Kang
M. Wyart
38
0
0
11 May 2025
NeuRN: Neuro-inspired Domain Generalization for Image Classification
Hamd Jalil
Ahmed Qazi
Asim Iqbal
OOD
MedIm
26
0
0
11 May 2025
Mice to Machines: Neural Representations from Visual Cortex for Domain Generalization
Ahmed Qazi
Hamd Jalil
Asim Iqbal
OOD
33
0
0
11 May 2025
NeuGen: Amplifying the Ñeural' in Neural Radiance Fields for Domain Generalization
Ahmed Qazi
Abdul Basit
Asim Iqbal
AI4CE
23
0
0
11 May 2025
Towards Human-Centric Autonomous Driving: A Fast-Slow Architecture Integrating Large Language Model Guidance with Reinforcement Learning
Chengkai Xu
Jiaqi Liu
Yicheng Guo
Wenjie Qu
Peng Hang
Jian Sun
33
0
0
11 May 2025
Unraveling Quantum Environments: Transformer-Assisted Learning in Lindblad Dynamics
Chi-Sheng Chen
En-Jui Kuo
AI4CE
34
0
0
11 May 2025
Decoding Futures Price Dynamics: A Regularized Sparse Autoencoder for Interpretable Multi-Horizon Forecasting and Factor Discovery
Abhijit Gupta
31
0
0
11 May 2025
Hand-Shadow Poser
Hao Xu
Yinqiao Wang
Niloy J. Mitra
Shuaicheng Liu
Pheng-Ann Heng
Chi-Wing Fu
3DH
36
0
0
11 May 2025
Image Classification Using a Diffusion Model as a Pre-Training Model
Kosuke Ukita
Ye Xiaolong
Tsuyoshi Okita
DiffM
MedIm
VLM
37
0
0
11 May 2025
Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction
Jian Song
Hongruixuan Chen
Naoto Yokoya
34
0
0
11 May 2025
Attention Mechanisms in Dynamical Systems: A Case Study with Predator-Prey Models
David Balaban
19
0
0
10 May 2025
Dynamic Domain Information Modulation Algorithm for Multi-domain Sentiment Analysis
Chunyi Yue
Ang Li
34
0
0
10 May 2025
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Lu Dong
Han Zhang
Hongjie Zhang
Yuanmin Huang
Z. Ling
Yu Qiao
Limin Wang
Yishuo Wang
AI4TS
43
0
0
10 May 2025
GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images
Chengfeng Wang
Wei Zhai
Yuhang Yang
Yang Cao
Zhengjun Zha
3DH
36
0
0
10 May 2025
Improving Generalization of Medical Image Registration Foundation Model
Jing Hu
Kaiwei Yu
Hongjiang Xian
Shu Hu
Xin Wang
MedIm
34
0
0
10 May 2025
Using External knowledge to Enhanced PLM for Semantic Matching
Min Li
Chun Yuan
37
0
0
10 May 2025
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Zihan Qiu
Zhaoxiang Wang
Bo Zheng
Zeyu Huang
Kaiyue Wen
...
Fei Huang
Suozhi Huang
Dayiheng Liu
Jingren Zhou
Junyang Lin
MoE
40
0
0
10 May 2025
ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images
Xianghao Kong
Qiaosong Qi
Yuanbin Wang
Anyi Rao
Biaolong Chen
Aixi Zhang
Si Liu
Hao Jiang
DiffM
VGen
25
0
0
10 May 2025
Burger: Robust Graph Denoising-augmentation Fusion and Multi-semantic Modeling in Social Recommendation
Yuqin Lan
31
0
0
10 May 2025
A Short Overview of Multi-Modal Wi-Fi Sensing
Zijian Zhao
31
0
0
10 May 2025
JAEGER: Dual-Level Humanoid Whole-Body Controller
Ziluo Ding
Haobin Jiang
Yuxuan Wang
Zhenguo Sun
Yu Zhang
Xiaojie Niu
M. Yang
Weishuai Zeng
Xinrun Xu
Zongqing Lu
31
0
0
10 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLM
LRM
45
0
0
10 May 2025
Attention Is Not All You Need: The Importance of Feedforward Networks in Transformer Models
Isaac Gerber
34
0
0
10 May 2025
xGen-small Technical Report
Erik Nijkamp
Bo Pang
Egor Pakhomov
Akash Gokul
Jin Qu
Silvio Savarese
Yingbo Zhou
Caiming Xiong
LLMAG
58
0
0
10 May 2025
I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference
Zibo Gao
Junjie Hu
Feng Guo
Yixin Zhang
Yinglong Han
Siyuan Liu
Haiyang Li
Zhiqiang Lv
31
0
0
10 May 2025
Learning Sequential Kinematic Models from Demonstrations for Multi-Jointed Articulated Objects
Anmol Gupta
Weiwei Gu
Omkar Patil
Jun Ki Lee
N. Gopalan
29
0
0
09 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
34
0
0
09 May 2025
UniSymNet: A Unified Symbolic Network Guided by Transformer
Xinxin Li
Juan Zhang
Da Li
Xingyu Liu
Jin Xu
Junping Yin
34
0
0
09 May 2025
Physics-informed Temporal Difference Metric Learning for Robot Motion Planning
Ruiqi Ni
Zherong Pan
A. H. Qureshi
SSL
41
0
0
09 May 2025
LightNobel: Improving Sequence Length Limitation in Protein Structure Prediction Model via Adaptive Activation Quantization
Seunghee Han
S. Choi
Joo-Young Kim
31
0
0
09 May 2025
CellVerse: Do Large Language Models Really Understand Cell Biology?
Fan Zhang
Tianyu Liu
Zhihong Zhu
Yu Wang
Haoyu Wang
Donghao Zhou
Yefeng Zheng
Kun Wang
X. Wu
Pheng-Ann Heng
ELM
36
0
0
09 May 2025
The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review
Jingguo Qu
Xinyang Han
Man-Lik Chui
Yao Pu
Simon Takadiyi Gunda
...
Jing Qin
Ann Dorothy King
Winnie Chiu-Wing Chu
J. Cai
Michael Tin-Cheung Ying
31
0
0
09 May 2025
Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification
Leon Eshuijs
Shihan Wang
Antske Fokkens
31
0
0
09 May 2025
Anymate: A Dataset and Baselines for Learning 3D Object Rigging
Yufan Deng
Yuhao Zhang
Chen Geng
Shangzhe Wu
Jiajun Wu
3DH
55
0
0
09 May 2025
Efficient Fairness Testing in Large Language Models: Prioritizing Metamorphic Relations for Bias Detection
Suavis Giramata
Madhusudan Srinivasan
Venkat Naidu Gudivada
Upulee Kanewala
31
0
0
09 May 2025
Generative Discovery of Partial Differential Equations by Learning from Math Handbooks
Hao Xu
Y. Chen
Rui Cao
Tianning Tang
Mengge Du
Jiacheng Li
Adrian H. Callaghan
Dongxiao Zhang
31
0
0
09 May 2025
Improving Generalizability of Kolmogorov-Arnold Networks via Error-Correcting Output Codes
Youngjoon Lee
J. Gong
Joonhyuk Kang
26
0
0
09 May 2025
Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition
Zhiyuan Chen
Keyi Li
Yifan Jia
Le Ye
Yufei Ma
DiffM
37
0
0
09 May 2025
TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries
Jinze Lv
Jian Chen
Zi Long
Xianghua Fu
Yin Chen
VGen
47
0
0
09 May 2025
Previous
1
2
3
4
5
...
351
352
353
Next