ResearchTrend.AI
Attention Is All You Need
arXiv: 1706.03762

12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
    3DV

Papers citing "Attention Is All You Need"

50 / 17,634 papers shown
Title
Emergence of Structure in Ensembles of Random Neural Networks
Luca Muscarnera
Luigi Loreti
Giovanni Todeschini
Alessio Fumagalli
Francesco Regazzoni
31
0
0
15 May 2025
PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context Optimization
Yidan Wang
Yanan Cao
Yubing Ren
Fang Fang
Zheng Lin
Binxing Fang
PILM
44
0
0
15 May 2025
Continuity and Isolation Lead to Doubts or Dilemmas in Large Language Models
Hector Pasten
Felipe Urrutia
Hector Jimenez
Cristian B. Calderon
Cristóbal Rojas
Alexander Kozachinskiy
17
0
0
15 May 2025
Tracr-Injection: Distilling Algorithms into Pre-trained Language Models
Tomás Vergara-Browne
Álvaro Soto
17
0
0
15 May 2025
From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models
Yidan Wang
Yubing Ren
Yanan Cao
Binxing Fang
32
0
0
15 May 2025
VRU-CIPI: Crossing Intention Prediction at Intersections for Improving Vulnerable Road Users Safety
Ahmed S. Abdelrahman
Mohamed Abdel-Aty
Quoc Dai Tran
24
0
0
15 May 2025
Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
Dusit Niyato
26
0
0
15 May 2025
JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation
Tiancong Cheng
Ying Zhang
Yuxuan Liang
Roger Zimmermann
Zhiwen Yu
Bin Guo
VLM
24
0
0
15 May 2025
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models
Yuncheng Guo
Xiaodong Gu
OffRL
VLM
32
0
0
15 May 2025
Embodied AI in Machine Learning -- is it Really Embodied?
Matej Hoffmann
Shubhan Patni
LM&Ro
AI4CE
22
0
0
15 May 2025
AttentionGuard: Transformer-based Misbehavior Detection for Secure Vehicular Platoons
Hexu Li
Konstantinos Kalogiannis
Ahmed Mohamed Hussain
P. Papadimitratos
19
0
0
15 May 2025
Modular Robot Control with Motor Primitives
Moses C. Nah
Johannes Lachner
Neville Hogan
21
0
0
15 May 2025
IMITATE: Image Registration with Context for unknown time frame recovery
Ziad Kheil
Lucas Robinet
Laurent Risser
Soleakhena Ken
26
0
0
15 May 2025
GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding
Yuki Kawana
Shintaro Shiba
Quan Kong
Norimasa Kobori
17
0
0
15 May 2025
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
Jun Guo
Xiaojian Ma
Yikai Wang
Min Yang
Huaping Liu
Qing Li
VGen
34
0
0
15 May 2025
Parallel Scaling Law for Language Models
Mouxiang Chen
Binyuan Hui
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
Jianling Sun
Junyang Lin
Zhongxin Liu
MoE
LRM
37
0
0
15 May 2025
Diffusion-SAFE: Shared Autonomy Framework with Diffusion for Safe Human-to-Robot Driving Handover
Yunxin Fan
Monroe Kennedy III
23
0
0
15 May 2025
ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization
Wenhao Shen
Wanqi Yin
Xiaofeng Yang
Cheng Chen
Chaoyue Song
Zhongang Cai
Lei Yang
Hao Wang
Guosheng Lin
36
0
0
15 May 2025
Sequential Treatment Effect Estimation with Unmeasured Confounders
Yingrong Wang
Anpeng Wu
Yangqiu Song
Ziyang Xiao
Ruoxuan Xiong
Qing Han
Kun Kuang
CML
38
0
0
14 May 2025
Beyond Pixels: Leveraging the Language of Soccer to Improve Spatio-Temporal Action Detection in Broadcast Videos
Jeremie Ochin
Raphael Chekroun
Bogdan Stanciulescu
Sotiris Manitsaris
19
0
0
14 May 2025
SALM: A Multi-Agent Framework for Language Model-Driven Social Network Simulation
Gaurav Koley
LLMAG
26
0
0
14 May 2025
APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression
Srinivas Ravuri
Yuan Xu
Martin Ludwig Zehetner
Ketan Motlag
Sahin Albayrak
16
0
0
14 May 2025
Imitation Learning for Adaptive Control of a Virtual Soft Exoglove
Shirui Lyu
Vittorio Caggiano
Matteo Leonetti
Dario Farina
Letizia Gionfrida
18
0
0
14 May 2025
LiDDA: Data Driven Attribution at LinkedIn
John Bencina
Erkut Aykutlug
Yue Chen
Zerui Zhang
Stephanie Sorenson
Shao Tang
Changshuai Wei
19
0
0
14 May 2025
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
Minh Hoang Nguyen
Linh Le Pham Van
Thommen George Karimpanal
Sunil Gupta
Hung Le
OffRL
LRM
37
0
0
14 May 2025
Text-driven Motion Generation: Overview, Challenges and Directions
Ali Rida Sahili
Najett Neji
Hedi Tabia
VGen
38
0
0
14 May 2025
AdaFortiTran: An Adaptive Transformer Model for Robust OFDM Channel Estimation
Berkay Guler
Hamid Jafarkhani
21
1
0
14 May 2025
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias
Brandon Smith
Mohamed Reda Bouadjenek
Tahsin Alamgir Kheya
Phillip Dawson
S. Aryal
ALM
ELM
26
0
0
14 May 2025
Don't Forget your Inverse DDIM for Image Editing
Guillermo Gomez-Trenado
Pablo Mesejo
Ó. Cordón
Stéphane Lathuilière
DiffM
28
0
0
14 May 2025
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Subrit Dikshit
Ritu Tiwari
Priyank Jain
24
0
0
14 May 2025
A Generative Neural Annealer for Black-Box Combinatorial Optimization
Yuan-Hang Zhang
M. Di Ventra
29
0
0
14 May 2025
Towards Fair In-Context Learning with Tabular Foundation Models
Patrik Kenfack
Samira Ebrahimi Kahou
Ulrich Aïvodji
21
0
0
14 May 2025
Accelerating Machine Learning Systems via Category Theory: Applications to Spherical Attention for Gene Regulatory Networks
Vincent Abbott
Kotaro Kamiya
Gerard Glowacki
Yu Atsumi
Gioele Zardini
Yoshihiro Maruyama
29
0
0
14 May 2025
Out-of-distribution generalisation is hard: evidence from ARC-like tasks
George Dimitriadis
Spyridon Samothrakis
29
0
0
14 May 2025
UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing
Yung-Hsuan Lai
Janek Ebbers
Yu-Chiang Frank Wang
François Germain
Michael Jeffrey Jones
Moitreya Chatterjee
26
0
0
14 May 2025
Efficient LiDAR Reflectance Compression via Scanning Serialization
Jiahao Zhu
Kang-Soo You
Dandan Ding
Zhan Ma
28
0
0
14 May 2025
Contactless Cardiac Pulse Monitoring Using Event Cameras
Mohamed Moustafa
Joseph Lemley
Peter Corcoran
21
0
0
14 May 2025
Dyadic Mamba: Long-term Dyadic Human Motion Synthesis
Julian Tanke
Takashi Shibuya
Kengo Uchida
Koichi Saito
Yuki Mitsufuji
Mamba
47
0
0
14 May 2025
A 2D Semantic-Aware Position Encoding for Vision Transformers
Xi Chen
Shiyang Zhou
Muqi Huang
Jiaxu Feng
Yun Xiong
...
Yujie Zhang
Huishuai Bao
Sijia Peng
Chong Li
Feng Shi
ViT
31
0
0
14 May 2025
Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt
Bin-Bin Gao
37
4
0
14 May 2025
CAD-Coder: Text-Guided CAD Files Code Generation
Changqi He
Shuhan Zhang
Liguo Zhang
Jiajun Miao
34
0
0
13 May 2025
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking
Haofeng Liu
Mingqi Gao
Xuxiao Luo
Ziyue Wang
Guanyi Qin
J. Wu
Yueming Jin
37
0
0
13 May 2025
WaLLM -- Insights from an LLM-Powered Chatbot deployment via WhatsApp
Hiba Eltigani
Rukhshan Haroon
Asli Kocak
Abdullah Bin Faisal
Noah Martin
Fahad Dogar
19
0
0
13 May 2025
Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning
Ardian Selmonaj
Oleg Szehr
Giacomo Del Rio
Alessandro Antonucci
Adrian Schneider
Michael Rüegsegger
29
0
0
13 May 2025
TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series
Xiaolei Qin
Di Wang
Jingyang Zhang
Fengxiang Wang
Xin Su
Bo Du
Liangpei Zhang
AI4TS
24
0
0
13 May 2025
Foundation Models Knowledge Distillation For Battery Capacity Degradation Forecast
Joey Chan
Zhen Chen
Ershun Pan
34
0
0
13 May 2025
Big Data and the Computational Social Science of Entrepreneurship and Innovation
Ningzi Li
Shiyang Lai
James Evans
AILaw
29
0
0
13 May 2025
Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies
Xiaoliang Luo
Xinyi Xu
Michael Ramscar
Bradley C. Love
30
0
0
13 May 2025
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Yifu Yuan
Haiqin Cui
Yibin Chen
Zibin Dong
Fei Ni
Longxin Kou
Jinyi Liu
Pengyi Li
Yan Zheng
Jianye Hao
31
0
0
13 May 2025
Reinforcement Learning-based Fault-Tolerant Control for Quadrotor with Online Transformer Adaptation
Dohyun Kim
Jayden Dongwoo Lee
Hyochoong Bang
Jungho Bae
33
0
0
13 May 2025