1912.12180
Cited By
Axial Attention in Multidimensional Transformers
20 December 2019
Jonathan Ho
Nal Kalchbrenner
Dirk Weissenborn
Tim Salimans
Papers citing
"Axial Attention in Multidimensional Transformers"
50 / 287 papers shown
Out-of-distribution generalisation is hard: evidence from ARC-like tasks
George Dimitriadis
Spyridon Samothrakis
23
0
0
14 May 2025
UNet with Axial Transformer : A Neural Weather Model for Precipitation Nowcasting
Maitreya Sonawane
Sumit Mamtani
65
0
0
28 Apr 2025
Distilling semantically aware orders for autoregressive image generation
Rishav Pramanik
Antoine Poupon
Juan A. Rodriguez
Masih Aminbeidokhti
David Vazquez
Christopher Pal
Zhaozheng Yin
M. Pedersoli
31
0
0
23 Apr 2025
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer
Yang Liu
Zinan Zheng
Jiashun Cheng
Fugee Tsung
Deli Zhao
Yu Rong
J. Li
83
1
0
27 Feb 2025
Protein Large Language Models: A Comprehensive Survey
Yijia Xiao
Wanjia Zhao
Junkai Zhang
Yiqiao Jin
Han Zhang
...
Xiao Luo
Yu-Jie Zhang
James Y. Zou
Y. Sun
Wei Wang
LM&MA
AI4CE
54
3
0
21 Feb 2025
Universal Lesion Segmentation Challenge 2023: A Comparative Research of Different Algorithms
Kaiwen Shi
Yifei Li
Binh Ho
Jovian Wang
Kobe Guo
OOD
34
0
0
14 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
45
0
0
11 Feb 2025
ZETA: Leveraging Z-order Curves for Efficient Top-k Attention
Qiuhao Zeng
Jerry Huang
Peng Lu
Gezheng Xu
Boxing Chen
Charles X. Ling
Boyu Wang
49
1
0
24 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Jinwei Gu
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
149
0
0
21 Jan 2025
VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation Extraction
Khai Phan Tran
Wen Hua
Xue Li
SyDa
88
0
0
18 Dec 2024
Community Research Earth Digital Intelligence Twin (CREDIT)
John S. Schreck
Yingkai Sha
William E. Chapman
Dhamma Kimpara
Judith Berner
Seth McGinnis
Arnold Kazadi
Negin Sobhani
Ben Kirk
David John Gagne II
AI4Cl
34
1
0
09 Nov 2024
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis
Xinyu Geng
Jiaming Wang
Jun Xu
MedIm
29
0
0
03 Nov 2024
Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization
Zichen Wang
Yaokun Ji
Jianing Tian
Shuangjia Zheng
DiffM
30
0
0
19 Oct 2024
Learning to refine domain knowledge for biological network inference
Peiwen Li
Menghua Wu
CML
41
1
0
18 Oct 2024
Metalic: Meta-Learning In-Context with Protein Language Models
Jacob Beck
Shikha Surana
Manus McAuliffe
Oliver Bent
Thomas D. Barrett
Juan Jose Garau Luis
Paul Duckworth
AI4CE
28
0
0
10 Oct 2024
System-Level Safety Monitoring and Recovery for Perception Failures in Autonomous Vehicles
Kaustav Chakraborty
Zeyuan Feng
Sushant Veer
Apoorva Sharma
B. Ivanovic
Marco Pavone
Somil Bansal
29
1
0
26 Sep 2024
GASA-UNet: Global Axial Self-Attention U-Net for 3D Medical Image Segmentation
Chengkun Sun
Russell Stevens Terry
Jiang Bian
Jie Xu
3DPC
16
0
0
20 Sep 2024
PROSE-FD: A Multimodal PDE Foundation Model for Learning Multiple Operators for Forecasting Fluid Dynamics
Yuxuan Liu
Jingmin Sun
Xinjie He
Griffin Pinney
Zecheng Zhang
Hayden Schaeffer
AI4CE
37
6
0
15 Sep 2024
Macformer: Transformer with Random Maclaurin Feature Attention
Yuhan Guo
Lizhong Ding
Ye Yuan
Guoren Wang
46
0
0
21 Aug 2024
Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics Discovery
Yue Yu
Ning Liu
Fei Lu
Tian Gao
S. Jafarzadeh
Stewart Silling
AI4CE
48
7
0
14 Aug 2024
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
Zewen Du
Zhenjiang Hu
Guiyu Zhao
Ying Jin
Hongbin Ma
ViT
26
2
0
29 Jul 2024
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention
João D. Nunes
D. Montezuma
Domingos Oliveira
Tania Pereira
Jaime S. Cardoso
49
1
0
26 Jul 2024
CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Xiao Liu
Peng Gao
Tao Yu
Fei-Yue Wang
Ruyue Yuan
MedIm
ViT
33
14
0
25 Jul 2024
Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation
Hui Wei
Maxwell A. Xu
Colin Samplawski
James M. Rehg
Santosh Kumar
Benjamin M. Marlin
35
0
0
27 Jun 2024
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
Chao Lou
Zixia Jia
Zilong Zheng
Kewei Tu
ODL
31
18
0
24 Jun 2024
A Primal-Dual Framework for Transformers and Neural Networks
Tan M. Nguyen
Tam Nguyen
Nhat Ho
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
ViT
21
13
0
19 Jun 2024
MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training
Bo Chen
Zhilei Bei
Xingyi Cheng
Pan Li
Jie Tang
Le Song
37
4
0
08 Jun 2024
SFANet: Spatial-Frequency Attention Network for Weather Forecasting
Jiaze Wang
Hao Chen
Hongcan Xu
Jinpeng Li
Bo-Lan Wang
Kun Shao
Furui Liu
Huaxi Chen
Guangyong Chen
Pheng-Ann Heng
64
0
0
29 May 2024
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
You Huang
Zongyu Lan
Liujuan Cao
Xianming Lin
Shengchuan Zhang
Guannan Jiang
Rongrong Ji
VLM
27
2
0
29 May 2024
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao
Xinggang Wang
Lianghui Zhu
Qian Zhang
Chang Huang
54
4
0
28 May 2024
Activator: GLU Activation Function as the Core Component of a Vision Transformer
Abdullah Nazhat Abdullah
Tarkan Aydin
ViT
38
0
0
24 May 2024
ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution
Guillaume Couairon
Christian Lessig
A. Charantonis
C. Monteleoni
27
1
0
23 May 2024
Multi-scale Information Sharing and Selection Network with Boundary Attention for Polyp Segmentation
Xiaolu Kang
Zhuoqi Ma
Kang Liu
Yunan Li
Qiguang Miao
42
3
0
18 May 2024
CaFA: Global Weather Forecasting with Factorized Attention on Sphere
Zijie Li
Anthony Y. Zhou
Saurabh Patil
A. Farimani
42
6
0
12 May 2024
Efficient Bi-manipulation using RGBD Multi-model Fusion based on Attention Mechanism
Jian Shen
Jiaxin Huang
Zhigong Song
21
0
0
27 Apr 2024
HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images
Chengxi Han
Chen Wu
Haonan Guo
Meiqi Hu
Hongruixuan Chen
28
89
0
14 Apr 2024
State Space Models as Foundation Models: A Control Theoretic Overview
Carmen Amo Alonso
Jerome Sieber
M. Zeilinger
AI4CE
Mamba
36
13
0
25 Mar 2024
TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation
Santosh Sanjeev
F. Maani
Arsen Abzhanov
Vijay Ram Papineni
Ibrahim Almakky
Bartłomiej W. Papież
Mohammad Yaqub
MedIm
58
0
0
20 Mar 2024
EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
Abu Zahid Bin Aziz
Mokshagna Sai Teja Karanam
Tushar Kataria
Shireen Elhabian
ViT
MedIm
29
1
0
16 Mar 2024
UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation
Junhong Shen
Tanya Marwah
Ameet Talwalkar
AI4CE
42
2
0
11 Mar 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
36
0
0
04 Mar 2024
Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: A Benchmark Study
Prottay Kumar Adhikary
Aseem Srivastava
Shivani Kumar
Salam Michael Singh
Puneet Manuja
Jini K. Gopinath
Vijay Krishnan
Swati Kedia
K. Deb
Tanmoy Chakraborty
AI4MH
38
8
0
29 Feb 2024
Parallelized Spatiotemporal Binding
Gautam Singh
Yue Wang
Jiawei Yang
B. Ivanovic
Sungjin Ahn
Marco Pavone
Tong Che
48
1
0
26 Feb 2024
Quantum Circuit Optimization with AlphaTensor
Francisco J. R. Ruiz
Tuomas Laakkonen
Johannes Bausch
Matej Balog
M. Barekatain
...
Bernardino Romera-Paredes
J. V. D. Wetering
Alhussein Fawzi
K. Meichanetzidis
Pushmeet Kohli
23
19
0
22 Feb 2024
Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers
Markus Hiller
Krista A. Ehinger
Tom Drummond
36
0
0
19 Feb 2024
Graph Structure Inference with BAM: Introducing the Bilinear Attention Mechanism
Philipp Froehlich
Heinz Koeppl
GNN
29
1
0
12 Feb 2024
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data
Shufan Li
Harkanwar Singh
Aditya Grover
Mamba
92
56
0
08 Feb 2024
Sample, estimate, aggregate: A recipe for causal discovery foundation models
Menghua Wu
Yujia Bao
Regina Barzilay
Tommi Jaakkola
CML
49
7
0
02 Feb 2024
Endowing Protein Language Models with Structural Knowledge
Dexiong Chen
Philip Hartout
Paolo Pellizzoni
Carlos G. Oliver
Karsten Borgwardt
43
12
0
26 Jan 2024
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers
Katherine Crowson
Stefan Andreas Baumann
Alex Birch
Tanishq Mathew Abraham
Daniel Z. Kaplan
Enrico Shippole
26
48
0
21 Jan 2024