Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.07253
Cited By
Energy Transformer
14 February 2023
Benjamin Hoover
Yuchen Liang
Bao Pham
Yikang Shen
Hendrik Strobelt
Duen Horng Chau
Mohammed J Zaki
Dmitry Krotov
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Energy Transformer"
30 / 30 papers shown
Title
Hyper-SET: Designing Transformers via Hyperspherical Energy Minimization
Yunzhe Hu
Difan Zou
Dong Xu
97
1
0
17 Feb 2025
Artificial Kuramoto Oscillatory Neurons
Takeru Miyato
Sindy Löwe
Andreas Geiger
Max Welling
AI4CE
160
9
0
17 Oct 2024
Associative memory and dead neurons
V. Fanaskov
Ivan Oseledets
84
1
0
02 Oct 2024
Memory Mosaics
Jianyu Zhang
Niklas Nolte
Ranajoy Sadhukhan
Beidi Chen
Léon Bottou
VLM
92
4
0
10 May 2024
Associative Transformer
Yuwei Sun
H. Ochiai
Zhirong Wu
Stephen Lin
Ryota Kanai
ViT
94
0
0
22 Sep 2023
Rethinking Graph Neural Networks for Anomaly Detection
Jianheng Tang
Jiajin Li
Zi-Chao Gao
Jia Li
111
218
0
31 May 2022
A New Perspective on the Effects of Spectrum in Graph Neural Networks
Mingqi Yang
Yanming Shen
Rui Li
Heng Qi
Qian Zhang
Baocai Yin
GNN
45
29
0
14 Dec 2021
MetaFormer Is Actually What You Need for Vision
Weihao Yu
Mi Luo
Pan Zhou
Chenyang Si
Yichen Zhou
Xinchao Wang
Jiashi Feng
Shuicheng Yan
165
909
0
22 Nov 2021
Global Self-Attention as a Replacement for Graph Convolution
Md Shamim Hussain
Mohammed J Zaki
D. Subramanian
ViT
59
127
0
07 Aug 2021
Hierarchical Associative Memory
Dmitry Krotov
BDL
127
34
0
14 Jul 2021
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
151
1,124
0
08 Jun 2021
GraphFormers: GNN-nested Transformers for Representation Learning on Textual Graph
Junhan Yang
Zheng Liu
Shitao Xiao
Chaozhuo Li
Defu Lian
Sanjay Agrawal
Amit Singh
Guangzhong Sun
Xing Xie
AI4CE
47
157
0
06 May 2021
A Generalization of Transformer Networks to Graphs
Vijay Prakash Dwivedi
Xavier Bresson
AI4CE
101
749
0
17 Dec 2020
Large Associative Memory Problem in Neurobiology and Machine Learning
Dmitry Krotov
J. Hopfield
54
136
0
16 Aug 2020
TUDataset: A collection of benchmark datasets for learning with graphs
Christopher Morris
Nils M. Kriege
Franka Bause
Kristian Kersting
Petra Mutzel
Marion Neumann
233
820
0
16 Jul 2020
Hopfield Networks is All You Need
Hubert Ramsauer
Bernhard Schafl
Johannes Lehner
Philipp Seidl
Michael Widrich
...
David P. Kreil
Michael K Kopp
Günter Klambauer
Johannes Brandstetter
Sepp Hochreiter
100
433
0
16 Jul 2020
Alleviating the Inconsistency Problem of Applying Graph Neural Network to Fraud Detection
Zhiwei Liu
Yingtong Dou
Philip S. Yu
Yutong Deng
Hao Peng
GNN
99
278
0
01 May 2020
Gradient Centralization: A New Optimization Technique for Deep Neural Networks
Hongwei Yong
Jianqiang Huang
Xiansheng Hua
Lei Zhang
ODL
66
186
0
03 Apr 2020
ASAP: Adaptive Structure Aware Pooling for Learning Hierarchical Graph Representations
Ekagra Ranjan
Soumya Sanyal
Partha P. Talukdar
GNN
171
333
0
18 Nov 2019
Improving Transformer Models by Reordering their Sublayers
Ofir Press
Noah A. Smith
Omer Levy
54
87
0
10 Nov 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
363
6,449
0
26 Sep 2019
Universal Graph Transformer Self-Attention Networks
Dai Quoc Nguyen
T. Nguyen
Dinh Q. Phung
ViT
79
66
0
26 Sep 2019
Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View
Yiping Lu
Zhuohan Li
Di He
Zhiqing Sun
Bin Dong
Tao Qin
Liwei Wang
Tie-Yan Liu
AI4CE
76
174
0
06 Jun 2019
When Does Label Smoothing Help?
Rafael Müller
Simon Kornblith
Geoffrey E. Hinton
UQCV
195
1,945
0
06 Jun 2019
Fast Graph Representation Learning with PyTorch Geometric
Matthias Fey
J. E. Lenssen
3DH
GNN
3DPC
226
4,339
0
06 Mar 2019
The Evolved Transformer
David R. So
Chen Liang
Quoc V. Le
ViT
107
462
0
30 Jan 2019
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
144
2,136
0
14 Nov 2017
Graph Attention Networks
Petar Velickovic
Guillem Cucurull
Arantxa Casanova
Adriana Romero
Pietro Lio
Yoshua Bengio
GNN
479
20,138
0
30 Oct 2017
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
330
8,116
0
13 Aug 2016
Visualizing and Understanding Convolutional Networks
Matthew D. Zeiler
Rob Fergus
FAtt
SSL
595
15,882
0
12 Nov 2013
1