Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.06450
Cited By
Layer Normalization
21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Layer Normalization"
50 / 5,502 papers shown
Title
Reinforcement Learning-based Self-adaptive Differential Evolution through Automated Landscape Feature Learning
Hongshu Guo
Sijie Ma
Zechuan Huang
Yuzhi Hu
Zeyuan Ma
Xinglin Zhang
Yue-jiao Gong
55
2
0
23 Mar 2025
End-to-End Implicit Neural Representations for Classification
Alexander Gielisse
Jan van Gemert
47
0
0
23 Mar 2025
GLADMamba: Unsupervised Graph-Level Anomaly Detection Powered by Selective State Space Model
Yali Fu
Jindong Li
Qi Wang
Qianli Xing
Mamba
57
0
0
23 Mar 2025
SNRAware: Improved Deep Learning MRI Denoising with SNR Unit Training and G-factor Map Augmentation
H. Xue
Sarah M. Hooper
Iain Pierce
R. Davies
John Stairs
...
C. Manisty
James C. Moon
T. Treibel
Peter Kellman
Michael S. Hansen
MedIm
61
0
0
23 Mar 2025
Accurate Peak Detection in Multimodal Optimization via Approximated Landscape Learning
Zeyuan Ma
Hongqiao Lian
Wenjie Qiu
Yue-jiao Gong
55
1
0
23 Mar 2025
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
Tobias Gessler
Tin Dizdarevic
Ani Calinescu
Benjamin Ellis
Andrei Lupu
Jakob Foerster
59
1
0
22 Mar 2025
Variance Control via Weight Rescaling in LLM Pre-training
Louis Owen
Abhay Kumar
Nilabhra Roy Chowdhury
Fabian Güra
33
0
0
21 Mar 2025
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
Mengtian Li
Jinshu Chen
Wanquan Feng
Bingchuan Li
Fei Dai
Mingcong Liu
Qian He
3DH
52
0
0
21 Mar 2025
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology
Vishwesh Ramanathan
Tony Xu
Pushpak Pati
Faruk Ahmed
Maged Goubran
Anne L. Martel
48
0
0
21 Mar 2025
ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration
Johan Edstedt
André Mateus
Alberto Jaenal
50
0
0
21 Mar 2025
Preference-Guided Diffusion for Multi-Objective Offline Optimization
Yashas Annadani
Syrine Belakaria
Stefano Ermon
Stefan Bauer
Barbara Engelhardt
50
0
0
21 Mar 2025
UniK3D: Universal Camera Monocular 3D Estimation
Luigi Piccinelli
Daniel Gehrig
Mattia Segu
Yifan Yang
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
47
0
0
20 Mar 2025
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
Philipp Becker
Abhinav Mehrotra
Ruchika Chavhan
Malcolm Chadwick
Luca Morreale
Mehdi Noroozi
Alberto Gil C. P. Ramos
Sourav Bhattacharya
51
0
0
20 Mar 2025
Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution
Wanshu Fan
Yue Wang
Cong Wang
Yunzhe Zhang
Wei Wang
Dongsheng Zhou
49
0
0
20 Mar 2025
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Kevin Wang
Ishaan Javali
Michał Bortkiewicz
Tomasz Trzciñski
Benjamin Eysenbach
SSL
OffRL
72
0
0
19 Mar 2025
Learning Distributions of Complex Fluid Simulations with Diffusion Graph Networks
Mario Lino
Tobias Pfaff
Nils Thuerey
DiffM
AI4CE
47
3
0
19 Mar 2025
ACE: A Cardinality Estimator for Set-Valued Queries
Yufan Sheng
Xin Cao
Kaiqi Zhao
Yixiang Fang
Jianzhong Qi
Wenjie Zhang
Christian S. Jensen
62
0
0
19 Mar 2025
Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling
Yanchen Luo
Zhiyuan Liu
Yi Zhao
Sihang Li
Kenji Kawaguchi
Tat-Seng Chua
Xuben Wang
MedIm
69
0
0
19 Mar 2025
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
Seokhyeon Hong
Chaelin Kim
Serin Yoon
Junghyun Nam
Sihun Cha
Junyong Noh
DiffM
VGen
73
1
0
18 Mar 2025
SplatVoxel: History-Aware Novel View Streaming without Temporal Training
Yiming Wang
Lucy Chai
Xuan Luo
Michael Niemeyer
Manuel Lagunas
Stephen Lombardi
Siyu Tang
Tiancheng Sun
3DGS
58
0
0
18 Mar 2025
TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba
Jiaxu Liu
Li Li
Hubert P. H. Shum
T. Breckon
Mamba
57
0
0
17 Mar 2025
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference
M. Beck
Korbinian Poppel
Phillip Lippe
Richard Kurle
P. Blies
G. Klambauer
Sebastian Böck
Sepp Hochreiter
LRM
51
1
0
17 Mar 2025
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Shani Gamrian
Hila Barel
Feiran Li
Masakazu Yoshimura
Daisuke Iso
56
0
0
17 Mar 2025
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Lan Chen
Qi Mao
Yuchao Gu
Mike Zheng Shou
64
1
0
17 Mar 2025
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Wonwoong Cho
Yan-Ying Chen
M. Klenk
David I. Inouye
Yanxia Zhang
DiffM
207
0
0
15 Mar 2025
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Enes Erdogan
E. Aksoy
Sanem Sariel
46
0
0
15 Mar 2025
Multi-output Classification for Compound Fault Diagnosis in Motor under Partially Labeled Target Domain
Wonjun Yi
Yong-Hwa Park
31
0
0
15 Mar 2025
Unlocking Open-Set Language Accessibility in Vision Models
Fawaz Sammani
Jonas Fischer
Nikos Deligiannis
VLM
55
0
0
14 Mar 2025
Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Matteo Farina
Massimiliano Mancini
Giovanni Iacca
Elisa Ricci
VLM
60
0
0
14 Mar 2025
APLA: A Simple Adaptation Method for Vision Transformers
Moein Sorkhei
Emir Konuk
Kevin Smith
Christos Matsoukas
53
0
0
14 Mar 2025
Transformers without Normalization
Jiachen Zhu
Xinlei Chen
Kaiming He
Yann LeCun
Zhuang Liu
ViT
OffRL
65
7
0
13 Mar 2025
Cost-Optimal Grouped-Query Attention for Long-Context Modeling
Yuxiao Chen
Yutong Wu
Chenyang Song
Zhiyuan Liu
Maosong Sun
Xu Han
Zhiyuan Liu
Maosong Sun
73
0
0
12 Mar 2025
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
57
0
0
12 Mar 2025
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
Ethan Griffiths
Maryam Haghighat
Simon Denman
Clinton Fookes
Milad Ramezani
3DPC
59
0
0
11 Mar 2025
Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera
Dong-Hee Paek
Seung-Hyun Kong
48
1
0
10 Mar 2025
Efficient Neural Clause-Selection Reinforcement
Martin Suda
44
0
0
10 Mar 2025
X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second
Guofeng Zhang
Ruyi Zha
Hao He
Yixun Liang
A. Yuille
Yiming Li
Yuanhao Cai
51
0
0
09 Mar 2025
Attention-Based Synthetic Data Generation for Calibration-Enhanced Survival Analysis: A Case Study for Chronic Kidney Disease Using Electronic Health Records
N. Kuo
B. Gallego
Louisa R Jorm
SyDa
74
0
0
08 Mar 2025
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
Zhijian Zhuo
Yutao Zeng
Ya Wang
Sijun Zhang
Jian Yang
Xiaoqing Li
Xun Zhou
Jinwen Ma
51
0
0
06 Mar 2025
A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery
Yiheng Zhu
Mingyang Li
Junlong Liu
Kun Fu
Jiangxu Wu
Yue Liu
Mingze Yin
Jieping Ye
Jian Wu
Zehua Wang
62
0
0
06 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
72
0
0
06 Mar 2025
A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers
William Merrill
Ashish Sabharwal
55
5
0
05 Mar 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
85
6
0
05 Mar 2025
Learning to Reduce Search Space for Generalizable Neural Routing Solver
Changliang Zhou
Xi Lin
Zhenkun Wang
Qingfu Zhang
114
1
0
05 Mar 2025
VWAP Execution with Signature-Enhanced Transformers: A Multi-Asset Learning Approach
Remi Genet
70
0
0
04 Mar 2025
Disentangled Knowledge Tracing for Alleviating Cognitive Bias
Yiyun Zhou
Zheqi Lv
Shengyu Zhang
Jingyuan Chen
AI4Ed
48
0
0
04 Mar 2025
Improving Plasticity in Non-stationary Reinforcement Learning with Evidential Proximal Policy Optimization
Abdullah Akgul
Gulcin Baykal
Manuel Haußmann
M. Kandemir
38
0
0
03 Mar 2025
RSQ: Learning from Important Tokens Leads to Better Quantized LLMs
Yi-Lin Sung
Prateek Yadav
Jialu Li
Jaehong Yoon
Joey Tianyi Zhou
MQ
57
1
0
03 Mar 2025
Video-DPRP: A Differentially Private Approach for Visual Privacy-Preserving Video Human Activity Recognition
Allassan Tchangmena A Nken
Susan Mckeever
Peter Corcoran
Ihsan Ullah
PICV
50
0
0
03 Mar 2025
How simple can you go? An off-the-shelf transformer approach to molecular dynamics
Max Eissler
Tim Korjakow
Stefan Ganscha
Oliver T. Unke
Klaus-Robert Müller
Stefan Gugler
63
1
0
03 Mar 2025
Previous
1
2
3
4
5
6
...
109
110
111
Next