ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1607.06450
  4. Cited By
Layer Normalization

Layer Normalization

21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
ArXivPDFHTML

Papers citing "Layer Normalization"

50 / 5,502 papers shown
Title
Reinforcement Learning-based Self-adaptive Differential Evolution through Automated Landscape Feature Learning
Reinforcement Learning-based Self-adaptive Differential Evolution through Automated Landscape Feature Learning
Hongshu Guo
Sijie Ma
Zechuan Huang
Yuzhi Hu
Zeyuan Ma
Xinglin Zhang
Yue-jiao Gong
55
2
0
23 Mar 2025
End-to-End Implicit Neural Representations for Classification
End-to-End Implicit Neural Representations for Classification
Alexander Gielisse
Jan van Gemert
47
0
0
23 Mar 2025
GLADMamba: Unsupervised Graph-Level Anomaly Detection Powered by Selective State Space Model
GLADMamba: Unsupervised Graph-Level Anomaly Detection Powered by Selective State Space Model
Yali Fu
Jindong Li
Qi Wang
Qianli Xing
Mamba
57
0
0
23 Mar 2025
SNRAware: Improved Deep Learning MRI Denoising with SNR Unit Training and G-factor Map Augmentation
SNRAware: Improved Deep Learning MRI Denoising with SNR Unit Training and G-factor Map Augmentation
H. Xue
Sarah M. Hooper
Iain Pierce
R. Davies
John Stairs
...
C. Manisty
James C. Moon
T. Treibel
Peter Kellman
Michael S. Hansen
MedIm
61
0
0
23 Mar 2025
Accurate Peak Detection in Multimodal Optimization via Approximated Landscape Learning
Accurate Peak Detection in Multimodal Optimization via Approximated Landscape Learning
Zeyuan Ma
Hongqiao Lian
Wenjie Qiu
Yue-jiao Gong
55
1
0
23 Mar 2025
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
Tobias Gessler
Tin Dizdarevic
Ani Calinescu
Benjamin Ellis
Andrei Lupu
Jakob Foerster
59
1
0
22 Mar 2025
Variance Control via Weight Rescaling in LLM Pre-training
Variance Control via Weight Rescaling in LLM Pre-training
Louis Owen
Abhay Kumar
Nilabhra Roy Chowdhury
Fabian Güra
33
0
0
21 Mar 2025
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
Mengtian Li
Jinshu Chen
Wanquan Feng
Bingchuan Li
Fei Dai
Mingcong Liu
Qian He
3DH
52
0
0
21 Mar 2025
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology
Vishwesh Ramanathan
Tony Xu
Pushpak Pati
Faruk Ahmed
Maged Goubran
Anne L. Martel
48
0
0
21 Mar 2025
ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration
ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration
Johan Edstedt
André Mateus
Alberto Jaenal
50
0
0
21 Mar 2025
Preference-Guided Diffusion for Multi-Objective Offline Optimization
Preference-Guided Diffusion for Multi-Objective Offline Optimization
Yashas Annadani
Syrine Belakaria
Stefano Ermon
Stefan Bauer
Barbara Engelhardt
50
0
0
21 Mar 2025
UniK3D: Universal Camera Monocular 3D Estimation
UniK3D: Universal Camera Monocular 3D Estimation
Luigi Piccinelli
Daniel Gehrig
Mattia Segu
Yifan Yang
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
47
0
0
20 Mar 2025
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
Philipp Becker
Abhinav Mehrotra
Ruchika Chavhan
Malcolm Chadwick
Luca Morreale
Mehdi Noroozi
Alberto Gil C. P. Ramos
Sourav Bhattacharya
51
0
0
20 Mar 2025
Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution
Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution
Wanshu Fan
Yue Wang
Cong Wang
Yunzhe Zhang
Wei Wang
Dongsheng Zhou
49
0
0
20 Mar 2025
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Kevin Wang
Ishaan Javali
Michał Bortkiewicz
Tomasz Trzciñski
Benjamin Eysenbach
SSL
OffRL
72
0
0
19 Mar 2025
Learning Distributions of Complex Fluid Simulations with Diffusion Graph Networks
Learning Distributions of Complex Fluid Simulations with Diffusion Graph Networks
Mario Lino
Tobias Pfaff
Nils Thuerey
DiffM
AI4CE
47
3
0
19 Mar 2025
ACE: A Cardinality Estimator for Set-Valued Queries
ACE: A Cardinality Estimator for Set-Valued Queries
Yufan Sheng
Xin Cao
Kaiqi Zhao
Yixiang Fang
Jianzhong Qi
Wenjie Zhang
Christian S. Jensen
62
0
0
19 Mar 2025
Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling
Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling
Yanchen Luo
Zhiyuan Liu
Yi Zhao
Sihang Li
Kenji Kawaguchi
Tat-Seng Chua
Xuben Wang
MedIm
69
0
0
19 Mar 2025
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
Seokhyeon Hong
Chaelin Kim
Serin Yoon
Junghyun Nam
Sihun Cha
Junyong Noh
DiffM
VGen
73
1
0
18 Mar 2025
SplatVoxel: History-Aware Novel View Streaming without Temporal Training
SplatVoxel: History-Aware Novel View Streaming without Temporal Training
Yiming Wang
Lucy Chai
Xuan Luo
Michael Niemeyer
Manuel Lagunas
Stephen Lombardi
Siyu Tang
Tiancheng Sun
3DGS
58
0
0
18 Mar 2025
TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba
TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba
Jiaxu Liu
Li Li
Hubert P. H. Shum
T. Breckon
Mamba
57
0
0
17 Mar 2025
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference
M. Beck
Korbinian Poppel
Phillip Lippe
Richard Kurle
P. Blies
G. Klambauer
Sebastian Böck
Sepp Hochreiter
LRM
51
1
0
17 Mar 2025
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Beyond RGB: Adaptive Parallel Processing for RAW Object Detection
Shani Gamrian
Hila Barel
Feiran Li
Masakazu Yoshimura
Daisuke Iso
56
0
0
17 Mar 2025
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Lan Chen
Qi Mao
Yuchao Gu
Mike Zheng Shou
64
1
0
17 Mar 2025
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Wonwoong Cho
Yan-Ying Chen
M. Klenk
David I. Inouye
Yanxia Zhang
DiffM
207
0
0
15 Mar 2025
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Enes Erdogan
E. Aksoy
Sanem Sariel
46
0
0
15 Mar 2025
Multi-output Classification for Compound Fault Diagnosis in Motor under Partially Labeled Target Domain
Multi-output Classification for Compound Fault Diagnosis in Motor under Partially Labeled Target Domain
Wonjun Yi
Yong-Hwa Park
31
0
0
15 Mar 2025
Unlocking Open-Set Language Accessibility in Vision Models
Fawaz Sammani
Jonas Fischer
Nikos Deligiannis
VLM
55
0
0
14 Mar 2025
Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Matteo Farina
Massimiliano Mancini
Giovanni Iacca
Elisa Ricci
VLM
60
0
0
14 Mar 2025
APLA: A Simple Adaptation Method for Vision Transformers
APLA: A Simple Adaptation Method for Vision Transformers
Moein Sorkhei
Emir Konuk
Kevin Smith
Christos Matsoukas
53
0
0
14 Mar 2025
Transformers without Normalization
Jiachen Zhu
Xinlei Chen
Kaiming He
Yann LeCun
Zhuang Liu
ViT
OffRL
65
7
0
13 Mar 2025
Cost-Optimal Grouped-Query Attention for Long-Context Modeling
Cost-Optimal Grouped-Query Attention for Long-Context Modeling
Yuxiao Chen
Yutong Wu
Chenyang Song
Zhiyuan Liu
Maosong Sun
Xu Han
Zhiyuan Liu
Maosong Sun
73
0
0
12 Mar 2025
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
57
0
0
12 Mar 2025
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
Ethan Griffiths
Maryam Haghighat
Simon Denman
Clinton Fookes
Milad Ramezani
3DPC
59
0
0
11 Mar 2025
Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera
Dong-Hee Paek
Seung-Hyun Kong
48
1
0
10 Mar 2025
Efficient Neural Clause-Selection Reinforcement
Martin Suda
44
0
0
10 Mar 2025
X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second
Guofeng Zhang
Ruyi Zha
Hao He
Yixun Liang
A. Yuille
Yiming Li
Yuanhao Cai
51
0
0
09 Mar 2025
Attention-Based Synthetic Data Generation for Calibration-Enhanced Survival Analysis: A Case Study for Chronic Kidney Disease Using Electronic Health Records
N. Kuo
B. Gallego
Louisa R Jorm
SyDa
74
0
0
08 Mar 2025
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
Zhijian Zhuo
Yutao Zeng
Ya Wang
Sijun Zhang
Jian Yang
Xiaoqing Li
Xun Zhou
Jinwen Ma
51
0
0
06 Mar 2025
A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery
Yiheng Zhu
Mingyang Li
Junlong Liu
Kun Fu
Jiangxu Wu
Yue Liu
Mingze Yin
Jieping Ye
Jian Wu
Zehua Wang
62
0
0
06 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
72
0
0
06 Mar 2025
A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers
William Merrill
Ashish Sabharwal
55
5
0
05 Mar 2025
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
85
6
0
05 Mar 2025
Learning to Reduce Search Space for Generalizable Neural Routing Solver
Learning to Reduce Search Space for Generalizable Neural Routing Solver
Changliang Zhou
Xi Lin
Zhenkun Wang
Qingfu Zhang
114
1
0
05 Mar 2025
VWAP Execution with Signature-Enhanced Transformers: A Multi-Asset Learning Approach
Remi Genet
70
0
0
04 Mar 2025
Disentangled Knowledge Tracing for Alleviating Cognitive Bias
Yiyun Zhou
Zheqi Lv
Shengyu Zhang
Jingyuan Chen
AI4Ed
48
0
0
04 Mar 2025
Improving Plasticity in Non-stationary Reinforcement Learning with Evidential Proximal Policy Optimization
Abdullah Akgul
Gulcin Baykal
Manuel Haußmann
M. Kandemir
38
0
0
03 Mar 2025
RSQ: Learning from Important Tokens Leads to Better Quantized LLMs
Yi-Lin Sung
Prateek Yadav
Jialu Li
Jaehong Yoon
Joey Tianyi Zhou
MQ
57
1
0
03 Mar 2025
Video-DPRP: A Differentially Private Approach for Visual Privacy-Preserving Video Human Activity Recognition
Allassan Tchangmena A Nken
Susan Mckeever
Peter Corcoran
Ihsan Ullah
PICV
50
0
0
03 Mar 2025
How simple can you go? An off-the-shelf transformer approach to molecular dynamics
Max Eissler
Tim Korjakow
Stefan Ganscha
Oliver T. Unke
Klaus-Robert Müller
Stefan Gugler
63
1
0
03 Mar 2025
Previous
123456...109110111
Next