ResearchTrend.AI
Layer Normalization

21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton

Papers citing "Layer Normalization"

50 / 5,502 papers shown
Chain-of-Model Learning for Language Model
Kaitao Song
Xiaohua Wang
Xu Tan
Huiqiang Jiang
Chengruidong Zhang
...
Xiaoqing Zheng
Tao Qin
Yuqing Yang
Dongsheng Li
Lili Qiu
LRM
AI4CE
4
0
0
17 May 2025
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Giyeong Oh
Woohyun Cho
Siyeol Kim
Suhwan Choi
Younjae Yu
4
0
0
17 May 2025
GeoMaNO: Geometric Mamba Neural Operator for Partial Differential Equations
Xi Han
Jingwei Zhang
Dimitris Samaras
Fei Hou
Hong Qin
AI4CE
2
0
0
17 May 2025
Continuous Optimization for Feature Selection with Permutation-Invariant Embedding and Policy-Guided Search
Rui Liu
Rui Xie
Zijun Yao
Yanjie Fu
Dongjie Wang
2
0
0
16 May 2025
Mollifier Layers: Enabling Efficient High-Order Derivatives in Inverse PDE Learning
Ananyae Kumar Bhartari
Vinayak Vinayak
Vivek B Shenoy
AI4CE
4
0
0
16 May 2025
A Unified and Scalable Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability
Jie Zhu
Jirong Zha
Ding Li
Leye Wang
31
0
0
15 May 2025
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
Jintian Shao
Hongyi Huang
Jiayi Wu
Beiwen Zhang
ZhiYu Wu
You Shan
MingKai Zheng
29
0
0
15 May 2025
AdaFortiTran: An Adaptive Transformer Model for Robust OFDM Channel Estimation
Berkay Guler
Hamid Jafarkhani
21
1
0
14 May 2025
Sequential Treatment Effect Estimation with Unmeasured Confounders
Yingrong Wang
Anpeng Wu
Yangqiu Song
Ziyang Xiao
Ruoxuan Xiong
Qing Han
Kun Kuang
CML
38
0
0
14 May 2025
UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing
Yung-Hsuan Lai
Janek Ebbers
Yu-Chiang Frank Wang
François Germain
Michael Jeffrey Jones
Moitreya Chatterjee
26
0
0
14 May 2025
Simple Semi-supervised Knowledge Distillation from Vision-Language Models via Dual-Head Optimization
Seongjae Kang
Dong Bok Lee
Hyungjoon Jang
Sung Ju Hwang
VLM
58
0
0
12 May 2025
IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method
Mihyeon Kim
Juhyoung Park
Youngbin Kim
34
0
0
11 May 2025
TSLFormer: A Lightweight Transformer Model for Turkish Sign Language Recognition Using Skeletal Landmarks
Kutay Ertürk
Furkan Altınışık
İrem Sarıaltın
Ömer Nezih Gerek
SLR
42
0
0
11 May 2025
Mask-PINNs: Regulating Feature Distributions in Physics-Informed Neural Networks
Feilong Jiang
Xiaonan Hou
Jianqiao Ye
Min Xia
OOD
PINN
42
0
0
09 May 2025
Nonlinear Motion-Guided and Spatio-Temporal Aware Network for Unsupervised Event-Based Optical Flow
Zuntao Liu
Hao Zhuang
Junjie Jiang
Yuhang Song
Zheng Fang
50
0
0
08 May 2025
SEVA: Leveraging Single-Step Ensemble of Vicinal Augmentations for Test-Time Adaptation
Zixuan Hu
Yichun Hu
Ling-yu Duan
TTA
59
0
0
07 May 2025
Matching Distance and Geometric Distribution Aided Learning Multiview Point Cloud Registration
Shiqi Li
Jihua Zhu
Yifan Xie
Naiwen Hu
Di Wang
3DPC
73
3
0
06 May 2025
The Inverse Drum Machine: Source Separation Through Joint Transcription and Analysis-by-Synthesis
Bernardo Torres
Geoffroy Peeters
G. Richard
46
0
0
06 May 2025
Sharpness-Aware Minimization with Z-Score Gradient Filtering for Neural Networks
Juyoung Yun
40
0
0
05 May 2025
Demystifying optimized prompts in language models
Rimon Melamed
Lucas H. McCabe
H. H. Huang
41
0
0
04 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
66
0
0
04 May 2025
Distilling Two-Timed Flow Models by Separately Matching Initial and Terminal Velocities
Pramook Khungurn
Pratch Piyawongwisal
Sira Sriswadi
Supasorn Suwajanakorn
37
0
0
02 May 2025
iMacSR: Intermediate Multi-Access Supervision and Regularization in Training Autonomous Driving Models
Wei-Bin Kou
Guangxu Zhu
Yichen Jin
Shuai Wang
Ming Tang
Yik-Chung Wu
41
0
0
01 May 2025
On the Importance of Gaussianizing Representations
Daniel Eftekhari
Vardan Papyan
31
0
0
01 May 2025
Direct Motion Models for Assessing Generated Videos
Kelsey R. Allen
Carl Doersch
Guangyao Zhou
Mohammed Suhail
Danny Driess
...
Thomas Kipf
Mehdi S. M. Sajjadi
Kevin P. Murphy
João Carreira
Sjoerd van Steenkiste
EGVM
DiffM
VGen
78
0
0
30 Apr 2025
GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers
Xinyu Li
Qi Yao
Yanjie Wang
DiffM
48
0
0
30 Apr 2025
A comparative study of deep learning and ensemble learning to extend the horizon of traffic forecasting
Xiao Zheng
Saeed Asadi Bagloee
Majid Sarvi
AI4TS
43
0
0
30 Apr 2025
Mjölnir: A Deep Learning Parametrization Framework for Global Lightning Flash Density
Minjong Cheon
35
0
0
28 Apr 2025
xEdgeFace: Efficient Cross-Spectral Face Recognition for Edge Devices
Anjith George
Sébastien Marcel
CVBM
65
0
0
28 Apr 2025
Modelling of Underwater Vehicles using Physics-Informed Neural Networks with Control
Abdelhakim Amer
David Felsager
Yury Brodskiy
Andriy Sarabakha
PINN
AI4CE
63
0
0
28 Apr 2025
Improving Reasoning Performance in Large Language Models via Representation Engineering
Bertram Højer
Oliver Jarvis
Stefan Heinrich
LRM
83
1
0
28 Apr 2025
VTire: A Bimodal Visuotactile Tire with High-Resolution Sensing Capability
Shoujie Li
Jianle Xu
Tong Wu
Yang Yang
Yuxiao Chen
Xueqian Wang
Wenbo Ding
Xuzhi Zhang
39
0
0
27 Apr 2025
CANet: ChronoAdaptive Network for Enhanced Long-Term Time Series Forecasting under Non-Stationarity
Mert Sonmezer
Seyda Ertekin
AI4TS
26
0
0
24 Apr 2025
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
Mingqi Yuan
Qi Wang
Guozheng Ma
Bo-wen Li
Xin Jin
Yunbo Wang
Xiaokang Yang
Wenjun Zeng
D. Tao
OffRL
AI4CE
35
0
0
24 Apr 2025
Dynamic Time-aware Continual User Representation Learning
Seungyoon Choi
Sein Kim
Hongseok Kang
Wonjoong Kim
Chanyoung Park
CLL
57
0
0
23 Apr 2025
A Novel Graph Transformer Framework for Gene Regulatory Network Inference
Binon Teji
Swarup Roy
24
0
0
23 Apr 2025
Distilling semantically aware orders for autoregressive image generation
Rishav Pramanik
Antoine Poupon
Juan A. Rodriguez
Masih Aminbeidokhti
David Vazquez
Christopher Pal
Zhaozheng Yin
M. Pedersoli
31
0
0
23 Apr 2025
STFM: A Spatio-Temporal Information Fusion Model Based on Phase Space Reconstruction for Sea Surface Temperature Prediction
Yin Wang
Chunlin Gong
Xiang Wu
Hanleran Zhang
29
0
0
23 Apr 2025
PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
Song Wang
Xiaolu Liu
Lingdong Kong
Jianyun Xu
Chunyong Hu
Gongfan Fang
Wentong Li
Jianke Zhu
Xinchao Wang
29
0
0
22 Apr 2025
SUPRA: Subspace Parameterized Attention for Neural Operator on General Domains
Zherui Yang
Zhengyang Xue
Ligang Liu
29
0
0
22 Apr 2025
What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale
Xiaoyong Yuan
Xiaolong Ma
Linke Guo
Lan Zhang
DiffM
42
0
0
21 Apr 2025
VeLU: Variance-enhanced Learning Unit for Deep Neural Networks
Ashkan Shakarami
Yousef Yeganeh
Azade Farshad
Lorenzo Nicolè
Stefano Ghidoni
Nassir Navab
54
0
0
21 Apr 2025
An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes
Ji Qi
Yuan Yao
Yushi Bai
Bin Xu
Juanzi Li
Zhiyuan Liu
Tat-Seng Chua
41
0
0
21 Apr 2025
6G WavesFM: A Foundation Model for Sensing, Communication, and Localization
Ahmed Aboulfotouh
E. Mohammed
Hatem Abou-Zeid
36
0
0
18 Apr 2025
Consensus-aware Contrastive Learning for Group Recommendation
Soyoung Kim
Dongjun Lee
J. H. Kim
29
0
0
18 Apr 2025
Fairness and Robustness in Machine Unlearning
Khoa Tran
Simon S. Woo
FaML
OOD
MU
AAML
72
0
0
18 Apr 2025
Improving Sequential Recommenders through Counterfactual Augmentation of System Exposure
Ziqi Zhao
Zhaochun Ren
Jiyuan Yang
Zuming Yan
Zihan Wang
Liu Yang
Pengjie Ren
Zhumin Chen
Maarten de Rijke
Xin Xin
CML
37
0
0
18 Apr 2025
SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration
Xi Tong
Xing Luo
Jiangxin Yang
Yanpeng Cao
34
0
0
17 Apr 2025
Plain Transformers Can be Powerful Graph Learners
Liheng Ma
Soumyasundar Pal
Yingxue Zhang
Philip Torr
Mark J. Coates
28
0
0
17 Apr 2025
Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off
Riza Velioglu
Petra Bevandic
Robin Chan
Barbara Hammer
DiffM
36
0
0
17 Apr 2025