ResearchTrend.AI
Evolving Normalization-Activation Layers

6 April 2020
Hanxiao Liu
Andrew Brock
Karen Simonyan
Quoc V. Le
arXiv: 2004.02967

Papers citing "Evolving Normalization-Activation Layers"

20 / 20 papers shown
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Xin Zou
Yizhou Wang
Yibo Yan
Yuanhuiyi Lyu
Kening Zheng
...
Junkai Chen
Peijie Jiang
Jiaheng Liu
Chang Tang
Xuming Hu
89
7
0
04 Oct 2024
SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data
Sakshi Choudhary
Sai Aparna Aketi
Kaushik Roy
FedML
50
0
0
22 May 2024
Homogenizing Non-IID datasets via In-Distribution Knowledge Distillation for Decentralized Learning
Deepak Ravikumar
Gobinda Saha
Sai Aparna Aketi
Kaushik Roy
21
2
0
09 Apr 2023
Unified Functional Hashing in Automatic Machine Learning
Ryan Gillard
S. Jonany
Yingjie Miao
Michael Munn
Connal de Souza
Jonathan Dungay
Chen Liang
David R. So
Quoc V. Le
Esteban Real
26
2
0
10 Feb 2023
Efficient Activation Function Optimization through Surrogate Modeling
G. Bingham
Risto Miikkulainen
24
2
0
13 Jan 2023
Similarity of Neural Architectures using Adversarial Attack Transferability
Jaehui Hwang
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
39
1
0
20 Oct 2022
Nish: A Novel Negative Stimulated Hybrid Activation Function
Yildiray Anagün
Ş. Işık
27
2
0
17 Oct 2022
GAAF: Searching Activation Functions for Binary Neural Networks through Genetic Algorithm
Yanfei Li
Tong Geng
S. Stein
Ang Li
Hui-Ling Yu
MQ
31
8
0
05 Jun 2022
Data-heterogeneity-aware Mixing for Decentralized Learning
Yatin Dandi
Anastasia Koloskova
Martin Jaggi
Sebastian U. Stich
43
18
0
13 Apr 2022
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
Dominik Rivoir
Isabel Funke
Stefanie Speidel
24
17
0
15 Mar 2022
Reciprocal Normalization for Domain Adaptation
Zhiyong Huang
Kekai Sheng
Ke Li
Jian Liang
Taiping Yao
Weiming Dong
D. Zhou
Xing Sun
46
11
0
20 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
34
26
0
16 Dec 2021
TorchEsegeta: Framework for Interpretability and Explainability of Image-based Deep Learning Models
S. Chatterjee
Arnab Das
Chirag Mandal
Budhaditya Mukhopadhyay
Manish Vipinraj
Aniruddh Shukla
R. Rao
Chompunuch Sarasaen
Oliver Speck
A. Nürnberger
MedIm
42
14
0
16 Oct 2021
RelaySum for Decentralized Deep Learning on Heterogeneous Data
Thijs Vogels
Lie He
Anastasia Koloskova
Tao R. Lin
Sai Praneeth Karimireddy
Sebastian U. Stich
Martin Jaggi
FedML
MoE
11
61
0
08 Oct 2021
AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks
G. Bingham
Risto Miikkulainen
ODL
24
4
0
18 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mańke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
91
154
0
17 Sep 2021
Rethinking "Batch" in BatchNorm
Yuxin Wu
Justin Johnson
BDL
43
66
0
17 May 2021
Modeling the geospatial evolution of COVID-19 using spatio-temporal convolutional sequence-to-sequence neural networks
Mário Cardoso
A. Cavalheiro
Alexandre Borges
A. F. Duarte
Amilcar Soares
M. Pereira
N. Nunes
L. Azevedo
Arlindo L. Oliveira
48
8
0
06 May 2021
Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution
T. Elsken
J. H. Metzen
Frank Hutter
131
499
0
24 Apr 2018
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
274
5,331
0
05 Nov 2016