Gaussian Error Linear Units (GELUs)

27 June 2016
Dan Hendrycks
Kevin Gimpel

Papers citing "Gaussian Error Linear Units (GELUs)"

50 / 945 papers shown
LLIC: Large Receptive Field Transform Coding with Adaptive Weights for Learned Image Compression
Wei Jiang
Peirong Ning
Jiayu Yang
Yongqi Zhai
Feng Gao
Ronggang Wang
38
6
0
19 Apr 2023
CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression
Mubariz Zaffar
Liangliang Nan
Julian F. P. Kooij
27
2
0
14 Apr 2023
Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task
S. Ruan
Allen Nie
William Steenbergen
Jiayu He
JQ Zhang
...
Kyle Dang Nguyen
Catherine Y Wang
Rui Ying
James A. Landay
Emma Brunskill
28
18
0
11 Apr 2023
DeFeeNet: Consecutive 3D Human Motion Prediction with Deviation Feedback
Xiaoning Sun
Huaijiang Sun
Bin Li
Dong Wei
Weiqing Li
Jianfeng Lu
3DH
36
6
0
10 Apr 2023
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following
Mingyu Ding
Yan Xu
Zhenfang Chen
David D. Cox
Ping Luo
J. Tenenbaum
Chuang Gan
LM&Ro
64
21
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
30
41
0
07 Apr 2023
ClothCombo: Modeling Inter-Cloth Interaction for Draping Multi-Layered Clothes
Dohae Lee
Hyun Kang
In-Kwon Lee
3DH
AI4CE
36
7
0
07 Apr 2023
Anomaly Detection via Gumbel Noise Score Matching
Ahsan Mahmood
Junier Oliva
Martin Styner
26
1
0
06 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
110
6,867
0
05 Apr 2023
Industrial Anomaly Detection with Domain Shift: A Real-world Dataset and Masked Multi-scale Reconstruction
Zilong Zhang
Zhibin Zhao
Xingwu Zhang
Chuang Sun
Xuefeng Chen
32
50
0
05 Apr 2023
Blockwise Compression of Transformer-based Models without Retraining
Gaochen Dong
W. Chen
26
3
0
04 Apr 2023
TransPimLib: A Library for Efficient Transcendental Functions on Processing-in-Memory Systems
Maurus Item
Juan Gómez Luna
Yu-Yin Guo
Geraldo F. Oliveira
Mohammad Sadrosadati
O. Mutlu
40
5
0
03 Apr 2023
Transformer-based interpretable multi-modal data fusion for skin lesion classification
Theodor Cheslerean-Boghiu
Melia-Evelina Fleischmann
Theresa Willem
Tobias Lasser
ViT
MedIm
AI4CE
29
2
0
03 Apr 2023
CNNs with Multi-Level Attention for Domain Generalization
Aristotelis Ballas
Christos Diou
OOD
29
6
0
02 Apr 2023
Resolution-Invariant Image Classification based on Fourier Neural Operators
Samira Kabri
Tim Roith
Daniel Tenbrinck
Martin Burger
29
6
0
02 Apr 2023
Hierarchical Vision Transformers for Cardiac Ejection Fraction Estimation
Lhuqita Fazry
Asep Haryono
Nuzulul Khairu Nissa
Sunarno
Naufal Muhammad Hirzi
M. F. Rachmadi
W. Jatmiko
MedIm
16
16
0
31 Mar 2023
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X
Qinkai Zheng
Xiao Xia
Xu Zou
Yuxiao Dong
Shanshan Wang
...
Andi Wang
Yang Li
Teng Su
Zhilin Yang
Jie Tang
ELM
ALM
SyDa
71
320
0
30 Mar 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
99
793
0
30 Mar 2023
Ensemble weather forecast post-processing with a flexible probabilistic neural network approach
P. Mlakar
J. Merse
Jana Faganeli Pucer
25
4
0
29 Mar 2023
GNNBuilder: An Automated Framework for Generic Graph Neural Network Accelerator Generation, Simulation, and Optimization
Stefan Abi-Karam
Cong Hao
GNN
36
7
0
29 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
48
119
0
29 Mar 2023
Multi-modal learning for geospatial vegetation forecasting
V. Benson
Claire Robin
C. Requena-Mesa
Lazaro Alonso
Nuno Carvalhais
José A. Cortés
Zhihan Gao
Nora Linscheid
M. Weynants
Markus Reichstein
30
11
0
28 Mar 2023
SELF-VS: Self-supervised Encoding Learning For Video Summarization
Hojjat Mokhtarabadi
Kaveh Bahraman
M. Hosseinzadeh
M. Eftekhari
AI4TS
SSL
ViT
25
0
0
28 Mar 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
Man Liu
Feng Li
Chunjie Zhang
Yunchao Wei
H. Bai
Yao-Min Zhao
47
39
0
27 Mar 2023
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning
Siteng Huang
Biao Gong
Yutong Feng
Min Zhang
Yiliang Lv
Donglin Wang
CoGe
35
10
0
27 Mar 2023
ReBotNet: Fast Real-time Video Enhancement
Jeya Maria Jose Valanarasu
Rahul Garg
Andeep S. Toor
Xin Tong
Weijuan Xi
Andreas Lugmayr
Vishal M. Patel
A. Menini
34
0
0
23 Mar 2023
Towards Better Dynamic Graph Learning: New Architecture and Unified Library
Le Yu
Leilei Sun
Bowen Du
Weifeng Lv
AI4CE
29
99
0
23 Mar 2023
Online Transformers with Spiking Neurons for Fast Prosthetic Hand Control
Nathan Leroux
Jan Finkbeiner
Emre Neftci
41
9
0
21 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
47
20
0
17 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Fabian Isensee
Paul F. Jaeger
Klaus Maier-Hein
ViT
MedIm
40
138
0
17 Mar 2023
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
Sara Shoouri
Mingyu Yang
Zichen Fan
Hun-Seok Kim
MoE
28
3
0
16 Mar 2023
Block-wise Bit-Compression of Transformer-based Models
Gaochen Dong
W. Chen
24
0
0
16 Mar 2023
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
Haoyu He
Jianfei Cai
Jing Zhang
Dacheng Tao
Bohan Zhuang
VPVLM
22
50
0
15 Mar 2023
Graph Transformer GANs for Graph-Constrained House Generation
Hao Tang
Zhenyu Zhang
Humphrey Shi
Bo-wen Li
Lin Shao
N. Sebe
Radu Timofte
Luc Van Gool
46
19
0
14 Mar 2023
Good Neighbors Are All You Need for Chinese Grapheme-to-Phoneme Conversion
Jungjun Kim
C. Han
Gyuhyeon Nam
Gyeongsu Chae
16
2
0
14 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
37
1
0
13 Mar 2023
Transformer Encoder with Multiscale Deep Learning for Pain Classification Using Physiological Signals
Zhenyu Lu
Burcu Ozek
S. Kamarthi
ViT
MedIm
29
15
0
13 Mar 2023
Sequential Spatial Network for Collision Avoidance in Autonomous Driving
Haichuan Li
Liguo Zhou
Zhenshan Bing
M. Khatun
Rolf Jung
Alois C. Knoll
26
1
0
12 Mar 2023
Inducing Neural Collapse to a Fixed Hierarchy-Aware Frame for Reducing Mistake Severity
Tong Liang
Jim Davis
38
11
0
10 Mar 2023
Gradient-Free Structured Pruning with Unlabeled Data
Azade Nova
H. Dai
Dale Schuurmans
SyDa
40
20
0
07 Mar 2023
Variational Inference for Neyman-Scott Processes
Chengkuan Hong
C. Shelton
BDL
13
2
0
07 Mar 2023
Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent
Xiaonan Nie
Yi Liu
Fangcheng Fu
Jinbao Xue
Dian Jiao
Xupeng Miao
Yangyu Tao
Bin Cui
MoE
33
17
0
06 Mar 2023
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
41
4
0
03 Mar 2023
BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis
Lior Yariv
Peter Hedman
Christian Reiser
Dor Verbin
Pratul P. Srinivasan
Richard Szeliski
Jonathan T. Barron
B. Mildenhall
3DGS
AI4CE
35
202
0
28 Feb 2023
Toward Robust Uncertainty Estimation with Random Activation Functions
Y. Stoyanova
Soroush Ghandi
M. Tavakol
UQCV
26
2
0
28 Feb 2023
BrainBERT: Self-supervised representation learning for intracranial recordings
Christopher Wang
Vighnesh Subramaniam
A. Yaari
Gabriel Kreiman
Boris Katz
Ignacio Cases
Andrei Barbu
MedIm
SSL
27
31
0
28 Feb 2023
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding
Yifan Peng
Kwangyoun Kim
Felix Wu
Prashant Sridhar
Shinji Watanabe
27
34
0
27 Feb 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
36
102
0
27 Feb 2023
Language-Driven Representation Learning for Robotics
Siddharth Karamcheti
Suraj Nair
Annie S. Chen
Thomas Kollar
Chelsea Finn
Dorsa Sadigh
Percy Liang
LM&Ro
SSL
47
145
0
24 Feb 2023
Adapting Pre-trained Language Models for Quantum Natural Language Processing
Qiuchi Li
Benyou Wang
Yudong Zhu
Christina Lioma
Qun Liu
AI4CE
37
4
0
24 Feb 2023