Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.08415
Cited By
Gaussian Error Linear Units (GELUs)
27 June 2016
Dan Hendrycks
Kevin Gimpel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Gaussian Error Linear Units (GELUs)"
50 / 945 papers shown
Title
LLIC: Large Receptive Field Transform Coding with Adaptive Weights for Learned Image Compression
Wei Jiang
Peirong Ning
Jiayu Yang
Yongqi Zhai
Feng Gao
Ronggang Wang
38
6
0
19 Apr 2023
CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression
Mubariz Zaffar
Liangliang Nan
Julian F. P. Kooij
27
2
0
14 Apr 2023
Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task
S. Ruan
Allen Nie
William Steenbergen
Jiayu He
JQ Zhang
...
Kyle Dang Nguyen
Catherine Y Wang
Rui Ying
James A. Landay
Emma Brunskill
28
18
0
11 Apr 2023
DeFeeNet: Consecutive 3D Human Motion Prediction with Deviation Feedback
Xiaoning Sun
Huaijiang Sun
Bin Li
Dong Wei
Weiqing Li
Jianfeng Lu
3DH
36
6
0
10 Apr 2023
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following
Mingyu Ding
Yan Xu
Zhenfang Chen
David D. Cox
Ping Luo
J. Tenenbaum
Chuang Gan
LM&Ro
64
21
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
30
41
0
07 Apr 2023
ClothCombo: Modeling Inter-Cloth Interaction for Draping Multi-Layered Clothes
Dohae Lee
Hyun Kang
In-Kwon Lee
3DH
AI4CE
36
7
0
07 Apr 2023
Anomaly Detection via Gumbel Noise Score Matching
Ahsan Mahmood
Junier Oliva
Martin Styner
26
1
0
06 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
110
6,867
0
05 Apr 2023
Industrial Anomaly Detection with Domain Shift: A Real-world Dataset and Masked Multi-scale Reconstruction
Zilong Zhang
Zhibin Zhao
Xingwu Zhang
Chuang Sun
Xuefeng Chen
32
50
0
05 Apr 2023
Blockwise Compression of Transformer-based Models without Retraining
Gaochen Dong
W. Chen
26
3
0
04 Apr 2023
TransPimLib: A Library for Efficient Transcendental Functions on Processing-in-Memory Systems
Maurus Item
Juan Gómez Luna
Yu-Yin Guo
Geraldo F. Oliveira
Mohammad Sadrosadati
O. Mutlu
40
5
0
03 Apr 2023
Transformer-based interpretable multi-modal data fusion for skin lesion classification
Theodor Cheslerean-Boghiu
Melia-Evelina Fleischmann
Theresa Willem
Tobias Lasser
ViT
MedIm
AI4CE
29
2
0
03 Apr 2023
CNNs with Multi-Level Attention for Domain Generalization
Aristotelis Ballas
Christos Diou
OOD
29
6
0
02 Apr 2023
Resolution-Invariant Image Classification based on Fourier Neural Operators
Samira Kabri
Tim Roith
Daniel Tenbrinck
Martin Burger
29
6
0
02 Apr 2023
Hierarchical Vision Transformers for Cardiac Ejection Fraction Estimation
Lhuqita Fazry
Asep Haryono
Nuzulul Khairu Nissa
Sunarno
Naufal Muhammad Hirzi
M. F. Rachmadi
W. Jatmiko
MedIm
16
16
0
31 Mar 2023
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X
Qinkai Zheng
Xiao Xia
Xu Zou
Yuxiao Dong
Shanshan Wang
...
Andi Wang
Yang Li
Teng Su
Zhilin Yang
Jie Tang
ELM
ALM
SyDa
71
320
0
30 Mar 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
99
793
0
30 Mar 2023
Ensemble weather forecast post-processing with a flexible probabilistic neural network approach
P. Mlakar
J. Merse
Jana Faganeli Pucer
25
4
0
29 Mar 2023
GNNBuilder: An Automated Framework for Generic Graph Neural Network Accelerator Generation, Simulation, and Optimization
Stefan Abi-Karam
Cong Hao
GNN
36
7
0
29 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
48
119
0
29 Mar 2023
Multi-modal learning for geospatial vegetation forecasting
V. Benson
Claire Robin
C. Requena-Mesa
Lazaro Alonso
Nuno Carvalhais
José A. Cortés
Zhihan Gao
Nora Linscheid
M. Weynants
Markus Reichstein
30
11
0
28 Mar 2023
SELF-VS: Self-supervised Encoding Learning For Video Summarization
Hojjat Mokhtarabadi
Kaveh Bahraman
M. Hosseinzadeh
M. Eftekhari
AI4TS
SSL
ViT
25
0
0
28 Mar 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
Man Liu
Feng Li
Chunjie Zhang
Yunchao Wei
H. Bai
Yao-Min Zhao
47
39
0
27 Mar 2023
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning
Siteng Huang
Biao Gong
Yutong Feng
Min Zhang
Yiliang Lv
Donglin Wang
CoGe
35
10
0
27 Mar 2023
ReBotNet: Fast Real-time Video Enhancement
Jeya Maria Jose Valanarasu
Rahul Garg
Andeep S. Toor
Xin Tong
Weijuan Xi
Andreas Lugmayr
Vishal M. Patel
A. Menini
34
0
0
23 Mar 2023
Towards Better Dynamic Graph Learning: New Architecture and Unified Library
Le Yu
Leilei Sun
Bowen Du
Weifeng Lv
AI4CE
29
99
0
23 Mar 2023
Online Transformers with Spiking Neurons for Fast Prosthetic Hand Control
Nathan Leroux
Jan Finkbeiner
Emre Neftci
41
9
0
21 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
47
20
0
17 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Fabian Isensee
Paul F. Jaeger
Klaus Maier-Hein
ViT
MedIm
40
138
0
17 Mar 2023
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
Sara Shoouri
Mingyu Yang
Zichen Fan
Hun-Seok Kim
MoE
28
3
0
16 Mar 2023
Block-wise Bit-Compression of Transformer-based Models
Gaochen Dong
W. Chen
24
0
0
16 Mar 2023
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
Haoyu He
Jianfei Cai
Jing Zhang
Dacheng Tao
Bohan Zhuang
VPVLM
22
50
0
15 Mar 2023
Graph Transformer GANs for Graph-Constrained House Generation
Hao Tang
Zhenyu Zhang
Humphrey Shi
Bo-wen Li
Lin Shao
N. Sebe
Radu Timofte
Luc Van Gool
46
19
0
14 Mar 2023
Good Neighbors Are All You Need for Chinese Grapheme-to-Phoneme Conversion
Jungjun Kim
C. Han
Gyuhyeon Nam
Gyeongsu Chae
16
2
0
14 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
37
1
0
13 Mar 2023
Transformer Encoder with Multiscale Deep Learning for Pain Classification Using Physiological Signals
Zhenyu Lu
Burcu Ozek
S. Kamarthi
ViT
MedIm
29
15
0
13 Mar 2023
Sequential Spatial Network for Collision Avoidance in Autonomous Driving
Haichuan Li
Liguo Zhou
Zhenshan Bing
M. Khatun
Rolf Jung
Alois C. Knoll
26
1
0
12 Mar 2023
Inducing Neural Collapse to a Fixed Hierarchy-Aware Frame for Reducing Mistake Severity
Tong Liang
Jim Davis
38
11
0
10 Mar 2023
Gradient-Free Structured Pruning with Unlabeled Data
Azade Nova
H. Dai
Dale Schuurmans
SyDa
40
20
0
07 Mar 2023
Variational Inference for Neyman-Scott Processes
Chengkuan Hong
C. Shelton
BDL
13
2
0
07 Mar 2023
Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent
Xiaonan Nie
Yi Liu
Fangcheng Fu
Jinbao Xue
Dian Jiao
Xupeng Miao
Yangyu Tao
Bin Cui
MoE
33
17
0
06 Mar 2023
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
41
4
0
03 Mar 2023
BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis
Lior Yariv
Peter Hedman
Christian Reiser
Dor Verbin
Pratul P. Srinivasan
Richard Szeliski
Jonathan T. Barron
B. Mildenhall
3DGS
AI4CE
35
202
0
28 Feb 2023
Toward Robust Uncertainty Estimation with Random Activation Functions
Y. Stoyanova
Soroush Ghandi
M. Tavakol
UQCV
26
2
0
28 Feb 2023
BrainBERT: Self-supervised representation learning for intracranial recordings
Christopher Wang
Vighnesh Subramaniam
A. Yaari
Gabriel Kreiman
Boris Katz
Ignacio Cases
Andrei Barbu
MedIm
SSL
27
31
0
28 Feb 2023
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding
Yifan Peng
Kwangyoun Kim
Felix Wu
Prashant Sridhar
Shinji Watanabe
27
34
0
27 Feb 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
36
102
0
27 Feb 2023
Language-Driven Representation Learning for Robotics
Siddharth Karamcheti
Suraj Nair
Annie S. Chen
Thomas Kollar
Chelsea Finn
Dorsa Sadigh
Percy Liang
LM&Ro
SSL
47
145
0
24 Feb 2023
Adapting Pre-trained Language Models for Quantum Natural Language Processing
Qiuchi Li
Benyou Wang
Yudong Zhu
Christina Lioma
Qun Liu
AI4CE
37
4
0
24 Feb 2023
Previous
1
2
3
...
7
8
9
...
17
18
19
Next