1805.12471
Neural Network Acceptability Judgments
31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
Papers citing
"Neural Network Acceptability Judgments"
50 / 883 papers shown
Title
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
32
28
0
23 Oct 2022
The Curious Case of Absolute Position Embeddings
Koustuv Sinha
Amirhossein Kazemnejad
Siva Reddy
J. Pineau
Dieuwke Hupkes
Adina Williams
87
15
0
23 Oct 2022
PATS: Sensitivity-aware Noisy Learning for Pretrained Language Models
Yupeng Zhang
Hongzhi Zhang
Sirui Wang
Wei Wu
Zhoujun Li
AAML
37
1
0
22 Oct 2022
Efficiently Tuned Parameters are Task Embeddings
Wangchunshu Zhou
Canwen Xu
Julian McAuley
18
8
0
21 Oct 2022
SLING: Sino Linguistic Evaluation of Large Language Models
Yixiao Song
Kalpesh Krishna
R. Bhatt
Mohit Iyyer
24
8
0
21 Oct 2022
Multitasking Models are Robust to Structural Failure: A Neural Model for Bilingual Cognitive Reserve
Giannis Daras
Negin Raoof
Zoi Gkalitsiou
A. Dimakis
35
2
0
20 Oct 2022
AugCSE: Contrastive Sentence Embedding with Diverse Augmentations
Zilu Tang
Muhammed Yusuf Kocyigit
Derry Wijaya
37
9
0
20 Oct 2022
Learning to Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning
Ruihan Wu
Xiangyu Chen
Chuan Guo
Kilian Q. Weinberger
FedML
20
26
0
19 Oct 2022
Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters
Hongyu Zhao
Hao Tan
Hongyuan Mei
MoE
41
16
0
18 Oct 2022
Deepfake Text Detection: Limitations and Opportunities
Jiameng Pu
Zain Sarwar
Sifat Muhammad Abdullah
A. Rehman
Yoonjin Kim
P. Bhattacharya
M. Javed
Bimal Viswanath
AAML
24
54
0
17 Oct 2022
ConEntail: An Entailment-based Framework for Universal Zero and Few Shot Classification with Supervised Contrastive Pretraining
H. Zhang
Aysa Xuemo Fan
Rui Zhang
VLM
46
3
0
14 Oct 2022
Predicting Fine-Tuning Performance with Probing
Zining Zhu
Soroosh Shahtalebi
Frank Rudzicz
30
9
0
13 Oct 2022
Towards a Unified Multi-Dimensional Evaluator for Text Generation
Ming Zhong
Yang Liu
Da Yin
Yuning Mao
Yizhu Jiao
Peng Liu
Chenguang Zhu
Heng Ji
Jiawei Han
ELM
50
256
0
13 Oct 2022
Task Compass: Scaling Multi-task Pre-training with Task Prefix
ZhuoSheng Zhang
Shuohang Wang
Yichong Xu
Yuwei Fang
Wenhao Yu
Yang Liu
Han Zhao
Chenguang Zhu
Michael Zeng
SSL
LRM
25
16
0
12 Oct 2022
Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models
Isabel Papadimitriou
Kezia Lopez
Daniel Jurafsky
29
0
0
11 Oct 2022
Model Cascading: Towards Jointly Improving Efficiency and Accuracy of NLP Systems
Neeraj Varshney
Chitta Baral
30
27
0
11 Oct 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Qihuang Zhong
Liang Ding
Li Shen
Peng Mi
Juhua Liu
Bo Du
Dacheng Tao
AAML
40
50
0
11 Oct 2022
Instance Regularization for Discriminative Language Model Pre-training
ZhuoSheng Zhang
Hai Zhao
M. Zhou
19
1
0
11 Oct 2022
Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling
Haw-Shiuan Chang
Ruei-Yao Sun
Kathryn Ricci
Andrew McCallum
43
15
0
10 Oct 2022
Parameter-Efficient Tuning with Special Token Adaptation
Xiaocong Yang
James Y. Huang
Wenxuan Zhou
Muhao Chen
34
12
0
10 Oct 2022
Dynamic Latent Separation for Deep Learning
Yi-Lin Tuan
Zih-Yun Chiu
William Yang Wang
36
0
0
07 Oct 2022
Less is More: Task-aware Layer-wise Distillation for Language Model Compression
Chen Liang
Simiao Zuo
Qingru Zhang
Pengcheng He
Weizhu Chen
Tuo Zhao
VLM
50
68
0
04 Oct 2022
Efficient Non-Parametric Optimizer Search for Diverse Tasks
Ruochen Wang
Yuanhao Xiong
Minhao Cheng
Cho-Jui Hsieh
27
5
0
27 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers
Nurullah Sevim
Ege Ozan Özyedek
Furkan Şahinuç
Aykut Koç
40
11
0
26 Sep 2022
Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
Yiren Jian
Chongyang Gao
Soroush Vosoughi
SSL
31
15
0
20 Sep 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
Zichun Yu
Tianyu Gao
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Maosong Sun
Jie Zhou
VLM
LRM
38
1
0
20 Sep 2022
How to Find Strong Summary Coherence Measures? A Toolbox and a Comparative Study for Summary Coherence Measure Evaluation
Julius Steen
K. Markert
HILM
19
4
0
14 Sep 2022
Probing for Understanding of English Verb Classes and Alternations in Large Pre-trained Language Models
David K. Yi
James V. Bruno
Jiayu Han
Peter Zukerman
Shane Steinert-Threlkeld
19
1
0
11 Sep 2022
Let Me Check the Examples: Enhancing Demonstration Learning via Explicit Imitation
Sirui Wang
Kaiwen Wei
Hongzhi Zhang
Yun Li
Wei Wu
39
2
0
31 Aug 2022
Transformers with Learnable Activation Functions
Haishuo Fang
Ji-Ung Lee
N. Moosavi
Iryna Gurevych
AI4CE
25
7
0
30 Aug 2022
Addressing Token Uniformity in Transformers via Singular Value Transformation
Hanqi Yan
Lin Gui
Wenjie Li
Yulan He
35
14
0
24 Aug 2022
Learning Better Masking for Better Language Model Pre-training
Dongjie Yang
ZhuoSheng Zhang
Hai Zhao
37
15
0
23 Aug 2022
A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Avinash Madasu
Anvesh Rao Vijjini
30
0
0
21 Aug 2022
Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Yile Wang
Yue Zhang
26
4
0
20 Aug 2022
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset
Zhihua Jin
Xingbo Wang
Furui Cheng
Chunhui Sun
Qun Liu
Huamin Qu
32
9
0
17 Aug 2022
What Artificial Neural Networks Can Tell Us About Human Language Acquisition
Alex Warstadt
Samuel R. Bowman
27
111
0
17 Aug 2022
Improving the Trainability of Deep Neural Networks through Layerwise Batch-Entropy Regularization
David Peer
Bart Keulen
Sebastian Stabinger
J. Piater
A. Rodríguez-Sánchez
42
6
0
01 Aug 2022
Few-shot Adaptation Works with UnpredicTable Data
Jun Shern Chan
Michael Pieler
Jonathan Jao
Jérémy Scheurer
Ethan Perez
36
5
0
01 Aug 2022
MoEC: Mixture of Expert Clusters
Yuan Xie
Shaohan Huang
Tianyu Chen
Furu Wei
MoE
45
11
0
19 Jul 2022
ELECTRA is a Zero-Shot Learner, Too
Shiwen Ni
Hung-Yu Kao
30
9
0
17 Jul 2022
Forming Trees with Treeformers
Nilay Patel
Jeffrey Flanigan
AI4CE
26
3
0
14 Jul 2022
Gender Biases and Where to Find Them: Exploring Gender Bias in Pre-Trained Transformer-based Language Models Using Movement Pruning
Przemyslaw K. Joniak
Akiko Aizawa
18
27
0
06 Jul 2022
Betti numbers of attention graphs is all you really need
Laida Kushnareva
D. Piontkovski
Irina Piontkovskaya
GNN
27
2
0
05 Jul 2022
Language model compression with weighted low-rank factorization
Yen-Chang Hsu
Ting Hua
Sung-En Chang
Qiang Lou
Yilin Shen
Hongxia Jin
19
93
0
30 Jun 2022
The Topological BERT: Transforming Attention into Topology for Natural Language Processing
Ilan Perez
Raphael Reinauer
30
17
0
30 Jun 2022
Knowledge Distillation of Transformer-based Language Models Revisited
Chengqiang Lu
Jianwei Zhang
Yunfei Chu
Zhengyu Chen
Jingren Zhou
Fei Wu
Haiqing Chen
Hongxia Yang
VLM
27
10
0
29 Jun 2022
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance
Qingru Zhang
Simiao Zuo
Chen Liang
Alexander Bukharin
Pengcheng He
Weizhu Chen
T. Zhao
27
78
0
25 Jun 2022
MVP: Multi-task Supervised Pre-training for Natural Language Generation
Tianyi Tang
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
49
24
0
24 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
77
393
0
17 Jun 2022
Language with Vision: a Study on Grounded Word and Sentence Embeddings
Hassan Shahmohammadi
Maria Heitmeier
Elnaz Shafaei-Bajestan
Hendrik P. A. Lensch
Harald Baayen
25
10
0
17 Jun 2022