ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.12471
  4. Cited By
Neural Network Acceptability Judgments
v1v2v3 (latest)

Neural Network Acceptability Judgments

31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
ArXiv (abs)PDFHTML

Papers citing "Neural Network Acceptability Judgments"

50 / 894 papers shown
Title
Analyzing and Mitigating Interference in Neural Architecture Search
Analyzing and Mitigating Interference in Neural Architecture Search
Jin Xu
Xu Tan
Kaitao Song
Renqian Luo
Yichong Leng
Tao Qin
Tie-Yan Liu
Jian Li
MoMe
72
29
0
29 Aug 2021
Rethinking Why Intermediate-Task Fine-Tuning Works
Rethinking Why Intermediate-Task Fine-Tuning Works
Ting-Yun Chang
Chi-Jen Lu
LRM
96
30
0
26 Aug 2021
Towards Zero-shot Language Modeling
Towards Zero-shot Language Modeling
Edoardo Ponti
Ivan Vulić
Ryan Cotterell
Roi Reichart
Anna Korhonen
87
19
0
06 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple
  Constraints
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDLAI4CE
106
79
0
04 Aug 2021
Local Structure Matters Most: Perturbation Study in NLU
Local Structure Matters Most: Perturbation Study in NLU
Louis Clouâtre
Prasanna Parthasarathi
Payel Das
Sarath Chandar
66
13
0
29 Jul 2021
The Law of Large Documents: Understanding the Structure of Legal
  Contracts Using Visual Cues
The Law of Large Documents: Understanding the Structure of Legal Contracts Using Visual Cues
Allison Hegel
Marina Shah
Genevieve Peaslee
Brendan Roof
Emad Elwany
AILaw
55
8
0
16 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
85
37
0
15 Jul 2021
FLEX: Unifying Evaluation for Few-Shot NLP
FLEX: Unifying Evaluation for Few-Shot NLP
Jonathan Bragg
Arman Cohan
Kyle Lo
Iz Beltagy
270
108
0
15 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
79
45
0
10 Jul 2021
The MultiBERTs: BERT Reproductions for Robustness Analysis
The MultiBERTs: BERT Reproductions for Robustness Analysis
Thibault Sellam
Steve Yadlowsky
Jason W. Wei
Naomi Saphra
Alexander DÁmour
...
Iulia Turc
Jacob Eisenstein
Dipanjan Das
Ian Tenney
Ellie Pavlick
109
95
0
30 Jun 2021
Learning to Sample Replacements for ELECTRA Pre-Training
Learning to Sample Replacements for ELECTRA Pre-Training
Y. Hao
Li Dong
Hangbo Bao
Ke Xu
Furu Wei
MU
45
12
0
25 Jun 2021
LV-BERT: Exploiting Layer Variety for BERT
LV-BERT: Exploiting Layer Variety for BERT
Weihao Yu
Zihang Jiang
Fei Chen
Qibin Hou
Jiashi Feng
MQ
49
0
0
22 Jun 2021
Dive into Deep Learning
Dive into Deep Learning
Aston Zhang
Zachary Chase Lipton
Mu Li
Alexander J. Smola
VLM
100
572
0
21 Jun 2021
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based
  Masked Language-models
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
Elad Ben-Zaken
Shauli Ravfogel
Yoav Goldberg
241
1,245
0
18 Jun 2021
LoRA: Low-Rank Adaptation of Large Language Models
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRLAI4TSAI4CEALMAIMat
624
10,625
0
17 Jun 2021
Algorithm to Compilation Co-design: An Integrated View of Neural Network
  Sparsity
Algorithm to Compilation Co-design: An Integrated View of Neural Network Sparsity
Fu-Ming Guo
Austin Huang
27
1
0
16 Jun 2021
On the proper role of linguistically-oriented deep net analysis in
  linguistic theorizing
On the proper role of linguistically-oriented deep net analysis in linguistic theorizing
Marco Baroni
131
53
0
16 Jun 2021
SAS: Self-Augmentation Strategy for Language Model Pre-training
SAS: Self-Augmentation Strategy for Language Model Pre-training
Yifei Xu
Jingqiao Zhang
Ru He
Liangzhu Ge
Chao Yang
Cheng Yang
Ying Wu
46
1
0
14 Jun 2021
Why Can You Lay Off Heads? Investigating How BERT Heads Transfer
Why Can You Lay Off Heads? Investigating How BERT Heads Transfer
Ting-Rui Chiang
Yun-Nung Chen
38
0
0
14 Jun 2021
RefBERT: Compressing BERT by Referencing to Pre-computed Representations
RefBERT: Compressing BERT by Referencing to Pre-computed Representations
Xinyi Wang
Haiqing Yang
Liang Zhao
Yang Mo
Jianping Shen
MQ
72
4
0
11 Jun 2021
Convolutions and Self-Attention: Re-interpreting Relative Positions in
  Pre-trained Language Models
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models
Tyler A. Chang
Yifan Xu
Weijian Xu
Zhuowen Tu
ViT
57
15
0
10 Jun 2021
Bayesian Attention Belief Networks
Bayesian Attention Belief Networks
Shujian Zhang
Xinjie Fan
Bo Chen
Mingyuan Zhou
BDL
110
32
0
09 Jun 2021
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
Rabeeh Karimi Mahabadi
James Henderson
Sebastian Ruder
MoE
147
494
0
08 Jun 2021
BERT Learns to Teach: Knowledge Distillation with Meta Learning
BERT Learns to Teach: Knowledge Distillation with Meta Learning
Wangchunshu Zhou
Canwen Xu
Julian McAuley
124
87
0
08 Jun 2021
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared
  Hypernetworks
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks
Rabeeh Karimi Mahabadi
Sebastian Ruder
Mostafa Dehghani
James Henderson
MoE
82
313
0
08 Jun 2021
Refiner: Refining Self-attention for Vision Transformers
Refiner: Refining Self-attention for Vision Transformers
Daquan Zhou
Yujun Shi
Bingyi Kang
Weihao Yu
Zihang Jiang
Yuan Li
Xiaojie Jin
Qibin Hou
Jiashi Feng
ViT
96
62
0
07 Jun 2021
Learning Slice-Aware Representations with Mixture of Attentions
Learning Slice-Aware Representations with Mixture of Attentions
Cheng Wang
Sungjin Lee
Sunghyun Park
Han Li
Young-Bum Kim
R. Sarikaya
59
2
0
04 Jun 2021
Generate, Prune, Select: A Pipeline for Counterspeech Generation against
  Online Hate Speech
Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech
Wanzheng Zhu
S. Bhat
66
57
0
03 Jun 2021
A Cluster-based Approach for Improving Isotropy in Contextual Embedding
  Space
A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space
S. Rajaee
Mohammad Taher Pilehvar
75
41
0
02 Jun 2021
Using Integrated Gradients and Constituency Parse Trees to explain
  Linguistic Acceptability learnt by BERT
Using Integrated Gradients and Constituency Parse Trees to explain Linguistic Acceptability learnt by BERT
Anmol Nayak
Hariprasad Timmapathini
47
5
0
01 Jun 2021
Training ELECTRA Augmented with Multi-word Selection
Training ELECTRA Augmented with Multi-word Selection
Jiaming Shen
Jialu Liu
Tianqi Liu
Cong Yu
Jiawei Han
79
9
0
31 May 2021
Greedy-layer Pruning: Speeding up Transformer Models for Natural
  Language Processing
Greedy-layer Pruning: Speeding up Transformer Models for Natural Language Processing
David Peer
Sebastian Stabinger
Stefan Engl
A. Rodríguez-Sánchez
39
28
0
31 May 2021
A Compression-Compilation Framework for On-mobile Real-time BERT
  Applications
A Compression-Compilation Framework for On-mobile Real-time BERT Applications
Wei Niu
Zhenglun Kong
Geng Yuan
Weiwen Jiang
Jiexiong Guan
Caiwen Ding
Pu Zhao
Sijia Liu
Bin Ren
Yanzhi Wang
MQ
27
4
0
30 May 2021
Pre-training Universal Language Representation
Pre-training Universal Language Representation
Yian Li
Hai Zhao
SSL
62
8
0
30 May 2021
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural
  Architecture Search
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search
Jin Xu
Xu Tan
Renqian Luo
Kaitao Song
Jian Li
Tao Qin
Tie-Yan Liu
MQ
62
79
0
30 May 2021
Early Exiting with Ensemble Internal Classifiers
Early Exiting with Ensemble Internal Classifiers
Tianxiang Sun
Yunhua Zhou
Xiangyang Liu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
70
31
0
28 May 2021
Super Tickets in Pre-Trained Language Models: From Model Compression to
  Improving Generalization
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang
Simiao Zuo
Minshuo Chen
Haoming Jiang
Xiaodong Liu
Pengcheng He
T. Zhao
Weizhu Chen
59
69
0
25 May 2021
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on
  the Fly
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly
Yuchen Jin
Dinesh Manocha
Liangyu Zhao
Yibo Zhu
Chuanxiong Guo
Marco Canini
Arvind Krishnamurthy
84
19
0
22 May 2021
KLUE: Korean Language Understanding Evaluation
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice Oh
Jung-Woo Ha
Kyunghyun Cho
ELMVLM
119
198
0
20 May 2021
Long Text Generation by Modeling Sentence-Level and Discourse-Level
  Coherence
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
Jian Guan
Xiaoxi Mao
Changjie Fan
Zitao Liu
Wenbiao Ding
Minlie Huang
AuLLM
95
83
0
19 May 2021
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Jian Guan
Zhexin Zhang
Zhuoer Feng
Zitao Liu
Wenbiao Ding
Xiaoxi Mao
Changjie Fan
Minlie Huang
87
61
0
19 May 2021
How is BERT surprised? Layerwise detection of linguistic anomalies
How is BERT surprised? Layerwise detection of linguistic anomalies
Bai Li
Zining Zhu
Guillaume Thomas
Yang Xu
Frank Rudzicz
76
31
0
16 May 2021
DaLAJ - a dataset for linguistic acceptability judgments for Swedish:
  Format, baseline, sharing
DaLAJ - a dataset for linguistic acceptability judgments for Swedish: Format, baseline, sharing
Elena Volodina
Yousuf Ali Mohammed
Julia Klezl
62
22
0
14 May 2021
The Summary Loop: Learning to Write Abstractive Summaries Without
  Examples
The Summary Loop: Learning to Write Abstractive Summaries Without Examples
Philippe Laban
Andrew Hsi Bloomberg
John F. Canny
Marti A. Hearst
69
56
0
11 May 2021
Benchmarking down-scaled (not so large) pre-trained language models
Benchmarking down-scaled (not so large) pre-trained language models
Matthias Aßenmacher
P. Schulze
C. Heumann
26
1
0
11 May 2021
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction
  from Language Models
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models
Anne Beyer
Sharid Loáiciga
David Schlangen
64
16
0
07 May 2021
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing
  Regressions In NLP Model Updates
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates
Yuqing Xie
Yi-An Lai
Yuanjun Xiong
Yi Zhang
Stefano Soatto
UQCV
59
16
0
07 May 2021
Entailment as Few-Shot Learner
Entailment as Few-Shot Learner
Sinong Wang
Han Fang
Madian Khabsa
Hanzi Mao
Hao Ma
100
184
0
29 Apr 2021
Morph Call: Probing Morphosyntactic Content of Multilingual Transformers
Morph Call: Probing Morphosyntactic Content of Multilingual Transformers
Vladislav Mikhailov
O. Serikov
Ekaterina Artemova
82
9
0
26 Apr 2021
On Geodesic Distances and Contextual Embedding Compression for Text
  Classification
On Geodesic Distances and Contextual Embedding Compression for Text Classification
Rishi Jha
Kai Mihata
25
6
0
22 Apr 2021
Previous
123...131415161718
Next