Neural Network Acceptability Judgments

31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman

Papers citing "Neural Network Acceptability Judgments"

50 / 894 papers shown
Mask More and Mask Later: Efficient Pre-training of Masked Language Models by Disentangling the [MASK] Token
Baohao Liao
David Thulke
Sanjika Hewavitharana
Hermann Ney
Christof Monz
75
9
0
09 Nov 2022
How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers
Michael Hassid
Hao Peng
Daniel Rotem
Jungo Kasai
Ivan Montero
Noah A. Smith
Roy Schwartz
96
26
0
07 Nov 2022
Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
Yu Meng
Martin Michalski
Jiaxin Huang
Yu Zhang
Tarek Abdelzaher
Jiawei Han
VLM
120
49
0
06 Nov 2022
LMentry: A Language Model Benchmark of Elementary Language Tasks
Avia Efrat
Or Honovich
Omer Levy
104
20
0
03 Nov 2022
Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively
Haojie Zhang
Ge Li
Jia Li
Zhongjin Zhang
Yuqi Zhu
Zhi Jin
AI4CE
57
28
0
03 Nov 2022
Numerical Optimizations for Weighted Low-rank Estimation on Language Model
Ting Hua
Yen-Chang Hsu
Felicity Wang
Qiang Lou
Yilin Shen
Hongxia Jin
70
15
0
02 Nov 2022
Where to start? Analyzing the potential value of intermediate models
Leshem Choshen
Elad Venezian
Shachar Don-Yehiya
Noam Slonim
Yoav Katz
MoMe
95
27
0
31 Oct 2022
Learning New Tasks from a Few Examples with Soft-Label Prototypes
Avyav Kumar Singh
Ekaterina Shutova
H. Yannakoudakis
VLM
89
0
0
31 Oct 2022
Empirical Evaluation of Post-Training Quantization Methods for Language Tasks
Ting Hu
Christoph Meinel
Haojin Yang
MQ
94
3
0
29 Oct 2022
Probing for targeted syntactic knowledge through grammatical error detection
Christopher Davis
Christopher Bryant
Andrew Caines
Marek Rei
P. Buttery
45
4
0
28 Oct 2022
MABEL: Attenuating Gender Bias using Textual Entailment Data
Jacqueline He
Mengzhou Xia
C. Fellbaum
Danqi Chen
54
32
0
26 Oct 2022
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning
Barun Patra
Saksham Singhal
Shaohan Huang
Zewen Chi
Li Dong
Furu Wei
Vishrav Chaudhary
Xia Song
127
24
0
26 Oct 2022
Analyzing Multi-Task Learning for Abstractive Text Summarization
Frederic Kirstein
Jan Philip Wahle
Terry Ruas
Bela Gipp
75
4
0
26 Oct 2022
Multilingual Relation Classification via Efficient and Effective Prompting
Yuxuan Chen
David Harbecke
Leonhard Hennig
LRM
78
12
0
25 Oct 2022
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
80
29
0
23 Oct 2022
The Curious Case of Absolute Position Embeddings
Koustuv Sinha
Amirhossein Kazemnejad
Siva Reddy
J. Pineau
Dieuwke Hupkes
Adina Williams
135
15
0
23 Oct 2022
PATS: Sensitivity-aware Noisy Learning for Pretrained Language Models
Yupeng Zhang
Hongzhi Zhang
Sirui Wang
Wei Wu
Zhoujun Li
AAML
85
1
0
22 Oct 2022
Efficiently Tuned Parameters are Task Embeddings
Wangchunshu Zhou
Canwen Xu
Julian McAuley
58
8
0
21 Oct 2022
SLING: Sino Linguistic Evaluation of Large Language Models
Yixiao Song
Kalpesh Krishna
R. Bhatt
Mohit Iyyer
83
10
0
21 Oct 2022
Multitasking Models are Robust to Structural Failure: A Neural Model for Bilingual Cognitive Reserve
Giannis Daras
Negin Raoof
Zoi Gkalitsiou
A. Dimakis
62
3
0
20 Oct 2022
AugCSE: Contrastive Sentence Embedding with Diverse Augmentations
Zilu Tang
Muhammed Yusuf Kocyigit
Derry Wijaya
110
9
0
20 Oct 2022
Learning to Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning
Ruihan Wu
Xiangyu Chen
Chuan Guo
Kilian Q. Weinberger
FedML
69
30
0
19 Oct 2022
Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters
Hongyu Zhao
Hao Tan
Hongyuan Mei
MoE
81
18
0
18 Oct 2022
Deepfake Text Detection: Limitations and Opportunities
Jiameng Pu
Zain Sarwar
Sifat Muhammad Abdullah
A. Rehman
Yoonjin Kim
P. Bhattacharya
M. Javed
Bimal Viswanath
AAML
68
57
0
17 Oct 2022
ConEntail: An Entailment-based Framework for Universal Zero and Few Shot Classification with Supervised Contrastive Pretraining
H. Zhang
Aysa Xuemo Fan
Rui Zhang
VLM
85
3
0
14 Oct 2022
Predicting Fine-Tuning Performance with Probing
Zining Zhu
Soroosh Shahtalebi
Frank Rudzicz
60
10
0
13 Oct 2022
Towards a Unified Multi-Dimensional Evaluator for Text Generation
Ming Zhong
Yang Liu
Da Yin
Yuning Mao
Yizhu Jiao
Peng Liu
Chenguang Zhu
Heng Ji
Jiawei Han
ELM
107
276
0
13 Oct 2022
Task Compass: Scaling Multi-task Pre-training with Task Prefix
Zhuosheng Zhang
Shuohang Wang
Yichong Xu
Yuwei Fang
Wenhao Yu
Yang Liu
Han Zhao
Chenguang Zhu
Michael Zeng
SSL, LRM
70
16
0
12 Oct 2022
Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models
Isabel Papadimitriou
Kezia Lopez
Daniel Jurafsky
40
0
0
11 Oct 2022
Model Cascading: Towards Jointly Improving Efficiency and Accuracy of NLP Systems
Neeraj Varshney
Chitta Baral
78
28
0
11 Oct 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Qihuang Zhong
Liang Ding
Li Shen
Peng Mi
Juhua Liu
Bo Du
Dacheng Tao
AAML
90
51
0
11 Oct 2022
Instance Regularization for Discriminative Language Model Pre-training
Zhuosheng Zhang
Hai Zhao
M. Zhou
88
1
0
11 Oct 2022
Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling
Haw-Shiuan Chang
Ruei-Yao Sun
Kathryn Ricci
Andrew McCallum
108
15
0
10 Oct 2022
Parameter-Efficient Tuning with Special Token Adaptation
Xiaocong Yang
James Y. Huang
Wenxuan Zhou
Muhao Chen
89
12
0
10 Oct 2022
Dynamic Latent Separation for Deep Learning
Yi-Lin Tuan
Zih-Yun Chiu
William Yang Wang
66
0
0
07 Oct 2022
Less is More: Task-aware Layer-wise Distillation for Language Model Compression
Chen Liang
Simiao Zuo
Qingru Zhang
Pengcheng He
Weizhu Chen
Tuo Zhao
VLM
108
74
0
04 Oct 2022
Efficient Non-Parametric Optimizer Search for Diverse Tasks
Ruochen Wang
Yuanhao Xiong
Minhao Cheng
Cho-Jui Hsieh
102
5
0
27 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers
Nurullah Sevim
Ege Ozan Özyedek
Furkan Şahinuç
Aykut Koç
85
12
0
26 Sep 2022
Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
Yiren Jian
Chongyang Gao
Soroush Vosoughi
SSL
95
15
0
20 Sep 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
Zichun Yu
Tianyu Gao
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Maosong Sun
Jie Zhou
VLM, LRM
51
1
0
20 Sep 2022
How to Find Strong Summary Coherence Measures? A Toolbox and a Comparative Study for Summary Coherence Measure Evaluation
Julius Steen
K. Markert
HILM
44
4
0
14 Sep 2022
Probing for Understanding of English Verb Classes and Alternations in Large Pre-trained Language Models
David K. Yi
James V. Bruno
Jiayu Han
Peter Zukerman
Shane Steinert-Threlkeld
32
1
0
11 Sep 2022
Let Me Check the Examples: Enhancing Demonstration Learning via Explicit Imitation
Sirui Wang
Kaiwen Wei
Hongzhi Zhang
Yun Li
Wei Wu
61
3
0
31 Aug 2022
Transformers with Learnable Activation Functions
Haishuo Fang
Ji-Ung Lee
N. Moosavi
Iryna Gurevych
AI4CE
46
8
0
30 Aug 2022
Addressing Token Uniformity in Transformers via Singular Value Transformation
Hanqi Yan
Lin Gui
Wenjie Li
Yulan He
75
15
0
24 Aug 2022
Learning Better Masking for Better Language Model Pre-training
Dongjie Yang
Zhuosheng Zhang
Hai Zhao
80
15
0
23 Aug 2022
A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Avinash Madasu
Anvesh Rao Vijjini
32
0
0
21 Aug 2022
Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Yile Wang
Yue Zhang
59
5
0
20 Aug 2022
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset
Zhihua Jin
Xingbo Wang
Furui Cheng
Chunhui Sun
Qun Liu
Huamin Qu
56
9
0
17 Aug 2022
What Artificial Neural Networks Can Tell Us About Human Language Acquisition
Alex Warstadt
Samuel R. Bowman
88
120
0
17 Aug 2022