Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.12471
Cited By
v1
v2
v3 (latest)
Neural Network Acceptability Judgments
31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Network Acceptability Judgments"
50 / 894 papers shown
Title
How are Prompts Different in Terms of Sensitivity?
Sheng Lu
Hendrik Schuff
Iryna Gurevych
87
19
0
13 Nov 2023
STEER: Unified Style Transfer with Expert Reinforcement
Skyler Hallinan
Faeze Brahman
Ximing Lu
Jaehun Jung
Sean Welleck
Yejin Choi
OffRL
58
14
0
13 Nov 2023
Mirror: A Universal Framework for Various Information Extraction Tasks
Tong Zhu
Junfei Ren
Zijian Yu
Mengsong Wu
Guoliang Zhang
Xiaoye Qu
Wenliang Chen
Zhefeng Wang
Baoxing Huai
Min Zhang
89
14
0
09 Nov 2023
Large GPT-like Models are Bad Babies: A Closer Look at the Relationship between Linguistic Competence and Psycholinguistic Measures
Julius Steuer
Marius Mosbach
Dietrich Klakow
39
10
0
08 Nov 2023
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
Le Yu
Yu Bowen
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
118
336
0
06 Nov 2023
Not all layers are equally as important: Every Layer Counts BERT
Lucas Georges Gabriel Charpentier
David Samuel
89
18
0
03 Nov 2023
Ling-CL: Understanding NLP Models through Linguistic Curricula
Mohamed Elgaar
Hadi Amiri
77
2
0
31 Oct 2023
Evaluating Neural Language Models as Cognitive Models of Language Acquisition
Héctor Javier Vázquez Martínez
Annika Lea Heuser
Charles D. Yang
Jordan Kodner
97
10
0
31 Oct 2023
Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings
David Samuel
54
4
0
30 Oct 2023
Outlier Dimensions Encode Task-Specific Knowledge
William Rudman
Catherine Chen
Carsten Eickhoff
65
5
0
26 Oct 2023
torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP
Yoshitomo Matsubara
VLM
74
1
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
65
1
0
26 Oct 2023
How well can machine-generated texts be identified and can language models be trained to avoid identification?
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
33
1
0
25 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
61
4
0
24 Oct 2023
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization
Tianshi Che
Ji Liu
Yang Zhou
Jiaxiang Ren
Jiwen Zhou
Victor S. Sheng
H. Dai
Dejing Dou
90
56
0
23 Oct 2023
Statistical Depth for Ranking and Characterizing Transformer-Based Text Embeddings
Parker Seegmiller
S. Preum
128
3
0
23 Oct 2023
Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives
Mario Giulianelli
Sarenne Wallbridge
Raquel Fernández
54
15
0
20 Oct 2023
Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models
Miaoxi Zhu
Qihuang Zhong
Li Shen
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
MQ
VLM
66
1
0
20 Oct 2023
Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection
Jianwei Li
Weizhi Gao
Qi Lei
Dongkuan Xu
57
2
0
19 Oct 2023
A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models
Yi Zhou
Jose Camacho-Collados
Danushka Bollegala
153
6
0
19 Oct 2023
Rethinking the Construction of Effective Metrics for Understanding the Mechanisms of Pretrained Language Models
You Li
Jinhui Yin
Yuming Lin
63
0
0
19 Oct 2023
Measuring Pointwise
V
\mathcal{V}
V
-Usable Information In-Context-ly
Sheng Lu
Shan Chen
Yingya Li
Danielle Bitterman
G. Savova
Iryna Gurevych
43
0
0
18 Oct 2023
SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Mohammadreza Salehi
Sachin Mehta
Aditya Kusupati
Ali Farhadi
Hannaneh Hajishirzi
118
6
0
18 Oct 2023
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning
Hao Zhao
Jie Fu
Zhaofeng He
155
6
0
18 Oct 2023
A State-Vector Framework for Dataset Effects
E. Sahak
Zining Zhu
Frank Rudzicz
55
1
0
17 Oct 2023
G10: Enabling An Efficient Unified GPU Memory and Storage Architecture with Smart Tensor Migrations
Haoyang Zhang
Yirui Eric Zhou
Yu Xue
Yiqi Liu
Jian Huang
34
18
0
13 Oct 2023
Split-and-Denoise: Protect large language model inference with local differential privacy
Peihua Mai
Ran Yan
Zhe Huang
Youjia Yang
Yan Pang
71
14
0
13 Oct 2023
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Yixiao Li
Yifan Yu
Chen Liang
Pengcheng He
Nikos Karampatziakis
Weizhu Chen
Tuo Zhao
MQ
136
149
0
12 Oct 2023
Faithfulness Measurable Masked Language Models
Andreas Madsen
Siva Reddy
Sarath Chandar
79
3
0
11 Oct 2023
ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning
Wang Lu
Hao Yu
Jindong Wang
Damien Teney
Haohan Wang
Yiqiang Chen
Qiang Yang
Xing Xie
Xiangyang Ji
97
8
0
08 Oct 2023
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Cheng Zhang
Jianyi Cheng
Ilia Shumailov
George A. Constantinides
Yiren Zhao
MQ
79
10
0
08 Oct 2023
Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration
Ercong Nie
Helmut Schmid
Hinrich Schütze
UQCV
90
2
0
08 Oct 2023
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Zihao Lin
Yan Sun
Yifan Shi
Xueqian Wang
Lifu Huang
Li Shen
Dacheng Tao
93
12
0
04 Oct 2023
Defending Against Authorship Identification Attacks
Haining Wang
56
2
0
02 Oct 2023
JCoLA: Japanese Corpus of Linguistic Acceptability
Taiga Someya
Yushi Sugimoto
Yohei Oseki
61
6
0
22 Sep 2023
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Tianhua Zhang
Jiaxin Ge
Hongyin Luo
Yung-Sung Chuang
Mingye Gao
Yuan Gong
Xixin Wu
Yoon Kim
Helen M. Meng
James R. Glass
LRM
ReLM
143
16
0
19 Sep 2023
The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
102
7
0
16 Sep 2023
Anchor Points: Benchmarking Models with Much Fewer Examples
Rajan Vivek
Kawin Ethayarajh
Diyi Yang
Douwe Kiela
ALM
116
28
0
14 Sep 2023
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim
Hyunmo Yang
Younghyun Kim
Youngjoon Hong
Eunbyung Park
AI4CE
73
18
0
13 Sep 2023
Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity
Joseph Gatto
Omar Sharif
Parker Seegmiller
Philip Bohlman
S. Preum
33
8
0
12 Sep 2023
DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning
Zhengxiang Shi
Aldo Lipani
VLM
124
34
0
11 Sep 2023
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
Bin Wang
Zhengyuan Liu
Xin Huang
Fangkai Jiao
Yang Ding
Ai Ti Aw
Nancy F. Chen
LRM
91
75
0
09 Sep 2023
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer
Zachary Horvitz
Ajay Patel
Chris Callison-Burch
Zhou Yu
Kathleen McKeown
DiffM
99
14
0
29 Aug 2023
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao
Bo Wan
Yanzhe Zhang
Xuecong Jia
Huchuan Lu
Long Chen
VLM
79
19
0
28 Aug 2023
Improving Knowledge Distillation for BERT Models: Loss Functions, Mapping Methods, and Weight Tuning
Apoorv Dankar
Adeem Jassani
Kartikaeya Kumar
18
1
0
26 Aug 2023
Leveraging Knowledge and Reinforcement Learning for Enhanced Reliability of Language Models
Nancy Tyagi
Surjodeep Sarkar
Manas Gaur
KELM
53
1
0
25 Aug 2023
Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices
Elizaveta Kostenok
D. Cherniavskii
Alexey Zaytsev
83
6
0
22 Aug 2023
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Neel Guha
Julian Nyarko
Daniel E. Ho
Christopher Ré
Adam Chilton
...
Spencer Williams
Sunny G. Gandhi
Tomer Zur
Varun J. Iyer
Zehua Li
AILaw
LRM
ELM
82
182
0
20 Aug 2023
Don't lose the message while paraphrasing: A study on content preserving style transfer
N. Babakov
David Dale
I. Gusev
I. Krotova
Alexander Panchenko
67
21
0
17 Aug 2023
Lightweight Adaptation of Neural Language Models via Subspace Embedding
Amit Kumar Jaiswal
Haiming Liu
57
2
0
16 Aug 2023
Previous
1
2
3
...
5
6
7
...
16
17
18
Next