1704.05426
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
18 April 2017
Adina Williams
Nikita Nangia
Samuel R. Bowman
Papers citing
"A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference"
50 / 2,772 papers shown
Data Contamination Through the Lens of Time
Manley Roberts
Himanshu Thakur
Christine Herlihy
Colin White
Samuel Dooley
130
32
0
16 Oct 2023
One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer
Fabian David Schmidt
Ivan Vulić
Goran Glavaš
MoMe
44
4
0
16 Oct 2023
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li
Shaonan Wang
Yunhao Zhang
Jiajun Zhang
Chengqing Zong
80
6
0
16 Oct 2023
A Search for Prompts: Generating Structured Answers from Contracts
Adam Roegiest
Radha Chitta
Jonathan Donnelly
Maya Lash
A. Vtyurina
François Longtin
ELM
AILaw
54
2
0
16 Oct 2023
RSVP: Customer Intent Detection via Agent Response Contrastive and Generative Pre-Training
Yu-Chien Tang
Wei-Yao Wang
An-Zi Yen
Wenjie Peng
73
1
0
15 Oct 2023
DPZero: Private Fine-Tuning of Language Models without Backpropagation
Liang Zhang
Bingcong Li
K. K. Thekumparampil
Sewoong Oh
Niao He
96
15
0
14 Oct 2023
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters
Yixin Wan
George Pu
Jiao Sun
Aparna Garimella
Kai-Wei Chang
Nanyun Peng
116
200
0
13 Oct 2023
Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning
Hongfu Liu
Ye Wang
80
9
0
13 Oct 2023
InstructTODS: Large Language Models for End-to-End Task-Oriented Dialogue Systems
Willy Chung
Samuel Cahyawijaya
Bryan Wilie
Holy Lovenia
Pascale Fung
80
6
0
13 Oct 2023
GLoRE: Evaluating Logical Reasoning of Large Language Models
Hanmeng Liu
Zhiyang Teng
Ruoxi Ning
Jian Liu
Qiji Zhou
Yuexin Zhang
Yue Zhang
ReLM
ELM
LRM
164
8
0
13 Oct 2023
Tokenizer Choice For LLM Training: Negligible or Crucial?
Mehdi Ali
Michael Fromm
Klaudia Thellmann
Richard Rutmann
Max Lübbering
...
Malte Ostendorff
Samuel Weinbach
R. Sifa
Stefan Kesselheim
Nicolas Flores-Herr
116
61
0
12 Oct 2023
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Yixiao Li
Yifan Yu
Chen Liang
Pengcheng He
Nikos Karampatziakis
Weizhu Chen
Tuo Zhao
MQ
138
149
0
12 Oct 2023
Effects of Human Adversarial and Affable Samples on BERT Generalization
Aparna Elangovan
Jiayuan He
Yuan Li
Karin Verspoor
108
3
0
12 Oct 2023
Language Models are Universal Embedders
Xin Zhang
Zehan Li
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
Min Zhang
KELM
ELM
290
9
0
12 Oct 2023
Faithfulness Measurable Masked Language Models
Andreas Madsen
Siva Reddy
Sarath Chandar
83
3
0
11 Oct 2023
Fast-ELECTRA for Efficient Pre-training
Chengyu Dong
Liyuan Liu
Hao Cheng
Jingbo Shang
Jianfeng Gao
Xiaodong Liu
79
2
0
11 Oct 2023
Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization
Caoyun Fan
Wenqing Chen
Jidong Tian
Yitian Li
Hao He
Yaohui Jin
CML
65
4
0
10 Oct 2023
FTFT: Efficient and Robust Fine-Tuning by Transferring Training Dynamics
Yupei Du
Albert Gatt
Dong Nguyen
71
1
0
10 Oct 2023
Mitigating Simplicity Bias in Deep Learning for Improved OOD Generalization and Robustness
Bhavya Vasudeva
Kameron Shahabi
Vatsal Sharan
67
4
0
09 Oct 2023
Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution
Xinze Li
Yixin Cao
Liangming Pan
Yubo Ma
Aixin Sun
HILM
40
21
0
09 Oct 2023
DORIS-MAE: Scientific Document Retrieval using Multi-level Aspect-based Queries
Jianyou Wang
Kaicheng Wang
Xiaoyue Wang
Prudhviraj Naidu
Leon Bergen
R. Paturi
103
11
0
07 Oct 2023
From Nuisance to News Sense: Augmenting the News with Cross-Document Evidence and Context
Jeremiah Milbauer
Ziqi Ding
Zhijin Wu
Tongshuang Wu
88
2
0
06 Oct 2023
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Tom Sherborne
Naomi Saphra
Pradeep Dasigi
Hao Peng
58
5
0
05 Oct 2023
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Zihao Lin
Yan Sun
Yifan Shi
Xueqian Wang
Lifu Huang
Li Shen
Dacheng Tao
98
12
0
04 Oct 2023
Beyond Labeling Oracles: What does it mean to steal ML models?
Avital Shafran
Ilia Shumailov
Murat A. Erdogdu
Nicolas Papernot
AAML
91
4
0
03 Oct 2023
SEA: Sparse Linear Attention with Estimated Attention Mask
Heejun Lee
Jina Kim
Jeffrey Willette
Sung Ju Hwang
162
7
0
03 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALM
LRM
120
216
0
02 Oct 2023
Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models
Man Luo
Shrinidhi Kumbhar
Ming Shen
Mihir Parmar
Neeraj Varshney
Pratyay Banerjee
Somak Aditya
Chitta Baral
ReLM
ELM
LRM
137
31
0
02 Oct 2023
TRAM: Benchmarking Temporal Reasoning for Large Language Models
Yuqing Wang
Yun Zhao
LRM
111
14
0
02 Oct 2023
A Novel Computational and Modeling Foundation for Automatic Coherence Assessment
Aviya Maimon
Reut Tsarfaty
116
5
0
01 Oct 2023
Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering
Han Zhou
Xingchen Wan
Lev Proleev
Diana Mincu
Jilin Chen
Katherine A. Heller
Subhrajit Roy
UQLM
87
61
0
29 Sep 2023
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks
Hao Chen
Jindong Wang
Ankit Shah
Ran Tao
Hongxin Wei
Berfin Şimşek
Masashi Sugiyama
Bhiksha Raj
110
32
0
29 Sep 2023
Discovering environments with XRM
Mohammad Pezeshki
Diane Bouchacourt
Mark Ibrahim
Jimuyang Zhang
Pascal Vincent
David Lopez-Paz
98
19
0
28 Sep 2023
Multi-Modal Financial Time-Series Retrieval Through Latent Space Projections
Tom Bamford
Andrea Coletta
Elizabeth Fons
Sriram Gopalakrishnan
Svitlana Vyetrenko
T. Balch
Manuela Veloso
AI4TS
52
11
0
28 Sep 2023
ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers
Junjie Yin
Jiahao Dong
Yingheng Wang
Christopher De Sa
Volodymyr Kuleshov
MQ
68
6
0
28 Sep 2023
Do We Run How We Say We Run? Formalization and Practice of Governance in OSS Communities
Mahasweta Chakraborti
Curtis Atkisson
Stefan Stanciulescu
V. Filkov
Seth Frey
83
5
0
25 Sep 2023
Substituting Data Annotation with Balanced Updates and Collective Loss in Multi-label Text Classification
Muberra Ozmen
Joseph Cotnareanu
Mark Coates
34
0
0
24 Sep 2023
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
M. Kabir
Mohammed Saidul Islam
Md Tahmid Rahman Laskar
Mir Tafseer Nayeem
M Saiful Bari
Enamul Hoque
LM&MA
77
17
0
22 Sep 2023
AnglE-optimized Text Embeddings
Xianming Li
Jing Li
RALM
91
101
0
22 Sep 2023
Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation
Ali Mousavi
Xin Zhan
Richard He Bai
Peng Shi
Theo Rekatsinas
...
Jeff Pound
Josh Susskind
Natalie Schluter
Ihab F. Ilyas
Navdeep Jaitly
60
2
0
20 Sep 2023
What Learned Representations and Influence Functions Can Tell Us About Adversarial Examples
Shakila Mahjabin Tonni
Mark Dras
TDI
AAML
GAN
62
0
0
19 Sep 2023
Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Suhas Kotha
Jacob Mitchell Springer
Aditi Raghunathan
CLL
128
71
0
18 Sep 2023
Not Enough Labeled Data? Just Add Semantics: A Data-Efficient Method for Inferring Online Health Texts
Joseph Gatto
S. Preum
AI4MH
59
1
0
18 Sep 2023
Adapting Large Language Models via Reading Comprehension
Daixuan Cheng
Shaohan Huang
Furu Wei
CLL
SyDa
AI4CE
88
36
0
18 Sep 2023
Mitigating Shortcuts in Language Models with Soft Label Encoding
Zirui He
Huiqi Deng
Haiyan Zhao
Ninghao Liu
Jundong Li
69
2
0
17 Sep 2023
The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
102
7
0
16 Sep 2023
Rethinking STS and NLI in Large Language Models
Yuxia Wang
Minghan Wang
Preslav Nakov
LRM
71
3
0
16 Sep 2023
X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs
Juan Diego Rodriguez
Katrin Erk
Greg Durrett
94
4
0
16 Sep 2023
Towards Last-layer Retraining for Group Robustness with Fewer Annotations
Tyler LaBonte
Vidya Muthukumar
Abhishek Kumar
104
42
0
15 Sep 2023
Anchor Points: Benchmarking Models with Much Fewer Examples
Rajan Vivek
Kawin Ethayarajh
Diyi Yang
Douwe Kiela
ALM
116
28
0
14 Sep 2023