Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.05426
Cited By
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
18 April 2017
Adina Williams
Nikita Nangia
Samuel R. Bowman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference"
50 / 2,724 papers shown
Title
GPT or BERT: why not both?
Lucas Georges Gabriel Charpentier
David Samuel
55
5
0
31 Dec 2024
In-Context Learning with Iterative Demonstration Selection
Chengwei Qin
Aston Zhang
Chong Chen
Anirudh Dagar
Wenming Ye
LRM
70
38
0
31 Dec 2024
Cut the Deadwood Out: Post-Training Model Purification with Selective Module Substitution
Yao Tong
Weijun Li
Xuanli He
Haolan Zhan
Qiongkai Xu
AAML
38
1
0
31 Dec 2024
MapExplorer: New Content Generation from Low-Dimensional Visualizations
Xingjian Zhang
Ziyang Xiong
Shixuan Liu
Yutong Xie
Tolga Ergen
Dongsub Shim
Hua Xu
Honglak Lee
Qiaozhu Me
41
0
0
24 Dec 2024
Adversarial Robustness through Dynamic Ensemble Learning
Hetvi Waghela
Jaydip Sen
Sneha Rakshit
AAML
91
0
0
20 Dec 2024
Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimization
Yue Zhang
Liqiang Jing
Vibhav Gogate
116
2
0
19 Dec 2024
A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI
Beiduo Chen
Siyao Peng
Anna Korhonen
Barbara Plank
76
0
0
18 Dec 2024
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Benjamin Warner
Antoine Chaffin
Benjamin Clavié
Orion Weller
Oskar Hallström
...
Tom Aarsen
Nathan Cooper
Griffin Adams
Jeremy Howard
Iacopo Poli
90
79
0
18 Dec 2024
In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning
Yifei Duan
Liu Li
Zirui Zhai
Jinxia Yao
80
0
0
17 Dec 2024
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Reasoning
Hongbin Zhang
K. Chen
Xuefeng Bai
Yang Xiang
Min Zhang
79
0
0
17 Dec 2024
Coverage-based Fairness in Multi-document Summarization
Haoyuan Li
Yusen Zhang
Rui Zhang
Snigdha Chaturvedi
80
0
0
11 Dec 2024
The Pitfalls of Memorization: When Memorization Hurts Generalization
Reza Bayat
Mohammad Pezeshki
Elvis Dohmatob
David Lopez-Paz
Pascal Vincent
OOD
105
3
0
10 Dec 2024
The Vulnerability of Language Model Benchmarks: Do They Accurately Reflect True LLM Performance?
Sourav Banerjee
Ayushi Agarwal
Eishkaran Singh
ELM
73
2
0
02 Dec 2024
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Chancharik Mitra
Brandon Huang
Tianning Chai
Zhiqiu Lin
Assaf Arbelle
Rogerio Feris
Leonid Karlinsky
Trevor Darrell
Deva Ramanan
Roei Herzig
VLM
134
4
0
28 Nov 2024
Sneaking Syntax into Transformer Language Models with Tree Regularization
Ananjan Nandi
Christopher D. Manning
Shikhar Murty
74
0
0
28 Nov 2024
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
64
0
0
27 Nov 2024
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Y. Fu
Yin Yu
Xiaotian Han
Runchao Li
Xianxuan Long
Haotian Yu
Pan Li
SyDa
67
0
0
25 Nov 2024
IterIS: Iterative Inference-Solving Alignment for LoRA Merging
Hongxu Chen
Runshi Li
Bowei Zhu
Zhen Wang
Long Chen
MoMe
98
0
0
21 Nov 2024
PatentEdits: Framing Patent Novelty as Textual Entailment
Ryan Lee
Alexander Spangher
Xuezhe Ma
66
0
0
20 Nov 2024
Gradual Fine-Tuning with Graph Routing for Multi-Source Unsupervised Domain Adaptation
Yao Ma
Samuel Louvan
Z. Wang
28
0
0
11 Nov 2024
More Expressive Attention with Negative Weights
Ang Lv
Ruobing Xie
Shuaipeng Li
Jiayi Liao
Xingchen Sun
Zhanhui Kang
Di Wang
Rui Yan
42
0
0
11 Nov 2024
The Inherent Adversarial Robustness of Analog In-Memory Computing
Corey Lammie
Julian Büchel
A. Vasilopoulos
Manuel Le Gallo
Abu Sebastian
AAML
49
0
0
11 Nov 2024
Model Fusion through Bayesian Optimization in Language Model Fine-Tuning
Chaeyun Jang
Hyungi Lee
Jungtaek Kim
Juho Lee
MoMe
53
0
0
11 Nov 2024
The Empirical Impact of Data Sanitization on Language Models
Anwesan Pal
Radhika Bhargava
Kyle Hinsz
Jacques Esterhuizen
Sudipta Bhattacharya
29
0
0
08 Nov 2024
Confidence Calibration of Classifiers with Many Classes
Adrien LeCoz
Stéphane Herbin
Faouzi Adjed
UQCV
37
1
0
05 Nov 2024
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
Xingtai Lv
Ning Ding
Kaiyan Zhang
Ermo Hua
Ganqu Cui
Bowen Zhou
42
1
0
04 Nov 2024
Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior
Mingxuan Zhang
Y. Sun
F. Liang
34
0
0
01 Nov 2024
A Simple Remedy for Dataset Bias via Self-Influence: A Mislabeled Sample Perspective
Yeonsung Jung
Jaeyun Song
J. Yang
Jin-Hwa Kim
Sung-Yub Kim
Eunho Yang
47
0
0
01 Nov 2024
P-Masking: Power Law Masking Improves Multi-attribute Controlled Generation
Mohamed Elgaar
Hadi Amiri
AI4CE
36
0
0
31 Oct 2024
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Jiayi Wang
Yao Lu
Maurice Weber
Max Ryabinin
Yihong Chen
Raphael Tang
Pontus Stenetorp
LRM
47
1
0
31 Oct 2024
Larger models yield better results? Streamlined severity classification of ADHD-related concerns using BERT-based knowledge distillation
Ahmed Akib Jawad Karim
Kazi Hafiz Md. Asad
Md. Golam Rabiul Alam
AI4MH
47
2
0
30 Oct 2024
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution
Shikha Bordia
AILaw
47
0
0
30 Oct 2024
Focus On This, Not That! Steering LLMs With Adaptive Feature Specification
Tom A. Lamb
Adam Davies
Alasdair Paren
Philip Torr
Francesco Pinto
52
0
0
30 Oct 2024
Improving In-Context Learning with Small Language Model Ensembles
M. Mehdi Mojarradi
Lingyi Yang
Robert McCraith
Adam Mahdi
36
1
0
29 Oct 2024
LoRA vs Full Fine-tuning: An Illusion of Equivalence
Reece Shuttleworth
Jacob Andreas
Antonio Torralba
Pratyusha Sharma
35
10
0
28 Oct 2024
uOttawa at LegalLens-2024: Transformer-based Classification Experiments
Nima Meghdadi
Diana Inkpen
AILaw
21
0
0
28 Oct 2024
LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment
Ge Yang
Changyi He
J. Guo
Jianyu Wu
Yifu Ding
Aishan Liu
Haotong Qin
Pengliang Ji
Xianglong Liu
MQ
33
4
0
28 Oct 2024
Relation-based Counterfactual Data Augmentation and Contrastive Learning for Robustifying Natural Language Inference Models
H. Yang
Sseung-won Hwang
Jungmin So
40
0
0
28 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
40
2
0
28 Oct 2024
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Haoyu Song
Wenbo Zhang
Kaiyan Zhang
Ting Liu
32
3
0
26 Oct 2024
Model merging with SVD to tie the Knots
George Stoica
Pratik Ramesh
B. Ecsedi
Leshem Choshen
Judy Hoffman
MoMe
39
9
0
25 Oct 2024
Can Stories Help LLMs Reason? Curating Information Space Through Narrative
Vahid Sadiri Javadi
Johanne R. Trippas
Yash Kumar Lal
Lucie Flek
43
1
0
25 Oct 2024
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance
Omer Nahum
Nitay Calderon
Orgad Keller
Idan Szpektor
Roi Reichart
27
2
0
24 Oct 2024
Task Calibration: Calibrating Large Language Models on Inference Tasks
Yingjie Li
Yun Luo
Xiaotian Xie
Yue Zhang
LRM
21
0
0
24 Oct 2024
Atomic Fact Decomposition Helps Attributed Question Answering
Zhichao Yan
J. Wang
Jiaoyan Chen
Xiaoli Li
Ru Li
Jeff Z. Pan
KELM
HILM
36
0
0
22 Oct 2024
A Statistical Analysis of LLMs' Self-Evaluation Using Proverbs
Ryosuke Sonoda
Ramya Srinivasan
59
1
0
22 Oct 2024
Tethering Broken Themes: Aligning Neural Topic Models with Labels and Authors
Mayank Nagda
Phil Ostheimer
Sophie Fellenz
51
0
0
22 Oct 2024
From Tokens to Materials: Leveraging Language Models for Scientific Discovery
Yuwei Wan
Tong Xie
Nan Wu
Wenjie Zhang
Chunyu Kit
B. Hoex
27
0
0
21 Oct 2024
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning
Arijit Das
26
1
0
21 Oct 2024
Towards Optimal Adapter Placement for Efficient Transfer Learning
Aleksandra I. Nowak
Otniel-Bogdan Mercea
Anurag Arnab
Jonas Pfeiffer
Yann N. Dauphin
Utku Evci
28
0
0
21 Oct 2024
Previous
1
2
3
4
5
6
...
53
54
55
Next