ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.05426
  4. Cited By
A Broad-Coverage Challenge Corpus for Sentence Understanding through
  Inference
v1v2v3v4 (latest)

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference

18 April 2017
Adina Williams
Nikita Nangia
Samuel R. Bowman
ArXiv (abs)PDFHTML

Papers citing "A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference"

50 / 2,772 papers shown
Title
Training Task Experts through Retrieval Based Distillation
Training Task Experts through Retrieval Based Distillation
Jiaxin Ge
Xueying Jia
Vijay Viswanathan
Hongyin Luo
Graham Neubig
86
3
0
07 Jul 2024
Hallucination Detection: Robustly Discerning Reliable Answers in Large
  Language Models
Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models
Yuyan Chen
Qiang Fu
Yichen Yuan
Zhihao Wen
Ge Fan
Dayiheng Liu
Dongmei Zhang
Zhixu Li
Yanghua Xiao
HILM
74
77
0
04 Jul 2024
MAPO: Boosting Large Language Model Performance with Model-Adaptive
  Prompt Optimization
MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
Yuyan Chen
Zhihao Wen
Ge Fan
Zhengyu Chen
Wei Wu
Dayiheng Liu
Zhixu Li
Bang Liu
Yanghua Xiao
100
20
0
04 Jul 2024
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language
  Models
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
Ying Zhang
Ziheng Yang
Shufan Ji
KELM
51
1
0
03 Jul 2024
Croppable Knowledge Graph Embedding
Croppable Knowledge Graph Embedding
Yushan Zhu
Wen Zhang
Zhiqiang Liu
Yin Hua
Lei Liang
H. Chen
81
0
0
03 Jul 2024
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
143
0
0
02 Jul 2024
EconNLI: Evaluating Large Language Models on Economics Reasoning
EconNLI: Evaluating Large Language Models on Economics Reasoning
Yue Guo
Yi Yang
49
5
0
01 Jul 2024
LLM Uncertainty Quantification through Directional Entailment Graph and
  Claim Level Response Augmentation
LLM Uncertainty Quantification through Directional Entailment Graph and Claim Level Response Augmentation
Longchao Da
Tiejin Chen
Lu Cheng
Hua Wei
97
13
0
01 Jul 2024
Exploring Advanced Large Language Models with LLMsuite
Exploring Advanced Large Language Models with LLMsuite
Giorgio Roffo
LLMAG
36
0
0
01 Jul 2024
How Does Overparameterization Affect Features?
How Does Overparameterization Affect Features?
Ahmet Cagri Duzgun
Samy Jelassi
Yuanzhi Li
47
0
0
01 Jul 2024
Self-Translate-Train: A Simple but Strong Baseline for Cross-lingual
  Transfer of Large Language Models
Self-Translate-Train: A Simple but Strong Baseline for Cross-lingual Transfer of Large Language Models
Ryokan Ri
Shun Kiyono
Sho Takase
SyDa
47
0
0
29 Jun 2024
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Omer Goldman
Alon Jacovi
Aviv Slobodkin
Aviya Maimon
Ido Dagan
Reut Tsarfaty
124
11
0
29 Jun 2024
DataGen: Unified Synthetic Dataset Generation via Large Language Models
DataGen: Unified Synthetic Dataset Generation via Large Language Models
Yue Huang
Siyuan Wu
Chujie Gao
Dongping Chen
Qihui Zhang
...
Tianyi Zhou
Xiangliang Zhang
Jianfeng Gao
Chaowei Xiao
Lichao Sun
SyDa
125
20
0
27 Jun 2024
Weak Reward Model Transforms Generative Models into Robust Causal Event
  Extraction Systems
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems
Italo Luis da Silva
Hanqi Yan
Lin Gui
Yulan He
CML
109
0
0
26 Jun 2024
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for
  Memory-Efficient Large Language Models Fine-Tuning
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning
Yifan Yang
Kai Zhen
Ershad Banijamal
Athanasios Mouchtaris
Zheng Zhang
73
9
0
26 Jun 2024
Decoding with Limited Teacher Supervision Requires Understanding When to
  Trust the Teacher
Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher
Hyunjong Ok
Jegwang Ryu
Jaeho Lee
47
0
0
26 Jun 2024
ViANLI: Adversarial Natural Language Inference for Vietnamese
ViANLI: Adversarial Natural Language Inference for Vietnamese
Tin Van Huynh
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
64
0
0
25 Jun 2024
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse
  Gradients
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients
Aashiq Muhamed
Oscar Li
David Woodruff
Mona Diab
Virginia Smith
101
13
0
25 Jun 2024
"Seeing the Big through the Small": Can LLMs Approximate Human Judgment
  Distributions on NLI from a Few Explanations?
"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Beiduo Chen
Xinpeng Wang
Siyao Peng
Robert Litschko
Anna Korhonen
Barbara Plank
115
9
0
25 Jun 2024
Data Debiasing with Datamodels (D3M): Improving Subgroup Robustness via
  Data Selection
Data Debiasing with Datamodels (D3M): Improving Subgroup Robustness via Data Selection
Saachi Jain
Kimia Hamidieh
Kristian Georgiev
Andrew Ilyas
Marzyeh Ghassemi
Aleksander Madry
84
3
0
24 Jun 2024
Exploring Factual Entailment with NLI: A News Media Study
Exploring Factual Entailment with NLI: A News Media Study
Guy Mor-Lan
Effi Levi
102
0
0
24 Jun 2024
Towards Fine-Grained Citation Evaluation in Generated Text: A
  Comparative Analysis of Faithfulness Metrics
Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics
Weijia Zhang
Mohammad Aliannejadi
Yifei Yuan
Jiahuan Pei
Jia-Hong Huang
Evangelos Kanoulas
HILM
88
13
0
21 Jun 2024
Optimised Grouped-Query Attention Mechanism for Transformers
Optimised Grouped-Query Attention Mechanism for Transformers
Yuang Chen
Cheng Zhang
Xitong Gao
Robert D. Mullins
George A. Constantinides
Yiren Zhao
75
9
0
21 Jun 2024
Depth $F_1$: Improving Evaluation of Cross-Domain Text Classification by
  Measuring Semantic Generalizability
Depth F1F_1F1​: Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability
Parker Seegmiller
Joseph Gatto
S. Preum
VLM
93
0
0
20 Jun 2024
Revealing Vision-Language Integration in the Brain with Multimodal
  Networks
Revealing Vision-Language Integration in the Brain with Multimodal Networks
Vighnesh Subramaniam
C. Conwell
Christopher Wang
Gabriel Kreiman
Boris Katz
Ignacio Cases
Andrei Barbu
102
12
0
20 Jun 2024
Information Guided Regularization for Fine-tuning Language Models
Information Guided Regularization for Fine-tuning Language Models
Mandar Sharma
Nikhil Muralidhar
Shengzhe Xu
Raquib Bin Yousuf
Naren Ramakrishnan
106
0
0
20 Jun 2024
When Parts are Greater Than Sums: Individual LLM Components Can
  Outperform Full Models
When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models
Ting-Yun Chang
Jesse Thomason
Robin Jia
112
5
0
19 Jun 2024
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+
  Languages
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages
Fabian David Schmidt
Philipp Borchert
Ivan Vulić
Goran Glavaš
78
6
0
18 Jun 2024
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
Tianyuan Zou
Yang Liu
Ziwei Sun
Jianqing Zhang
Jingjing Liu
Ya-Qin Zhang
103
3
0
18 Jun 2024
GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory
GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory
Haoze Wu
Zihan Qiu
Zili Wang
Hang Zhao
Jie Fu
MoE
100
3
0
18 Jun 2024
Knowledge Fusion By Evolving Weights of Language Models
Knowledge Fusion By Evolving Weights of Language Models
Guodong DU
Yiyao Cao
Hanting Liu
Runhua Jiang
Shuyang Yu
Yifei Guo
Sim Kuan Goh
Jing Li
MoMe
91
15
0
18 Jun 2024
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to
  Address Shortcut Shifts in Natural Language Understanding
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding
Ukyo Honda
Tatsushi Oka
Peinan Zhang
Masato Mita
90
1
0
17 Jun 2024
A Systematic Analysis of Large Language Models as Soft Reasoners: The
  Case of Syllogistic Inferences
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences
Leonardo Bertolazzi
Albert Gatt
Raffaella Bernardi
LRMELM
39
6
0
17 Jun 2024
An Empirical Investigation of Matrix Factorization Methods for
  Pre-trained Transformers
An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers
Ashim Gupta
Sina Mahdipour Saravani
P. Sadayappan
Vivek Srikumar
57
2
0
17 Jun 2024
Self-training Large Language Models through Knowledge Detection
Self-training Large Language Models through Knowledge Detection
Wei Jie Yeo
Teddy Ferdinan
Przemyslaw Kazienko
Ranjan Satapathy
Erik Cambria
104
10
0
17 Jun 2024
FamiCom: Further Demystifying Prompts for Language Models with
  Task-Agnostic Performance Estimation
FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Bangzheng Li
Ben Zhou
Xingyu Fu
Fei Wang
Dan Roth
Muhao Chen
83
6
0
17 Jun 2024
Evaluating the Generalization Ability of Quantized LLMs: Benchmark,
  Analysis, and Toolbox
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox
Yijun Liu
Yuan Meng
Fang Wu
Shenhao Peng
Hang Yao
Chaoyu Guan
Chen Tang
Xinzhu Ma
Zhi Wang
Wenwu Zhu
MQ
110
8
0
15 Jun 2024
Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic
  Textual Similarity
Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity
Bowen Zhang
Chunping Li
57
0
0
14 Jun 2024
Self-Knowledge Distillation for Learning Ambiguity
Self-Knowledge Distillation for Learning Ambiguity
Hancheol Park
Soyeong Jeong
Sukmin Cho
Jong C. Park
71
1
0
14 Jun 2024
Unraveling the Mechanics of Learning-Based Demonstration Selection for
  In-Context Learning
Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning
Hui Liu
Wenya Wang
Hao Sun
Chris Xing Tian
Chenqi Kong
Xin Dong
Haoliang Li
78
6
0
14 Jun 2024
ReadCtrl: Personalizing text generation with readability-controlled
  instruction learning
ReadCtrl: Personalizing text generation with readability-controlled instruction learning
Hieu Tran
Zonghai Yao
Lingxi Li
Hong-ye Yu
81
2
0
13 Jun 2024
When Linear Attention Meets Autoregressive Decoding: Towards More
  Effective and Efficient Linearized Large Language Models
When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
Haoran You
Yichao Fu
Zheng Wang
Amir Yazdanbakhsh
Yingyan Celine Lin
133
4
0
11 Jun 2024
Transferring Knowledge from Large Foundation Models to Small Downstream
  Models
Transferring Knowledge from Large Foundation Models to Small Downstream Models
Shikai Qiu
Boran Han
Danielle C. Maddix
Shuai Zhang
Yuyang Wang
Andrew Gordon Wilson
57
4
0
11 Jun 2024
Decipherment-Aware Multilingual Learning in Jointly Trained Language
  Models
Decipherment-Aware Multilingual Learning in Jointly Trained Language Models
Grandee Lee
76
0
0
11 Jun 2024
Symmetric Dot-Product Attention for Efficient Training of BERT Language
  Models
Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
Martin Courtois
Malte Ostendorff
Leonhard Hennig
Georg Rehm
91
2
0
10 Jun 2024
Multi-Prompting Decoder Helps Better Language Understanding
Multi-Prompting Decoder Helps Better Language Understanding
Zifeng Cheng
Zhaoling Chen
Zhiwei Jiang
Yafeng Yin
Shiping Ge
Shiping Ge
Qing Gu
AI4CE
100
1
0
10 Jun 2024
GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge?
GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge?
Dayoon Ko
Jinyoung Kim
Hahyeon Choi
Gunhee Kim
CLLRALMKELM
74
6
0
09 Jun 2024
Investigating and Addressing Hallucinations of LLMs in Tasks Involving
  Negation
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation
Neeraj Varshney
Satyam Raj
Venkatesh Mishra
Agneet Chatterjee
Ritika Sarkar
Amir Saeidi
Chitta Baral
LRM
96
11
0
08 Jun 2024
Advancing Semantic Textual Similarity Modeling: A Regression Framework
  with Translated ReLU and Smooth K2 Loss
Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss
Bowen Zhang
Chunping Li
58
2
0
08 Jun 2024
SuperPos-Prompt: Enhancing Soft Prompt Tuning of Language Models with
  Superposition of Multi Token Embeddings
SuperPos-Prompt: Enhancing Soft Prompt Tuning of Language Models with Superposition of Multi Token Embeddings
MohammadAli SadraeiJavaeri
Ehsaneddin Asgari
A. Mchardy
Hamid R. Rabiee
VLMAAML
73
0
0
07 Jun 2024
Previous
123...678...545556
Next