Evaluating Commonsense in Pre-trained Language Models


27 November 2019
Xuhui Zhou
Yue Zhang
Leyang Cui
Dandan Huang
    AI4MH
    LRM

Papers citing "Evaluating Commonsense in Pre-trained Language Models"

Title
TGEA: An Error-Annotated Dataset and Benchmark Tasks for Text Generation from Pretrained Language Models
Jie He
Bo Peng
Yi-Lun Liao
Qun Liu
Deyi Xiong
68
8
0
06 Mar 2025
The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation
Jie He
Tao Wang
Deyi Xiong
Qun Liu
ELM
LRM
82
27
0
05 Mar 2025
Shades of Zero: Distinguishing Impossibility from Inconceivability
Jennifer Hu
Felix Sosa
T. Ullman
46
0
0
27 Feb 2025
Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution
Tatiana Anikina
Arne Binder
David Harbecke
Stalin Varanasi
Leonhard Hennig
Simon Ostermann
Sebastian Möller
Josef van Genabith
96
0
0
31 Jan 2025
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures
Qi Cheng
Michael Boratko
Pranay Kumar Yelugam
T. O’Gorman
Nalini Singh
Andrew McCallum
X. Li
ELM
LRM
40
3
0
06 Jun 2024
CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark Tasks for Chinese Large Language Models
Dan Shi
Chaobin You
Jian-Tao Huang
Taihao Li
Deyi Xiong
LRM
30
0
0
20 Dec 2023
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Minje Choi
Jiaxin Pei
Sagar Kumar
Chang Shu
David Jurgens
ALM
LLMAG
35
69
0
24 May 2023
Harvesting Event Schemas from Large Language Models
Jialong Tang
Hongyu Lin
Zhuoqun Li
Yaojie Lu
Xianpei Han
Le Sun
26
4
0
12 May 2023
ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models
Thilini Wijesiriwardene
Ruwan Wickramarachchi
Bimal Gajera
Shreeyash Mukul Gowaikar
Chandan Gupta
Aman Chadha
Aishwarya N. Reganti
Amit P. Sheth
Amitava Das
ELM
25
14
0
08 May 2023
Just Tell Me: Prompt Engineering in Business Process Management
Kiran Busch
Alexander Rochlitzer
Diana Sola
Henrik Leopold
31
29
0
14 Apr 2023
Understanding Finetuning for Factual Knowledge Extraction from Language Models
Mehran Kazemi
Sid Mittal
Deepak Ramachandran
KELM
34
10
0
26 Jan 2023
Contrastive Learning Reduces Hallucination in Conversations
Weiwei Sun
Zhengliang Shi
Shen Gao
Pengjie Ren
Maarten de Rijke
Z. Ren
42
62
0
20 Dec 2022
GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models
Da Yin
Hritik Bansal
Masoud Monajatipoor
Liunian Harold Li
Kai-Wei Chang
49
28
0
24 May 2022
Probing Script Knowledge from Pre-Trained Models
Zijian Jin
Xingyu Zhang
Mo Yu
Lifu Huang
24
4
0
16 Apr 2022
Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Prajjwal Bhargava
Vincent Ng
ReLM
LRM
49
62
0
28 Jan 2022
Zero-shot Commonsense Question Answering with Cloze Translation and Consistency Optimization
Zi-Yi Dou
Nanyun Peng
ELM
17
26
0
01 Jan 2022
Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases
Shrimai Prabhumoye
Rafal Kocielnik
M. Shoeybi
Anima Anandkumar
Bryan Catanzaro
35
20
0
15 Dec 2021
RuleBert: Teaching Soft Rules to Pre-trained Language Models
Mohammed Saeed
N. Ahmadi
Preslav Nakov
Paolo Papotti
LRM
253
31
0
24 Sep 2021
AES Systems Are Both Overstable And Oversensitive: Explaining Why And Proposing Defenses
Yaman Kumar Singla
Swapnil Parekh
Somesh Singh
Junjie Li
R. Shah
Changyou Chen
AAML
41
14
0
24 Sep 2021
Implicit Premise Generation with Discourse-aware Commonsense Knowledge Models
Tuhin Chakrabarty
Aadit Trivedi
Smaranda Muresan
LRM
39
13
0
11 Sep 2021
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
58
816
0
14 Jun 2021
Can Transformer Language Models Predict Psychometric Properties?
Antonio Laverghetta
Animesh Nighojkar
Jamshidbek Mirzakhalov
John Licato
LM&MA
38
14
0
12 Jun 2021
Can Generative Pre-trained Language Models Serve as Knowledge Bases for Closed-book QA?
Cunxiang Wang
Pai Liu
Yue Zhang
RALM
42
80
0
03 Jun 2021
Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema
Yanai Elazar
Hongming Zhang
Yoav Goldberg
Dan Roth
ReLM
LRM
45
44
0
16 Apr 2021
Probing Across Time: What Does RoBERTa Know and When?
Leo Z. Liu
Yizhong Wang
Jungo Kasai
Hannaneh Hajishirzi
Noah A. Smith
KELM
13
80
0
16 Apr 2021
Relational World Knowledge Representation in Contextual Language Models: A Review
Tara Safavi
Danai Koutra
KELM
38
51
0
12 Apr 2021
ZJUKLAB at SemEval-2021 Task 4: Negative Augmentation with Language Model for Reading Comprehension of Abstract Meaning
Xin Xie
Xiangnan Chen
Xiang Chen
Yong Wang
Ningyu Zhang
Shumin Deng
Huajun Chen
42
2
0
25 Feb 2021
LIREx: Augmenting Language Inference with Relevant Explanation
Xinyan Zhao
V. Vydiswaran
LRM
28
37
0
16 Dec 2020
SemEval-2020 Task 4: Commonsense Validation and Explanation
Cunxiang Wang
Shuailong Liang
Yili Jin
Yilong Wang
Xiao-Dan Zhu
Yue Zhang
LRM
25
98
0
01 Jul 2020
Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models
Jize Cao
Zhe Gan
Yu Cheng
Licheng Yu
Yen-Chun Chen
Jingjing Liu
VLM
22
127
0
15 May 2020
RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms
Pei Zhou
Rahul Khanna
Seyeon Lee
Bill Yuchen Lin
Daniel E. Ho
Jay Pujara
Xiang Ren
ReLM
21
36
0
02 May 2020
Align, Mask and Select: A Simple Method for Incorporating Commonsense Knowledge into Language Representation Models
Zhiquan Ye
Qian Chen
Wen Wang
Zhenhua Ling
27
68
0
19 Aug 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018