1806.02847
A Simple Method for Commonsense Reasoning
7 June 2018
Trieu H. Trinh
Quoc V. Le
LRM
ReLM
Papers citing "A Simple Method for Commonsense Reasoning"
50 / 291 papers shown
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
37
0
0
31 Mar 2025
Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success
Sophie Hao
ELM
AI4CE
56
0
0
25 Mar 2025
The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation
Jie He
Tao Wang
Deyi Xiong
Qun Liu
ELM
LRM
82
27
0
05 Mar 2025
Towards the Development of Balanced Synthetic Data for Correcting Grammatical Errors in Arabic: An Approach Based on Error Tagging Model and Synthetic Data Generating Model
Ahlam Alrehili
Areej Alhothali
81
0
0
07 Feb 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
19
0
17 Jan 2025
ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese
Yikang Liu
Yeting Shen
Hongao Zhu
Lilong Xu
Zhiheng Qian
...
Jialong Tang
Pei Zhang
Baosong Yang
Rui-cang Wang
Hai Hu
45
2
0
09 Nov 2024
PRIMO: Progressive Induction for Multi-hop Open Rule Generation
Jianyu Liu
Sheng Bi
Guilin Qi
29
0
0
02 Nov 2024
LoRA vs Full Fine-tuning: An Illusion of Equivalence
Reece Shuttleworth
Jacob Andreas
Antonio Torralba
Pratyusha Sharma
37
10
0
28 Oct 2024
Fine-tuning foundational models to code diagnoses from veterinary health records
Mayla R. Boguslav
Adam Kiehl
David Kott
G. Joseph Strecker
Tracy Webb
Nadia Saklou
Terri Ward
Michael Kirby
LM&MA
30
0
0
19 Oct 2024
Solving the Challenge Set without Solving the Task: On Winograd Schemas as a Test of Pronominal Coreference Resolution
Ian Porada
Jackie C.K. Cheung
44
0
0
12 Oct 2024
Zero-shot Commonsense Reasoning over Machine Imagination
Hyuntae Park
Yeachan Kim
Jun-Hyung Park
S. Lee
ReLM
VLM
LRM
29
1
0
12 Oct 2024
BeanCounter: A low-toxicity, large-scale, and open dataset of business-oriented text
Siyan Wang
Bradford Levy
34
2
0
26 Sep 2024
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
Yuan Xin
Zehan Li
Ning Yu
Dingfan Chen
Mario Fritz
Michael Backes
Yang Zhang
PILM
MIACV
42
2
0
20 Aug 2024
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons
Yifei Wang
Yuheng Chen
Wanting Wen
Yu Sheng
Linjing Li
D. Zeng
KELM
47
5
0
06 Aug 2024
Look Hear: Gaze Prediction for Speech-directed Human Attention
Sounak Mondal
Seoyoung Ahn
Zhibo Yang
Niranjan Balasubramanian
Dimitris Samaras
G. Zelinsky
Minh Hoai
47
1
0
28 Jul 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
M. Shoeybi
Bryan Catanzaro
53
6
0
08 Jul 2024
YuLan: An Open-source Large Language Model
Yutao Zhu
Kun Zhou
Kelong Mao
Wentong Chen
Yiding Sun
...
Wenbing Huang
Ze-Feng Gao
Yueguo Chen
Weizheng Lu
Ji-Rong Wen
ALM
ELM
44
0
0
28 Jun 2024
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Guilherme Penedo
Hynek Kydlícek
Loubna Ben Allal
Anton Lozhkov
Margaret Mitchell
Colin Raffel
Leandro von Werra
Thomas Wolf
56
194
0
25 Jun 2024
Multi-Prompting Decoder Helps Better Language Understanding
Zifeng Cheng
Zhaoling Chen
Zhiwei Jiang
Yafeng Yin
Shiping Ge
Yuliang Liu
Qing Gu
AI4CE
50
1
0
10 Jun 2024
BERTs are Generative In-Context Learners
David Samuel
48
5
0
07 Jun 2024
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Zachary Ankner
Cody Blakeney
Kartik K. Sreenivasan
Max Marion
Matthew L. Leavitt
Mansheej Paul
43
24
0
30 May 2024
Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning
Phakphum Artkaew
LRM
27
0
0
28 May 2024
A Survey on Transformers in NLP with Focus on Efficiency
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
40
2
0
15 May 2024
Is Less More? Quality, Quantity and Context in Idiom Processing with Natural Language Models
Agne Knietaite
Adam Allsebrook
Anton Minkov
Adam Tomaszewski
Norbert Slinko
Richard Johnson
Thomas Pickard
Dylan Phelps
Aline Villavicencio
54
1
0
14 May 2024
Multi-Head Mixture-of-Experts
Xun Wu
Shaohan Huang
Wenhui Wang
Furu Wei
MoE
39
12
0
23 Apr 2024
SpaceByte: Towards Deleting Tokenization from Large Language Modeling
Kevin Slagle
37
3
0
22 Apr 2024
Large language models as oracles for instantiating ontologies with domain-specific knowledge
Giovanni Ciatto
Andrea Agiollo
Matteo Magnini
Andrea Omicini
37
7
0
05 Apr 2024
Predicting the Performance of Foundation Models via Agreement-on-the-Line
Aman Mehra
Rahul Saxena
Taeyoun Kim
Christina Baek
Zico Kolter
Aditi Raghunathan
UQCV
49
1
0
02 Apr 2024
On Zero-Shot Counterspeech Generation by LLMs
Punyajoy Saha
Aalok Agrawal
Abhik Jana
Chris Biemann
Animesh Mukherjee
43
12
0
22 Mar 2024
Abdelhak at SemEval-2024 Task 9 : Decoding Brainteasers, The Efficacy of Dedicated Models Versus ChatGPT
Abdelhak Kelious
Mounir Okirim
LRM
26
1
0
24 Feb 2024
Amplifying Training Data Exposure through Fine-Tuning with Pseudo-Labeled Memberships
Myung Gyo Oh
Hong Eun Ahn
L. Park
T.-H. Kwon
MIALM
AAML
37
0
0
19 Feb 2024
Common Sense Reasoning for Deepfake Detection
Yue Zhang
Ben Colman
Xiao Guo
Ali Shahriyari
Gaurav Bharaj
37
30
0
31 Jan 2024
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Pardis Sadat Zahraei
Ali Emami
27
6
0
31 Jan 2024
Unlearning Traces the Influential Training Data of Language Models
Masaru Isonuma
Ivan Titov
MU
29
7
0
26 Jan 2024
MambaByte: Token-free Selective State Space Model
Junxiong Wang
Tushaar Gangavarapu
Jing Nathan Yan
Alexander M. Rush
Mamba
44
37
0
24 Jan 2024
MatSciRE: Leveraging Pointer Networks to Automate Entity and Relation Extraction for Material Science Knowledge-base Construction
Ankan Mullick
Akash Ghosh
G. Chaitanya
Samir Ghui
Tapas Nayak
Seung-Cheol Lee
S. Bhattacharjee
Pawan Goyal
13
10
0
18 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
63
57
0
11 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
35
65
0
04 Jan 2024
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining
Jacob P. Portes
Alex Trott
Sam Havens
Daniel King
Abhinav Venigalla
Moin Nadeem
Nikhil Sardana
D. Khudia
Jonathan Frankle
26
16
0
29 Dec 2023
NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Peter West
Ronan Le Bras
Taylor Sorensen
Bill Yuchen Lin
Liwei Jiang
...
Khyathi Raghavi Chandu
Jack Hessel
Ashutosh Baheti
Chandra Bhagavatula
Yejin Choi
VLM
26
10
0
10 Dec 2023
SoUnD Framework: Analyzing (So)cial Representation in (Un)structured (D)ata
Mark Díaz
Sunipa Dev
Emily Reif
Remi Denton
Vinodkumar Prabhakaran
33
3
0
28 Nov 2023
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Aditi Jha
Sam Havens
Jeremey Dohmann
Alex Trott
Jacob P. Portes
ALM
19
11
0
22 Nov 2023
CASE: Commonsense-Augmented Score with an Expanded Answer Space
Wenkai Chen
Sahithya Ravi
Vered Shwartz
30
0
0
03 Nov 2023
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng-Wei Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
59
40
0
27 Oct 2023
Copyright Violations and Large Language Models
Antonia Karamolegkou
Jiaang Li
Li Zhou
Anders Sogaard
25
55
0
20 Oct 2023
Do Language Models Learn about Legal Entity Types during Pretraining?
Claire Barale
Michael Rovatsos
Nehal Bhuta
ELM
33
2
0
19 Oct 2023
Integrating Symbolic Reasoning into Neural Generative Models for Design Generation
Maxwell J. Jacobson
Yexiang Xue
NAI
38
1
0
13 Oct 2023
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mengzhou Xia
Tianyu Gao
Zhiyuan Zeng
Danqi Chen
40
268
0
10 Oct 2023
Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Chao-Han Huck Yang
Yile Gu
Yi-Chieh Liu
Shalini Ghosh
I. Bulyko
A. Stolcke
KELM
LRM
38
40
0
27 Sep 2023
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Thuat Nguyen
Chien Van Nguyen
Viet Dac Lai
Hieu Man
Nghia Trung Ngo
Franck Dernoncourt
Ryan A. Rossi
Thien Huu Nguyen
45
97
0
17 Sep 2023