ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.02847
  4. Cited By
A Simple Method for Commonsense Reasoning
v1v2 (latest)

A Simple Method for Commonsense Reasoning

7 June 2018
Trieu H. Trinh
Quoc V. Le
    LRMReLM
ArXiv (abs)PDFHTML

Papers citing "A Simple Method for Commonsense Reasoning"

50 / 291 papers shown
Title
CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class
  Classification
CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification
Yongbin Li
Canran Xu
Guodong Long
Tao Shen
Chongyang Tao
Jing Jiang
73
1
0
11 Nov 2022
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language
  Models
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models
Yizhi Li
Ge Zhang
Bohao Yang
Chenghua Lin
Shi Wang
Anton Ragni
Jie Fu
58
10
0
05 Nov 2022
Semantic Novelty Detection and Characterization in Factual Text
  Involving Named Entities
Semantic Novelty Detection and Characterization in Factual Text Involving Named Entities
Nianzu Ma
Sahisnu Mazumder
A. Politowicz
Bing-Quan Liu
Eric Robertson
Scott Grigsby
77
1
0
31 Oct 2022
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal
  Proofs
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs
Albert Q. Jiang
Sean Welleck
Jin Peng Zhou
Wenda Li
Jiacheng Liu
M. Jamnik
Timothée Lacroix
Yuhuai Wu
Guillaume Lample
AIMat
155
181
0
21 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
93
14
0
06 Oct 2022
COMPS: Conceptual Minimal Pair Sentences for testing Robust Property
  Knowledge and its Inheritance in Pre-trained Language Models
COMPS: Conceptual Minimal Pair Sentences for testing Robust Property Knowledge and its Inheritance in Pre-trained Language Models
Kanishka Misra
Julia Taylor Rayz
Allyson Ettinger
132
10
0
05 Oct 2022
Less is More: Task-aware Layer-wise Distillation for Language Model
  Compression
Less is More: Task-aware Layer-wise Distillation for Language Model Compression
Chen Liang
Simiao Zuo
Qingru Zhang
Pengcheng He
Weizhu Chen
Tuo Zhao
VLM
108
74
0
04 Oct 2022
WinoDict: Probing language models for in-context word acquisition
WinoDict: Probing language models for in-context word acquisition
Julian Martin Eisenschlos
Jeremy R. Cole
Fangyu Liu
William W. Cohen
KELM
58
13
0
25 Sep 2022
An Information Minimization Based Contrastive Learning Model for
  Unsupervised Sentence Embeddings Learning
An Information Minimization Based Contrastive Learning Model for Unsupervised Sentence Embeddings Learning
Shaobin Chen
Jie Zhou
Yuling Sun
Liang He
SSL
78
7
0
22 Sep 2022
Adapting Pretrained Text-to-Text Models for Long Text Sequences
Adapting Pretrained Text-to-Text Models for Long Text Sequences
Wenhan Xiong
Anchit Gupta
Shubham Toshniwal
Yashar Mehdad
Wen-tau Yih
RALMVLM
120
31
0
21 Sep 2022
Probing for Understanding of English Verb Classes and Alternations in
  Large Pre-trained Language Models
Probing for Understanding of English Verb Classes and Alternations in Large Pre-trained Language Models
David K. Yi
James V. Bruno
Jiayu Han
Peter Zukerman
Shane Steinert-Threlkeld
32
1
0
11 Sep 2022
Elaboration-Generating Commonsense Question Answering at Scale
Elaboration-Generating Commonsense Question Answering at Scale
Wenya Wang
Vivek Srikumar
Hannaneh Hajishirzi
Noah A. Smith
ELMLRM
76
15
0
02 Sep 2022
Why Do Neural Language Models Still Need Commonsense Knowledge to Handle
  Semantic Variations in Question Answering?
Why Do Neural Language Models Still Need Commonsense Knowledge to Handle Semantic Variations in Question Answering?
Sunjae Kwon
Cheongwoong Kang
Jiyeon Han
Jaesik Choi
59
0
0
01 Sep 2022
Evaluate Confidence Instead of Perplexity for Zero-shot Commonsense
  Reasoning
Evaluate Confidence Instead of Perplexity for Zero-shot Commonsense Reasoning
Letian Peng
Z. Li
Hai Zhao
ReLMLRM
45
1
0
23 Aug 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and
  Vision-Language Tasks
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLMVLMViT
157
645
0
22 Aug 2022
Type-enriched Hierarchical Contrastive Strategy for Fine-Grained Entity
  Typing
Type-enriched Hierarchical Contrastive Strategy for Fine-Grained Entity Typing
Xinyu Zuo
Haijin Liang
Ning Jing
Shuang Zeng
Zhou Fang
Yu Luo
66
6
0
22 Aug 2022
Treeformer: Dense Gradient Trees for Efficient Attention Computation
Treeformer: Dense Gradient Trees for Efficient Attention Computation
Lovish Madaan
Srinadh Bhojanapalli
Himanshu Jain
Prateek Jain
64
7
0
18 Aug 2022
Towards Open-vocabulary Scene Graph Generation with Prompt-based
  Finetuning
Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning
Tao He
Lianli Gao
Jingkuan Song
Yuan-Fang Li
VLM
88
53
0
17 Aug 2022
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers
M. Lewis
Younes Belkada
Luke Zettlemoyer
MQ
147
666
0
15 Aug 2022
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language
  Models
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Margaret Li
Suchin Gururangan
Tim Dettmers
M. Lewis
Tim Althoff
Noah A. Smith
Luke Zettlemoyer
MoMe
110
154
0
05 Aug 2022
Multi Resolution Analysis (MRA) for Approximate Self-Attention
Multi Resolution Analysis (MRA) for Approximate Self-Attention
Zhanpeng Zeng
Sourav Pal
Jeffery Kline
G. Fung
Vikas Singh
40
8
0
21 Jul 2022
Language models show human-like content effects on reasoning tasks
Language models show human-like content effects on reasoning tasks
Ishita Dasgupta
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Hannah R. Sheahan
Antonia Creswell
D. Kumaran
James L. McClelland
Felix Hill
ReLMLRM
136
188
0
14 Jul 2022
Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning
Memory-Based Label-Text Tuning for Few-Shot Class-Incremental Learning
Jinze Li
Yan Bai
Yihang Lou
Xiongkun Linghu
Jianzhong He
Shaoyun Xu
Tao Bai
VLMCLL
82
2
0
03 Jul 2022
Solving Quantitative Reasoning Problems with Language Models
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
...
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLMELMLRM
222
865
0
29 Jun 2022
Emergent Abilities of Large Language Models
Emergent Abilities of Large Language Models
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
...
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELMReLMLRM
320
2,524
0
15 Jun 2022
On the Role of Bidirectionality in Language Model Pre-Training
On the Role of Bidirectionality in Language Model Pre-Training
Mikel Artetxe
Jingfei Du
Naman Goyal
Luke Zettlemoyer
Ves Stoyanov
200
17
0
24 May 2022
On Advances in Text Generation from Images Beyond Captioning: A Case
  Study in Self-Rationalization
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization
Shruti Palaskar
Akshita Bhagia
Yonatan Bisk
Florian Metze
A. Black
Ana Marasović
84
4
0
24 May 2022
Life after BERT: What do Other Muppets Understand about Language?
Life after BERT: What do Other Muppets Understand about Language?
Vladislav Lialin
Kevin Zhao
Namrata Shivagunde
Anna Rumshisky
110
6
0
21 May 2022
OPT: Open Pre-trained Transformer Language Models
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLMOSLMAI4CE
392
3,707
0
02 May 2022
Modern Baselines for SPARQL Semantic Parsing
Modern Baselines for SPARQL Semantic Parsing
Debayan Banerjee
Pranav Ajit Nair
Jivat Neet Kaur
Ricardo Usbeck
Chris Biemann
85
32
0
27 Apr 2022
Decorate the Examples: A Simple Method of Prompt Design for Biomedical
  Relation Extraction
Decorate the Examples: A Simple Method of Prompt Design for Biomedical Relation Extraction
Hui-Syuan Yeh
Thomas Lavergne
Pierre Zweigenbaum
51
12
0
21 Apr 2022
Language Contamination Helps Explain the Cross-lingual Capabilities of
  English Pretrained Models
Language Contamination Helps Explain the Cross-lingual Capabilities of English Pretrained Models
Terra Blevins
Luke Zettlemoyer
151
92
0
17 Apr 2022
Probing Script Knowledge from Pre-Trained Models
Probing Script Knowledge from Pre-Trained Models
Zijian Jin
Xingyu Zhang
Mo Yu
Lifu Huang
60
5
0
16 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding
  Language Models with Model Generated Signals
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
118
32
0
13 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
95
9
0
11 Apr 2022
Pretraining Text Encoders with Adversarial Mixture of Training Signal
  Generators
Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
MoE
78
16
0
07 Apr 2022
minicons: Enabling Flexible Behavioral and Representational Analyses of
  Transformer Language Models
minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language Models
Kanishka Misra
84
63
0
24 Mar 2022
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through
  Regularized Self-Attention
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention
Yang Liu
Jiaxiang Liu
L. Chen
Yuxiang Lu
Shi Feng
Zhida Feng
Yu Sun
Hao Tian
Huancheng Wu
Hai-feng Wang
70
9
0
23 Mar 2022
Prototypical Verbalizer for Prompt-based Few-shot Tuning
Prototypical Verbalizer for Prompt-based Few-shot Tuning
Ganqu Cui
Shengding Hu
Ning Ding
Longtao Huang
Zhiyuan Liu
VLM
67
99
0
18 Mar 2022
Efficient Language Modeling with Sparse all-MLP
Efficient Language Modeling with Sparse all-MLP
Ping Yu
Mikel Artetxe
Myle Ott
Sam Shleifer
Hongyu Gong
Ves Stoyanov
Xian Li
MoE
88
11
0
14 Mar 2022
Speciesist Language and Nonhuman Animal Bias in English Masked Language
  Models
Speciesist Language and Nonhuman Animal Bias in English Masked Language Models
Masashi Takeshita
Rafal Rzepka
K. Araki
73
7
0
10 Mar 2022
Deep Lexical Hypothesis: Identifying personality structure in natural
  language
Deep Lexical Hypothesis: Identifying personality structure in natural language
A. Cutler
D. Condon
65
31
0
04 Mar 2022
Large-Scale Hate Speech Detection with Cross-Domain Transfer
Large-Scale Hate Speech Detection with Cross-Domain Transfer
Cagri Toraman
Furkan Şahinuç
E. Yilmaz
123
63
0
02 Mar 2022
Maximizing Communication Efficiency for Large-scale Training via 0/1
  Adam
Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam
Yucheng Lu
Conglong Li
Minjia Zhang
Christopher De Sa
Yuxiong He
OffRLAI4CE
91
21
0
12 Feb 2022
AdaPrompt: Adaptive Model Training for Prompt-based NLP
AdaPrompt: Adaptive Model Training for Prompt-based NLP
Yulong Chen
Yang Liu
Li Dong
Shuohang Wang
Chenguang Zhu
Michael Zeng
Yue Zhang
VLM
102
48
0
10 Feb 2022
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A
  Large-Scale Generative Language Model
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Shaden Smith
M. Patwary
Brandon Norick
P. LeGresley
Samyam Rajbhandari
...
Mohammad Shoeybi
Yuxiong He
Michael Houston
Saurabh Tiwary
Bryan Catanzaro
MoE
165
744
0
28 Jan 2022
Instance-aware Prompt Learning for Language Understanding and Generation
Instance-aware Prompt Learning for Language Understanding and Generation
Feihu Jin
Jinliang Lu
Jiajun Zhang
Chengqing Zong
57
33
0
18 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
97
159
0
17 Jan 2022
The Defeat of the Winograd Schema Challenge
The Defeat of the Winograd Schema Challenge
Vid Kocijan
E. Davis
Thomas Lukasiewicz
G. Marcus
L. Morgenstern
83
43
0
07 Jan 2022
Efficient Large Scale Language Modeling with Mixtures of Experts
Efficient Large Scale Language Modeling with Mixtures of Experts
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
237
201
0
20 Dec 2021
Previous
123456
Next