ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.12701
  4. Cited By
Eliciting and Understanding Cross-Task Skills with Task-Level
  Mixture-of-Experts

Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts

25 May 2022
Qinyuan Ye
Juan Zha
Xiang Ren
    MoE
ArXivPDFHTML

Papers citing "Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts"

50 / 112 papers shown
Title
Language Models as Knowledge Bases?
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
571
2,670
0
03 Sep 2019
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense
  Reasoning
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense Reasoning
Lifu Huang
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
AIMat
RALM
LRM
112
454
0
31 Aug 2019
Reasoning Over Paragraph Effects in Situations
Reasoning Over Paragraph Effects in Situations
Kevin Lin
Oyvind Tafjord
Peter Clark
Matt Gardner
75
115
0
16 Aug 2019
Quoref: A Reading Comprehension Dataset with Questions Requiring
  Coreferential Reasoning
Quoref: A Reading Comprehension Dataset with Questions Requiring Coreferential Reasoning
Pradeep Dasigi
Nelson F. Liu
Ana Marasović
Noah A. Smith
Matt Gardner
RALM
73
173
0
16 Aug 2019
Abductive Commonsense Reasoning
Abductive Commonsense Reasoning
Chandra Bhagavatula
Ronan Le Bras
Chaitanya Malaviya
Keisuke Sakaguchi
Ari Holtzman
Hannah Rashkin
Doug Downey
Scott Yih
Yejin Choi
ReLM
LRM
75
461
0
15 Aug 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
653
24,464
0
26 Jul 2019
ELI5: Long Form Question Answering
ELI5: Long Form Question Answering
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MH
ELM
89
618
0
22 Jul 2019
TWEETQA: A Social Media Focused Question Answering Dataset
TWEETQA: A Social Media Focused Question Answering Dataset
Wenhan Xiong
Jiawei Wu
Hong Wang
Vivek Kulkarni
Mo Yu
Shiyu Chang
Xiaoxiao Guo
William Yang Wang
55
77
0
14 Jul 2019
This Email Could Save Your Life: Introducing the Task of Email Subject
  Line Generation
This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation
Rui Zhang
Joel R. Tetreault
68
78
0
08 Jun 2019
Explain Yourself! Leveraging Language Models for Commonsense Reasoning
Explain Yourself! Leveraging Language Models for Commonsense Reasoning
Nazneen Rajani
Bryan McCann
Caiming Xiong
R. Socher
ReLM
LRM
79
565
0
06 Jun 2019
Multi-News: a Large-Scale Multi-Document Summarization Dataset and
  Abstractive Hierarchical Model
Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model
Alexander R. Fabbri
Irene Li
Tianwei She
Suyi Li
Dragomir R. Radev
81
588
0
04 Jun 2019
MathQA: Towards Interpretable Math Word Problem Solving with
  Operation-Based Formalisms
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Aida Amini
Saadia Gabriel
Shanchuan Lin
Rik Koncel-Kedziorski
Yejin Choi
Hannaneh Hajishirzi
AIMat
ReLM
AI4CE
111
574
0
30 May 2019
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
Christopher Clark
Kenton Lee
Ming-Wei Chang
Tom Kwiatkowski
Michael Collins
Kristina Toutanova
224
1,527
0
24 May 2019
HellaSwag: Can a Machine Really Finish Your Sentence?
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
170
2,485
0
19 May 2019
Structural Scaffolds for Citation Intent Classification in Scientific
  Publications
Structural Scaffolds for Citation Intent Classification in Scientific Publications
Arman Cohan
Bridger Waleed Ammar
Madeleine van Zuylen
Field Cady
52
252
0
02 Apr 2019
PAWS: Paraphrase Adversaries from Word Scrambling
PAWS: Paraphrase Adversaries from Word Scrambling
Yuan Zhang
Jason Baldridge
Luheng He
71
543
0
01 Apr 2019
Mining Discourse Markers for Unsupervised Sentence Representation
  Learning
Mining Discourse Markers for Unsupervised Sentence Representation Learning
Damien Sileo
Tim Van de Cruys
Camille Pradel
Philippe Muller
62
69
0
28 Mar 2019
Task2Vec: Task Embedding for Meta-Learning
Task2Vec: Task Embedding for Meta-Learning
Alessandro Achille
Michael Lam
Rahul Tewari
Avinash Ravichandran
Subhransu Maji
Charless C. Fowlkes
Stefano Soatto
Pietro Perona
SSL
77
315
0
10 Feb 2019
Parameter-Efficient Transfer Learning for NLP
Parameter-Efficient Transfer Learning for NLP
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
210
4,460
0
02 Feb 2019
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading
  Comprehension
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension
Kai Sun
Dian Yu
Jianshu Chen
Dong Yu
Yejin Choi
Claire Cardie
RALM
AIMat
57
295
0
01 Feb 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
124
1,271
0
31 Jan 2019
Evaluating the State-of-the-Art of End-to-End Natural Language
  Generation: The E2E NLG Challenge
Evaluating the State-of-the-Art of End-to-End Natural Language Generation: The E2E NLG Challenge
Ondrej Dusek
Jekaterina Novikova
Verena Rieser
ELM
81
232
0
23 Jan 2019
QuaRel: A Dataset and Models for Answering Questions about Qualitative
  Relationships
QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships
Oyvind Tafjord
Peter Clark
Matt Gardner
Wen-tau Yih
Ashish Sabharwal
AIMat
50
79
0
20 Nov 2018
Wizard of Wikipedia: Knowledge-Powered Conversational agents
Wizard of Wikipedia: Knowledge-Powered Conversational agents
Emily Dinan
Stephen Roller
Kurt Shuster
Angela Fan
Michael Auli
Jason Weston
RALM
KELM
124
950
0
03 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate
  Labeled-data Tasks
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
94
468
0
02 Nov 2018
CommonsenseQA: A Question Answering Challenge Targeting Commonsense
  Knowledge
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
140
1,733
0
02 Nov 2018
Abstractive Summarization of Reddit Posts with Multi-level Memory
  Networks
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks
Byeongchang Kim
Hyunwoo J. Kim
Gunhee Kim
63
183
0
02 Nov 2018
ReCoRD: Bridging the Gap between Human and Machine Commonsense Reading
  Comprehension
ReCoRD: Bridging the Gap between Human and Machine Commonsense Reading Comprehension
Sheng Zhang
Xiaodong Liu
Jingjing Liu
Jianfeng Gao
Kevin Duh
Benjamin Van Durme
72
314
0
30 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
94,891
0
11 Oct 2018
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question
  Answering
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
171
2,655
0
25 Sep 2018
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain
  Semantic Parsing and Text-to-SQL Task
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
Tao Yu
Rui Zhang
Kai-Chou Yang
Michihiro Yasunaga
Dongxu Wang
...
Irene Li
Qingning Yao
Shanelle Roman
Zilin Zhang
Dragomir R. Radev
RALM
95
1,233
0
24 Sep 2018
Hate Speech Dataset from a White Supremacy Forum
Hate Speech Dataset from a White Supremacy Forum
Ona de Gibert
Naiara Pérez
Aitor García-Pablos
Montse Cuadros
72
421
0
12 Sep 2018
Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book
  Question Answering
Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering
Todor Mihaylov
Peter Clark
Tushar Khot
Ashish Sabharwal
113
1,537
0
08 Sep 2018
Learning To Split and Rephrase From Wikipedia Edit History
Learning To Split and Rephrase From Wikipedia Edit History
Jan A. Botha
Manaal Faruqui
J. Alex
Jason Baldridge
Dipanjan Das
KELM
61
72
0
28 Aug 2018
Identifying Well-formed Natural Language Questions
Identifying Well-formed Natural Language Questions
Manaal Faruqui
Dipanjan Das
79
49
0
28 Aug 2018
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive
  Meaning Representations
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations
Mohammad Taher Pilehvar
Jose Camacho-Collados
195
489
0
28 Aug 2018
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional
  Neural Networks for Extreme Summarization
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
126
1,676
0
27 Aug 2018
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense
  Inference
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference
Rowan Zellers
Yonatan Bisk
Roy Schwartz
Yejin Choi
104
718
0
16 Aug 2018
The Natural Language Decathlon: Multitask Learning as Question Answering
The Natural Language Decathlon: Multitask Learning as Question Answering
Bryan McCann
N. Keskar
Caiming Xiong
R. Socher
AIMat
MLLM
BDL
142
645
0
20 Jun 2018
Neural Network Acceptability Judgments
Neural Network Acceptability Judgments
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
230
1,407
0
31 May 2018
DuoRC: Towards Complex Language Understanding with Paraphrased Reading
  Comprehension
DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension
Amrita Saha
Rahul Aralikatte
Mitesh M. Khapra
Karthik Sankaranarayanan
76
197
0
21 Apr 2018
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning
  Challenge
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
158
2,610
0
14 Mar 2018
FEVER: a large-scale dataset for Fact Extraction and VERification
FEVER: a large-scale dataset for Fact Extraction and VERification
James Thorne
Andreas Vlachos
Christos Christodoulopoulos
Arpit Mittal
HILM
145
1,657
0
14 Mar 2018
Routing Networks: Adaptive Selection of Non-linear Functions for
  Multi-Task Learning
Routing Networks: Adaptive Selection of Non-linear Functions for Multi-Task Learning
Clemens Rosenbaum
Tim Klinger
Matthew D Riemer
84
246
0
03 Nov 2017
Constructing Datasets for Multi-hop Reading Comprehension Across
  Documents
Constructing Datasets for Multi-hop Reading Comprehension Across Documents
Johannes Welbl
Pontus Stenetorp
Sebastian Riedel
SyDa
RALM
96
511
0
17 Oct 2017
Seq2SQL: Generating Structured Queries from Natural Language using
  Reinforcement Learning
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
Victor Zhong
Caiming Xiong
R. Socher
RALM
95
1,196
0
31 Aug 2017
Crowdsourcing Multiple Choice Science Questions
Crowdsourcing Multiple Choice Science Questions
Johannes Welbl
Nelson F. Liu
Matt Gardner
AI4Ed
79
506
0
19 Jul 2017
Zero-Shot Relation Extraction via Reading Comprehension
Zero-Shot Relation Extraction via Reading Comprehension
Omer Levy
Minjoon Seo
Eunsol Choi
Luke Zettlemoyer
ReLM
70
694
0
13 Jun 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
701
131,652
0
12 Jun 2017
Program Induction by Rationale Generation : Learning to Solve and
  Explain Algebraic Word Problems
Program Induction by Rationale Generation : Learning to Solve and Explain Algebraic Word Problems
Wang Ling
Dani Yogatama
Chris Dyer
Phil Blunsom
AIMat
79
729
0
11 May 2017
Previous
123
Next