ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.05748
  4. Cited By
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based
  on Transformer Networks

GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks

13 September 2021
Weicheng Ma
Renze Lou
Kai Zhang
Lili Wang
Soroush Vosoughi
ArXivPDFHTML

Papers citing "GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks"

4 / 4 papers shown
Title
What Does BERT Look At? An Analysis of BERT's Attention
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
209
1,592
0
11 Jun 2019
Are Sixteen Heads Really Better than One?
Are Sixteen Heads Really Better than One?
Paul Michel
Omer Levy
Graham Neubig
MoE
97
1,060
0
25 May 2019
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy
  Lifting, the Rest Can Be Pruned
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned
Elena Voita
David Talbot
F. Moiseev
Rico Sennrich
Ivan Titov
104
1,134
0
23 May 2019
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in
  Conversations
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Gautam Naik
Min Zhang
Rada Mihalcea
98
1,065
0
05 Oct 2018
1