ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.09084
  4. Cited By
Do Syntax Trees Help Pre-trained Transformers Extract Information?

Do Syntax Trees Help Pre-trained Transformers Extract Information?

20 August 2020
Devendra Singh Sachan
Yuhao Zhang
Peng Qi
William L. Hamilton
ArXivPDFHTML

Papers citing "Do Syntax Trees Help Pre-trained Transformers Extract Information?"

30 / 30 papers shown
Title
Punctuation Restoration Improves Structure Understanding Without Supervision
Punctuation Restoration Improves Structure Understanding Without Supervision
Junghyun Min
Minho Lee
Woochul Lee
Yeonsoo Lee
71
1
0
13 Feb 2024
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
36
33
0
27 May 2020
TACRED Revisited: A Thorough Evaluation of the TACRED Relation
  Extraction Task
TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task
Christoph Alt
Aleksandra Gabryszak
Leonhard Hennig
94
154
0
30 Apr 2020
Universal Dependencies v2: An Evergrowing Multilingual Treebank
  Collection
Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
Joakim Nivre
M. Marneffe
Filip Ginter
Jan Hajivc
Christopher D. Manning
S. Pyysalo
Sebastian Schuster
Francis M. Tyers
Daniel Zeman
VLM
13
511
0
22 Apr 2020
Stanza: A Python Natural Language Processing Toolkit for Many Human
  Languages
Stanza: A Python Natural Language Processing Toolkit for Many Human Languages
Peng Qi
Yuhao Zhang
Yuhui Zhang
Jason Bolton
Christopher D. Manning
AI4TS
225
1,681
0
16 Mar 2020
Dependency-Guided LSTM-CRF for Named Entity Recognition
Dependency-Guided LSTM-CRF for Named Entity Recognition
Zhanming Jie
Wei Lu
33
95
0
23 Sep 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
346
24,160
0
26 Jul 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
156
8,386
0
19 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
172
1,586
0
11 Jun 2019
Analyzing the Structure of Attention in a Transformer Language Model
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
41
361
0
07 Jun 2019
BERT Rediscovers the Classical NLP Pipeline
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney
Dipanjan Das
Ellie Pavlick
MILM
SSeg
75
1,458
0
15 May 2019
Efficient Dependency-Guided Named Entity Recognition
Efficient Dependency-Guided Named Entity Recognition
Zhanming Jie
Aldrian Obaja Muis
Wei Lu
18
40
0
19 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
808
93,936
0
11 Oct 2018
Graph Convolution over Pruned Dependency Trees Improves Relation
  Extraction
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction
Yuhao Zhang
Peng Qi
Christopher D. Manning
GNN
53
726
0
26 Sep 2018
Relational inductive biases, deep learning, and graph networks
Relational inductive biases, deep learning, and graph networks
Peter W. Battaglia
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
V. Zambaldi
...
Pushmeet Kohli
M. Botvinick
Oriol Vinyals
Yujia Li
Razvan Pascanu
AI4CE
NAI
339
3,101
0
04 Jun 2018
Linguistically-Informed Self-Attention for Semantic Role Labeling
Linguistically-Informed Self-Attention for Semantic Role Labeling
Emma Strubell
Pat Verga
D. Andor
David J. Weiss
Andrew McCallum
OffRL
56
379
0
23 Apr 2018
Deep contextualized word representations
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
74
11,520
0
15 Feb 2018
Graph Attention Networks
Graph Attention Networks
Petar Velickovic
Guillem Cucurull
Arantxa Casanova
Adriana Romero
Pietro Lio
Yoshua Bengio
GNN
283
19,902
0
30 Oct 2017
Representation Learning on Graphs: Methods and Applications
Representation Learning on Graphs: Methods and Applications
William L. Hamilton
Rex Ying
J. Leskovec
GNN
100
1,970
0
17 Sep 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
314
129,831
0
12 Jun 2017
Encoding Sentences with Graph Convolutional Networks for Semantic Role
  Labeling
Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling
Diego Marcheggiani
Ivan Titov
GNN
NAI
37
830
0
14 Mar 2017
Deep Biaffine Attention for Neural Dependency Parsing
Deep Biaffine Attention for Neural Dependency Parsing
Timothy Dozat
Christopher D. Manning
88
1,220
0
06 Nov 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
199
10,412
0
21 Jul 2016
Gaussian Error Linear Units (GELUs)
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
139
4,934
0
27 Jun 2016
Neural Semantic Role Labeling with Dependency Path Embeddings
Neural Semantic Role Labeling with Dependency Path Embeddings
Michael Roth
Mirella Lapata
34
189
0
24 May 2016
End-to-End Relation Extraction using LSTMs on Sequences and Tree
  Structures
End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures
Makoto Miwa
Joey Tianyi Zhou
92
1,184
0
05 Jan 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.1K
192,638
0
10 Dec 2015
Training Very Deep Networks
Training Very Deep Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
79
1,675
0
22 Jul 2015
A Dependency-Based Neural Network for Relation Classification
A Dependency-Based Neural Network for Relation Classification
Yang Liu
Furu Wei
Sujian Li
Heng Ji
M. Zhou
Houfeng Wang
30
231
0
16 Jul 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
519
149,474
0
22 Dec 2014
1