Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.09084
Cited By
Do Syntax Trees Help Pre-trained Transformers Extract Information?
20 August 2020
Devendra Singh Sachan
Yuhao Zhang
Peng Qi
William L. Hamilton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Do Syntax Trees Help Pre-trained Transformers Extract Information?"
30 / 30 papers shown
Title
Punctuation Restoration Improves Structure Understanding Without Supervision
Junghyun Min
Minho Lee
Woochul Lee
Yeonsoo Lee
71
1
0
13 Feb 2024
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
36
33
0
27 May 2020
TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task
Christoph Alt
Aleksandra Gabryszak
Leonhard Hennig
94
154
0
30 Apr 2020
Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
Joakim Nivre
M. Marneffe
Filip Ginter
Jan Hajivc
Christopher D. Manning
S. Pyysalo
Sebastian Schuster
Francis M. Tyers
Daniel Zeman
VLM
13
511
0
22 Apr 2020
Stanza: A Python Natural Language Processing Toolkit for Many Human Languages
Peng Qi
Yuhao Zhang
Yuhui Zhang
Jason Bolton
Christopher D. Manning
AI4TS
225
1,681
0
16 Mar 2020
Dependency-Guided LSTM-CRF for Named Entity Recognition
Zhanming Jie
Wei Lu
33
95
0
23 Sep 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
346
24,160
0
26 Jul 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
156
8,386
0
19 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
172
1,586
0
11 Jun 2019
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
41
361
0
07 Jun 2019
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney
Dipanjan Das
Ellie Pavlick
MILM
SSeg
75
1,458
0
15 May 2019
Efficient Dependency-Guided Named Entity Recognition
Zhanming Jie
Aldrian Obaja Muis
Wei Lu
18
40
0
19 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
808
93,936
0
11 Oct 2018
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction
Yuhao Zhang
Peng Qi
Christopher D. Manning
GNN
53
726
0
26 Sep 2018
Relational inductive biases, deep learning, and graph networks
Peter W. Battaglia
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
V. Zambaldi
...
Pushmeet Kohli
M. Botvinick
Oriol Vinyals
Yujia Li
Razvan Pascanu
AI4CE
NAI
339
3,101
0
04 Jun 2018
Linguistically-Informed Self-Attention for Semantic Role Labeling
Emma Strubell
Pat Verga
D. Andor
David J. Weiss
Andrew McCallum
OffRL
56
379
0
23 Apr 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
74
11,520
0
15 Feb 2018
Graph Attention Networks
Petar Velickovic
Guillem Cucurull
Arantxa Casanova
Adriana Romero
Pietro Lio
Yoshua Bengio
GNN
283
19,902
0
30 Oct 2017
Representation Learning on Graphs: Methods and Applications
William L. Hamilton
Rex Ying
J. Leskovec
GNN
100
1,970
0
17 Sep 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
314
129,831
0
12 Jun 2017
Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling
Diego Marcheggiani
Ivan Titov
GNN
NAI
37
830
0
14 Mar 2017
Deep Biaffine Attention for Neural Dependency Parsing
Timothy Dozat
Christopher D. Manning
88
1,220
0
06 Nov 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
199
10,412
0
21 Jul 2016
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
139
4,934
0
27 Jun 2016
Neural Semantic Role Labeling with Dependency Path Embeddings
Michael Roth
Mirella Lapata
34
189
0
24 May 2016
End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures
Makoto Miwa
Joey Tianyi Zhou
92
1,184
0
05 Jan 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.1K
192,638
0
10 Dec 2015
Training Very Deep Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
79
1,675
0
22 Jul 2015
A Dependency-Based Neural Network for Relation Classification
Yang Liu
Furu Wei
Sujian Li
Heng Ji
M. Zhou
Houfeng Wang
30
231
0
16 Jul 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
519
149,474
0
22 Dec 2014
1