Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Effectiveness of Pre-training for Few-shot Intent Classification
Haode Zhang
Yuwei Zhang
Li-Ming Zhan
Jiaxin Chen
Guangyuan Shi
Xiao-Ming Wu
Albert Y. S. Lam
VLM
118
46
0
13 Sep 2021
Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
86
17
0
13 Sep 2021
How to Select One Among All? An Extensive Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding
Tianda Li
Ahmad Rashid
A. Jafari
Pranav Sharma
A. Ghodsi
Mehdi Rezagholizadeh
AAML
122
5
0
13 Sep 2021
Extracting Event Temporal Relations via Hyperbolic Geometry
Xingwei Tan
Gabriele Pergola
Yulan He
57
24
0
12 Sep 2021
Compute and Energy Consumption Trends in Deep Learning Inference
Radosvet Desislavov
Fernando Martínez-Plumed
José Hernández-Orallo
77
119
0
12 Sep 2021
XCoref: Cross-document Coreference Resolution in the Wild
Anastasia Zhukova
Felix Hamborg
K. Donnay
Bela Gipp
48
4
0
11 Sep 2021
Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding
Shane Storks
Qiaozi Gao
Yichi Zhang
J. Chai
ReLM LRM
111
23
0
10 Sep 2021
Examining Cross-lingual Contextual Embeddings with Orthogonal Structural Probes
Tomasz Limisiewicz
David Mareček
42
3
0
10 Sep 2021
Integrating Approaches to Word Representation
Yuval Pinter
NAI
94
5
0
10 Sep 2021
RoR: Read-over-Read for Long Document Machine Reading Comprehension
Jing Zhao
Junwei Bao
Yifan Wang
Yongwei Zhou
Youzheng Wu
Xiaodong He
Bowen Zhou
AIMat
114
24
0
10 Sep 2021
On the validity of pre-trained transformers for natural language processing in the software engineering domain
Julian von der Mosel
Alexander Trautsch
Steffen Herbold
74
68
0
10 Sep 2021
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation
Haoran Xu
Benjamin Van Durme
Kenton W. Murray
110
62
0
09 Sep 2021
Filling the Gaps in Ancient Akkadian Texts: A Masked Language Modelling Approach
Koren Lazar
Benny Saret
Asaf Yehudai
W. Horowitz
N. Wasserman
Gabriel Stanovsky
74
23
0
09 Sep 2021
Efficient Nearest Neighbor Language Models
Junxian He
Graham Neubig
Taylor Berg-Kirkpatrick
RALM
278
106
0
09 Sep 2021
Unsupervised Pre-training with Structured Knowledge for Improving Natural Language Inference
Xiaoyu Yang
Xiao-Dan Zhu
Zhan Shi
Tianda Li
SSL
59
1
0
08 Sep 2021
Sustainable Modular Debiasing of Language Models
Anne Lauscher
Tobias Lüken
Goran Glavaš
150
124
0
08 Sep 2021
Towards Natural Language Interfaces for Data Visualization: A Survey
Leixian Shen
Enya Shen
Yuyu Luo
Xiaocong Yang
Xuming Hu
Xiongshuai Zhang
Zhiwei Tai
Jianmin Wang
113
146
0
08 Sep 2021
How much pretraining data do language models need to learn syntax?
Laura Pérez-Mayos
Miguel Ballesteros
Leo Wanner
62
32
0
07 Sep 2021
Learning grounded word meaning representations on similarity graphs
Mariella Dimiccoli
H. Wendt
Pau Batlle
41
1
0
07 Sep 2021
Datasets: A Community Library for Natural Language Processing
Quentin Lhoest
Albert Villanova del Moral
Yacine Jernite
A. Thakur
Patrick von Platen
...
Thibault Goehringer
Victor Mustar
François Lagunas
Alexander M. Rush
Thomas Wolf
302
614
0
07 Sep 2021
GPT-3 Models are Poor Few-Shot Learners in the Biomedical Domain
M. Moradi
Kathrin Blagec
F. Haberl
Matthias Samwald
LM&MA AI4MH
103
66
0
06 Sep 2021
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations
Shifeng Liu
Yifang Sun
Bing Li
Wei Wang
Florence T. Bourgeois
A. Dunn
52
14
0
06 Sep 2021
Re-entry Prediction for Online Conversations via Self-Supervised Learning
Lingzhi Wang
Xingshan Zeng
Huang Hu
Kam-Fai Wong
Daxin Jiang
68
6
0
05 Sep 2021
Multi-modal Representation Learning for Video Advertisement Content Structuring
Daya Guo
Zhaoyang Zeng
41
4
0
04 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALM UQCV
393
3,813
0
03 Sep 2021
Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning
Christos Theodoropoulos
James Henderson
Andrei Catalin Coman
Marie-Francine Moens
65
15
0
02 Sep 2021
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond
Amir Feder
Katherine A. Keith
Emaad A. Manzoor
Reid Pryzant
Dhanya Sridhar
...
Roi Reichart
Margaret E. Roberts
Brandon M Stewart
Victor Veitch
Diyi Yang
CML
123
246
0
02 Sep 2021
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindřich Helcl
Alexandra Birch
AIMat
124
164
0
01 Sep 2021
Capturing Stance Dynamics in Social Media: Open Challenges and Research Directions
Rabab Alkhalifa
A. Zubiaga
83
21
0
01 Sep 2021
Sentence Bottleneck Autoencoders from Transformer Language Models
Ivan Montero
Nikolaos Pappas
Noah A. Smith
AI4CE
87
29
0
31 Aug 2021
Sense representations for Portuguese: experiments with sense embeddings and deep neural language models
Jéssica Rodrigues da Silva
Helena de Medeiros Caseli
36
3
0
31 Aug 2021
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
120
123
0
31 Aug 2021
Backdoor Attacks on Pre-trained Models by Layerwise Weight Poisoning
Linyang Li
Demin Song
Xiaonan Li
Jiehang Zeng
Ruotian Ma
Xipeng Qiu
147
141
0
31 Aug 2021
SANSformers: Self-Supervised Forecasting in Electronic Health Records with Attention-Free Models
Yogesh Kumar
Alexander Ilin
H. Salo
S. Kulathinal
M. Leinonen
Pekka Marttinen
AI4TS MedIm
41
0
0
31 Aug 2021
Structured Prediction in NLP -- A survey
Chauhan Dev
Naman Biyani
Nirmal P. Suthar
Prashant Kumar
Priyanshu Agarwal
AI4TS AI4CE
112
0
0
31 Aug 2021
How Does Adversarial Fine-Tuning Benefit BERT?
J. Ebrahimi
Hao Yang
Wei Zhang
AAML
58
4
0
31 Aug 2021
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Yue Liu
Xinyang Jiang
Donglin Bai
Yuge Zhang
Ningxin Zheng
Xuanyi Dong
Lu Liu
Yuqing Yang
Dongsheng Li
73
10
0
30 Aug 2021
GeoVectors: A Linked Open Corpus of OpenStreetMap Embeddings on World Scale
Nicolas Tempelmeier
Simon Gottschalk
Elena Demidova
71
15
0
30 Aug 2021
Span Fine-tuning for Pre-trained Language Models
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
55
2
0
29 Aug 2021
NoiER: An Approach for Training more Reliable Fine-Tuned Downstream Task Models
Myeongjun Jang
Thomas Lukasiewicz
67
4
0
29 Aug 2021
Sentence Structure and Word Relationship Modeling for Emphasis Selection
Haoran Yang
Wai Lam
40
0
0
29 Aug 2021
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding
Guoqing Zheng
Giannis Karamanolakis
Kai Shu
Ahmed Hassan Awadallah
SSL
62
1
0
28 Aug 2021
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
Pierre Colombo
Guillaume Staerman
Chloé Clavel
Pablo Piantanida
205
41
0
27 Aug 2021
Deep learning models are not robust against noise in clinical text
M. Moradi
Kathrin Blagec
Matthias Samwald
OOD
66
6
0
27 Aug 2021
Evaluating the Robustness of Neural Language Models to Input Perturbations
M. Moradi
Matthias Samwald
AAML
101
102
0
27 Aug 2021
Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization
Chujie Zheng
Kunpeng Zhang
Harry J. Wang
Ling Fan
Zhe Wang
60
7
0
26 Aug 2021
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision
Zhijian Liu
Simon Stent
Jie Li
John Gideon
Song Han
VLM
106
10
0
26 Aug 2021
Rethinking Why Intermediate-Task Fine-Tuning Works
Ting-Yun Chang
Chi-Jen Lu
LRM
96
30
0
26 Aug 2021
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens
Itay Itzhak
Omer Levy
74
20
0
25 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
103
12
0
24 Aug 2021