Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.06226
Cited By
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
19 August 2018
Taku Kudo
John Richardson
Re-assign community
ArXiv (abs)
PDF
HTML
Github (10925★)
Papers citing
"SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"
50 / 1,950 papers shown
Title
Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech Translation
Chih-Chiang Chang
Hung-yi Lee
90
13
0
22 Mar 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
589
3,757
0
21 Mar 2022
Teaching language models to support answers with verified quotes
Jacob Menick
Maja Trebacz
Vladimir Mikulik
John Aslanides
Francis Song
...
Mia Glaese
Susannah Young
Lucy Campbell-Gillingham
G. Irving
Nat McAleese
ELM
RALM
316
267
0
21 Mar 2022
AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization
Moussa Kamal Eddine
Nadi Tomeh
Nizar Habash
Joseph Le Roux
Michalis Vazirgiannis
75
46
0
21 Mar 2022
Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability
Yoshinari Fujinuma
Jordan L. Boyd-Graber
Katharina Kann
AAML
143
26
0
21 Mar 2022
Better Language Model with Hypernym Class Prediction
Richard He Bai
Tong Wang
Alessandro Sordoni
Peng Shi
139
16
0
21 Mar 2022
Towards Structuring Real-World Data at Scale: Deep Learning for Extracting Key Oncology Information from Clinical Text with Patient-Level Supervision
Sam Preston
Mu-Hsin Wei
Rajesh Rao
Robert Tinn
Naoto Usuyama
...
Paul D. Tittel
Naveen Valluri
Tristan Naumann
Carlo Bifulco
Hoifung Poon
65
6
0
20 Mar 2022
Sequence-to-Sequence Knowledge Graph Completion and Question Answering
Apoorv Saxena
Adrian Kochsiek
Rainer Gemulla
AIMat
133
129
0
19 Mar 2022
Similarity and Content-based Phonetic Self Attention for Speech Recognition
Kyuhong Shim
Wonyong Sung
73
8
0
19 Mar 2022
Towards Lithuanian grammatical error correction
Lukas Stankevivcius
Mantas Lukovsevivcius
3DV
48
4
0
18 Mar 2022
BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages
Manuel Mager
Arturo Oncevay
Elisabeth Mager
Katharina Kann
Ngoc Thang Vu
88
19
0
16 Mar 2022
Memorizing Transformers
Yuhuai Wu
M. Rabe
DeLesley S. Hutchins
Christian Szegedy
RALM
109
179
0
16 Mar 2022
Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
72
34
0
16 Mar 2022
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Yongchang Hao
Xing Wang
Shuming Shi
Zhaopeng Tu
Michael Lyu
AIMat
74
26
0
16 Mar 2022
Does Corpus Quality Really Matter for Low-Resource Languages?
Mikel Artetxe
Itziar Aldabe
Rodrigo Agerri
Olatz Perez-de-Viñaspre
Aitor Soroa Etxabe
65
20
0
15 Mar 2022
Training a Tokenizer for Free with Private Federated Learning
Eugene Bagdasaryan
Congzheng Song
Rogier van Dalen
M. Seigel
Áine Cahill
FedML
55
5
0
15 Mar 2022
Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation
Yong Cheng
Ankur Bapna
Orhan Firat
Yuan Cao
Pidong Wang
Wolfgang Macherey
72
14
0
15 Mar 2022
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data
Shuyan Zhou
Li Zhang
Yue Yang
Qing Lyu
Pengcheng Yin
Chris Callison-Burch
Graham Neubig
87
29
0
14 Mar 2022
Block-Recurrent Transformers
DeLesley S. Hutchins
Imanol Schlag
Yuhuai Wu
Ethan Dyer
Behnam Neyshabur
103
100
0
11 Mar 2022
State of the Art in Artificial Intelligence applied to the Legal Domain
João Dias
Pedro A. Santos
Nuno Cordeiro
Ana Antunes
Bruno Martins
J. Baptista
C. Gonccalves
AILaw
60
9
0
10 Mar 2022
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
Yutong Chen
Fangyun Wei
Xiao Sun
Zhirong Wu
Stephen Lin
SLR
88
104
0
08 Mar 2022
Extracting linguistic speech patterns of Japanese fictional characters using subword units
Mika Kishino
Kanako Komiya
26
0
0
05 Mar 2022
From Simultaneous to Streaming Machine Translation by Leveraging Streaming History
Javier Iranzo-Sánchez
Jorge Civera Saiz
Alfons Juan
CLL
129
12
0
04 Mar 2022
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
Vaidehi Patil
Partha P. Talukdar
Sunita Sarawagi
120
22
0
03 Mar 2022
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale
Laurent Sartran
Samuel Barrett
A. Kuncoro
Milovs Stanojević
Phil Blunsom
Chris Dyer
98
50
0
01 Mar 2022
JParaCrawl v3.0: A Large-scale English-Japanese Parallel Corpus
Makoto Morishita
Katsuki Chousa
Jun Suzuki
Masaaki Nagata
53
28
0
25 Feb 2022
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies
Zhengxuan Wu
Alex Tamkin
Isabel Papadimitriou
90
11
0
24 Feb 2022
Matching Papers and Reviewers at Large Conferences
Kevin Leyton-Brown
Mausam
Yatin Nandwani
Hedayat Zarkoob
Chris Cameron
N. Newman
Dinesh Raghu
OODD
MQ
69
32
0
24 Feb 2022
A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
Alyssa Lees
Vinh Q. Tran
Yi Tay
Jeffrey Scott Sorensen
Jai Gupta
Donald Metzler
Lucy Vasserman
90
193
0
22 Feb 2022
Models and Datasets for Cross-Lingual Summarisation
Laura Perez-Beltrachini
Mirella Lapata
87
49
0
19 Feb 2022
ST-MoE: Designing Stable and Transferable Sparse Expert Models
Barret Zoph
Irwan Bello
Sameer Kumar
Nan Du
Yanping Huang
J. Dean
Noam M. Shazeer
W. Fedus
MoE
218
205
0
17 Feb 2022
Curriculum optimization for low-resource speech recognition
Anastasia Kuznetsova
Anurag Kumar
Jennifer Drexler Fox
Francis M. Tyers
45
3
0
17 Feb 2022
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition
Jin Sakuma
Tatsuya Komatsu
Robin Scheibler
44
6
0
17 Feb 2022
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation
Tao Ge
Si-Qing Chen
Furu Wei
MoE
91
23
0
16 Feb 2022
General-purpose, long-context autoregressive modeling with Perceiver AR
Curtis Hawthorne
Andrew Jaegle
Cătălina Cangea
Sebastian Borgeaud
C. Nash
...
Hannah R. Sheahan
Neil Zeghidour
Jean-Baptiste Alayrac
João Carreira
Jesse Engel
110
66
0
15 Feb 2022
BLUE at Memotion 2.0 2022: You have my Image, my Text and my Transformer
Ana-Maria Bucur
Adrian Cosma
Ioan-Bogdan Iordache
70
13
0
15 Feb 2022
ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
J. Tan
Y. Tan
C. Chan
Joon Huang Chuah
VLM
ViT
77
19
0
11 Feb 2022
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Ioannis Tsiamas
Gerard I. Gállego
José A. R. Fonollosa
Marta R. Costa-jussá
VLM
66
43
0
09 Feb 2022
Competition-Level Code Generation with AlphaCode
Yujia Li
David Choi
Junyoung Chung
Nate Kushman
Julian Schrittwieser
...
Esme Sutherland Robson
Pushmeet Kohli
Nando de
Koray Kavukcuoglu
Oriol Vinyals
195
1,439
0
08 Feb 2022
mSLAM: Massively multilingual joint pre-training for speech and text
Ankur Bapna
Colin Cherry
Yu Zhang
Ye Jia
Melvin Johnson
Yong Cheng
Simran Khanuja
Jason Riesa
Alexis Conneau
VLM
73
114
0
03 Feb 2022
Unified Scaling Laws for Routed Language Models
Aidan Clark
Diego de Las Casas
Aurelia Guy
A. Mensch
Michela Paganini
...
Oriol Vinyals
Jack W. Rae
Erich Elsen
Koray Kavukcuoglu
Karen Simonyan
MoE
123
187
0
02 Feb 2022
Examining Scaling and Transfer of Language Model Architectures for Machine Translation
Biao Zhang
Behrooz Ghorbani
Ankur Bapna
Yong Cheng
Xavier Garcia
Jonathan Shen
Orhan Firat
84
23
0
01 Feb 2022
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages
Tushar Abhishek
Shivprasad Sagare
Bhavyajeet Singh
Anubhav Sharma
Manish Gupta
Vasudeva Varma
49
9
0
01 Feb 2022
Correcting diacritics and typos with a ByT5 transformer model
Lukas Stankevicius
M. Lukoševičius
J. Kapočiūtė-Dzikienė
Monika Briediene
Tomas Krilavičius
62
21
0
31 Jan 2022
Anticipation-Free Training for Simultaneous Machine Translation
Chih-Chiang Chang
Shun-Po Chuang
Hung-yi Lee
64
7
0
30 Jan 2022
Does Transliteration Help Multilingual Language Modeling?
Ibraheem Muhammad Moosa
Mahmud Elahi Akhter
Ashfia Binte Habib
94
12
0
29 Jan 2022
Schema-Free Dependency Parsing via Sequence Generation
Boda Lin
Zijun Yao
Jiaxin Shi
S. Cao
Binghao Tang
Si Li
Yong Luo
Juanzi Li
Lei Hou
62
0
0
28 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
240
96
0
28 Jan 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
993
9,796
0
28 Jan 2022
Neural-FST Class Language Model for End-to-End Speech Recognition
A. Bruguier
Duc Le
Rohit Prabhavalkar
Dangna Li
Zhe Liu
Bo Wang
Eun Chang
Fuchun Peng
Ozlem Kalinli
M. Seltzer
72
6
0
28 Jan 2022
Previous
1
2
3
...
24
25
26
...
37
38
39
Next