Exploring the Limits of Language Modeling

7 February 2016

Papers citing "Exploring the Limits of Language Modeling"

50 / 167 papers shown

Video Corpus Moment Retrieval with Contrastive Learning Hao Zhang Aixin Sun Wei Jing Guoshun Nan Liangli Zhen Qiufeng Wang Rick Siow Mong Goh 44 81 0 13 May 2021
Towards A Multi-agent System for Online Hate Speech Detection Gaurav Sahu R. Cohen Olga Vechtomova 16 9 0 03 May 2021
Local word statistics affect reading times independently of surprisal Adam Goodkind K. Bicknell 14 11 0 07 Mar 2021
End-to-end deep meta modelling to calibrate and optimize energy consumption and comfort Max H. Cohen Sylvain Le Corff M. Charbit Marius Preda Gilles Noziere AI4CE 18 11 0 01 Feb 2021
Domain-aware Neural Language Models for Speech Recognition Linda Liu Yile Gu Aditya Gourav Ankur Gandhe Shashank Kalmane Denis Filimonov Ariya Rastrow I. Bulyko 36 21 0 05 Jan 2021
Unsupervised Learning of Discourse Structures using a Tree Autoencoder Patrick Huber Giuseppe Carenini 32 4 0 17 Dec 2020
Accurate 3D Object Detection using Energy-Based Models Fredrik K. Gustafsson Martin Danelljan Thomas B. Schon 3DPC 38 10 0 08 Dec 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters Hicham El Boukkouri Olivier Ferret Thomas Lavergne Hiroshi Noji Pierre Zweigenbaum Junichi Tsujii 77 156 0 20 Oct 2020
Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language Andrea Zugarini Matteo Tiezzi Marco Maggini 11 2 0 12 Oct 2020
Near-imperceptible Neural Linguistic Steganography via Self-Adjusting Arithmetic Coding Jiaming Shen Heng Ji Jiawei Han 15 33 0 01 Oct 2020
Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News Reuben Tan Bryan A. Plummer Kate Saenko AAML 26 72 0 16 Sep 2020
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus Cal Peyser S. Mavandadi Tara N. Sainath J. Apfel Ruoming Pang Shankar Kumar 29 46 0 24 Aug 2020
Efficient Urdu Caption Generation using Attention based LSTM Inaam Ilahi Hafiz Muhammad Abdullah Zia Ahtazaz Ehsan Rauf Tabassam Armaghan Ahmed VLM 21 2 0 02 Aug 2020
Learning for Video Compression with Recurrent Auto-Encoder and Recurrent Probability Model Ren Yang Fabian Mentzer Luc Van Gool Radu Timofte 18 138 0 24 Jun 2020
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Andrew Rouditchenko Angie Boggust David Harwath Brian Chen D. Joshi ... Rogerio Feris Brian Kingsbury M. Picheny Antonio Torralba James R. Glass SSL 22 141 0 16 Jun 2020
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing Nikita Klyuchnikov I. Trofimov Ekaterina Artemova Mikhail Salnikov M. Fedorov Evgeny Burnaev VLM 15 101 0 12 Jun 2020
Language Models are Few-Shot Learners Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan ... Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever Dario Amodei BDL 77 40,200 0 28 May 2020
A Systematic Assessment of Syntactic Generalization in Neural Language Models Jennifer Hu Jon Gauthier Peng Qian Ethan Gotlieb Wilcox R. Levy ELM 35 212 0 07 May 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training Linjie Li Yen-Chun Chen Yu Cheng Zhe Gan Licheng Yu Jingjing Liu MLLM VLM OffRL AI4TS 46 493 0 01 May 2020
TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP John X. Morris Eli Lifland Jin Yong Yoo J. E. Grigsby Di Jin Yanjun Qi SILM 27 69 0 29 Apr 2020
Sequence Model Design for Code Completion in the Modern IDE Gareth Ari Aye Gail E. Kaiser 20 30 0 10 Apr 2020
A Survey on Contextual Embeddings Qi Liu Matt J. Kusner Phil Blunsom 225 146 0 16 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation Gunnar A. Sigurdsson Jean-Baptiste Alayrac Aida Nematzadeh Lucas Smaira Mateusz Malinowski João Carreira Phil Blunsom Andrew Zisserman VGen 16 49 0 11 Mar 2020
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence Kihyuk Sohn David Berthelot Chun-Liang Li Zizhao Zhang Nicholas Carlini E. D. Cubuk Alexey Kurakin Han Zhang Colin Raffel AAML 104 3,467 0 21 Jan 2020
Montage: A Neural Network Language Model-Guided JavaScript Engine Fuzzer Suyoung Lee HyungSeok Han S. Cha Sooel Son 17 85 0 13 Jan 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Colin Raffel Noam M. Shazeer Adam Roberts Katherine Lee Sharan Narang Michael Matena Yanqi Zhou Wei Li Peter J. Liu AIMat 126 19,493 0 23 Oct 2019
Optimizing Speech Recognition For The Edge Yuan Shangguan Jian Li Qiao Liang R. Álvarez Ian McGraw 28 64 0 26 Sep 2019
Learning Dense Representations for Entity Retrieval D. Gillick Sayali Kulkarni L. Lansing Alessandro Presta Jason Baldridge Eugene Ie Diego Garcia-Olano RALM 28 201 0 23 Sep 2019
Analysing Neural Language Models: Contextual Decomposition Reveals Default Reasoning in Number and Gender Assignment Jaap Jumelet Willem H. Zuidema Dieuwke Hupkes LRM 33 37 0 19 Sep 2019
PaLM: A Hybrid Parser and Language Model Hao Peng Roy Schwartz Noah A. Smith AIMat 23 15 0 04 Sep 2019
Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training Saptadeep Pal Eiman Ebrahimi A. Zulfiqar Yaosheng Fu Victor Zhang Szymon Migacz D. Nellans Puneet Gupta 34 55 0 30 Jul 2019
Selection via Proxy: Efficient Data Selection for Deep Learning Cody Coleman Christopher Yeh Stephen Mussmann Baharan Mirzasoleiman Peter Bailis Percy Liang J. Leskovec Matei A. Zaharia 26 329 0 26 Jun 2019
Learning Video Representations using Contrastive Bidirectional Transformer Chen Sun Fabien Baradel Kevin Patrick Murphy Cordelia Schmid SSL ViT 27 133 0 13 Jun 2019
Likelihood Ratios for Out-of-Distribution Detection Jie Jessie Ren Peter J. Liu Emily Fertig Jasper Snoek Ryan Poplin M. DePristo Joshua V. Dillon Balaji Lakshminarayanan OODD 50 716 0 07 Jun 2019
Defending Against Neural Fake News Rowan Zellers Ari Holtzman Hannah Rashkin Yonatan Bisk Ali Farhadi Franziska Roesner Yejin Choi AAML 55 999 0 29 May 2019
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network V. Wan Chun-an Chan Tom Kenter Jakub Vít R. Clark 19 75 0 17 May 2019
Gmail Smart Compose: Real-Time Assisted Writing Mengzhao Chen Benjamin Lee G. Bansal Yuan Cao Shuyuan Zhang ... Yinan Wang Andrew M. Dai Z. Chen Timothy Sohn Yonghui Wu 16 203 0 17 May 2019
Generating Long Sequences with Sparse Transformers R. Child Scott Gray Alec Radford Ilya Sutskever 16 1,851 0 23 Apr 2019
Unsupervised Deep Structured Semantic Models for Commonsense Reasoning Shuohang Wang Sheng Zhang Yelong Shen Xiaodong Liu Jingjing Liu Jianfeng Gao Jing Jiang LRM 22 15 0 03 Apr 2019
Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State Richard Futrell Ethan Gotlieb Wilcox Takashi Morita Peng Qian Miguel Ballesteros R. Levy MILM 42 191 0 08 Mar 2019
Structural Supervision Improves Learning of Non-Local Grammatical Dependencies Ethan Gotlieb Wilcox Peng Qian Richard Futrell Miguel Ballesteros R. Levy 26 55 0 03 Mar 2019
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Hafiz Malik 13 26 0 18 Feb 2019
Generating Natural Language Explanations for Visual Question Answering using Scene Graphs and Visual Attention Shalini Ghosh Giedrius Burachas Arijit Ray Avi Ziskind 19 65 0 15 Feb 2019
Cross-lingual Language Model Pretraining Guillaume Lample Alexis Conneau 25 2,710 0 22 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Zihang Dai Zhilin Yang Yiming Yang J. Carbonell Quoc V. Le Ruslan Salakhutdinov VLM 38 3,674 0 09 Jan 2019
Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems Victor Makarenkov Lior Rokach Bracha Shapira 18 35 0 08 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation Cristina Garbacea Samuel Carton Shiyan Yan Qiaozhu Mei ELM 25 29 0 02 Jan 2019
Learning Private Neural Language Modeling with Attentive Aggregation Shaoxiong Ji Shirui Pan Guodong Long Xue Li Jing Jiang Zi Huang FedML MoMe 16 136 0 17 Dec 2018
Inferring the size of the causal universe: features and fusion of causal attribution networks Daniel Berenberg James P. Bagrow CML 6 0 0 14 Dec 2018
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs Sachin Kumar Yulia Tsvetkov 22 70 0 10 Dec 2018