Title
Stealing the Decoding Algorithms of Language Models A. Naseh Kalpesh Krishna Mohit Iyyer Amir Houmansadr MLAU 56 20 0 08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT Yihan Cao Siyu Li Yixin Liu Zhiling Yan Yutong Dai Philip S. Yu Lichao Sun 38 509 0 07 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism Hannah Rose Kirk Wenjie Yin Bertie Vidgen Paul Röttger 29 117 0 07 Mar 2023
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset Hugo Laurençon Lucile Saulnier Thomas Wang Christopher Akiki Albert Villanova del Moral ... Violette Lepercq Suzana Ilić Margaret Mitchell Sasha Luccioni Yacine Jernite AI4CE AILaw 44 163 0 07 Mar 2023
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models Victor C. Dibia VLM 32 80 0 06 Mar 2023
OpenICL: An Open-Source Framework for In-context Learning Zhenyu Wu Yaoxiang Wang Jiacheng Ye Jiangtao Feng Jingjing Xu Yu Qiao Zhiyong Wu 29 49 0 06 Mar 2023
Data Portraits: Recording Foundation Model Training Data Marc Marone Benjamin Van Durme 143 30 0 06 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts Shikun Liu Linxi Fan Edward Johns Zhiding Yu Chaowei Xiao Anima Anandkumar VLM MLLM 49 21 0 04 Mar 2023
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM Rachel Bawden François Yvon VLM LRM 32 60 0 03 Mar 2023
Competence-Based Analysis of Language Models Adam Davies Jize Jiang Chengxiang Zhai ELM 34 4 0 01 Mar 2023
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks Xuanting Chen Junjie Ye Can Zu Nuo Xu Rui Zheng Minlong Peng Jie Zhou Tao Gui Qi Zhang Xuanjing Huang AI4MH ELM 38 79 0 01 Mar 2023
EvoPrompting: Language Models for Code-Level Neural Architecture Search Angelica Chen David Dohan David R. So VLM LRM 33 84 0 28 Feb 2023
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following Seonghyeon Ye Hyeonbin Hwang Sohee Yang Hyeongu Yun Yireun Kim Minjoon Seo LRM 32 34 0 28 Feb 2023
HugNLP: A Unified and Comprehensive Library for Natural Language Processing Rongxiang Weng Nuo Chen Qiushi Sun Wenkang Huang Chengyu Wang Ming Gao 32 3 0 28 Feb 2023
LLaMA: Open and Efficient Foundation Language Models Hugo Touvron Thibaut Lavril Gautier Izacard Xavier Martinet Marie-Anne Lachaux ... Faisal Azhar Aurelien Rodriguez Armand Joulin Edouard Grave Guillaume Lample ALM PILM 100 12,418 0 27 Feb 2023
Finding Support Examples for In-Context Learning Xiaonan Li Xipeng Qiu 32 89 0 27 Feb 2023
Fast Attention Requires Bounded Entries Josh Alman Zhao Song 30 81 0 26 Feb 2023
Does a Neural Network Really Encode Symbolic Concepts? Mingjie Li Quanshi Zhang 34 30 0 25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation Haixing Dai Zheng Liu Wenxiong Liao Xiaoke Huang Yihan Cao ... Lichao Sun Quanzheng Li Dinggang Shen Tianming Liu Xiang Li 43 154 0 25 Feb 2023
Semantic Mechanical Search with Large Vision and Language Models Satvik Sharma Huang Huang K. Shivakumar A. Imran Ryan Hoque Brian Ichter Ken Goldberg LM&Ro VLM 29 5 0 24 Feb 2023
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback Baolin Peng Michel Galley Pengcheng He Hao Cheng Yujia Xie ... Qiuyuan Huang Lars Liden Zhou Yu Weizhu Chen Jianfeng Gao KELM HILM LRM 25 380 0 24 Feb 2023
In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages Asim Ersoy Gerson Vizcarra T. Mayeesha Benjamin Muller 28 2 0 23 Feb 2023
Active Prompting with Chain-of-Thought for Large Language Models Shizhe Diao Pengcheng Wang Yong Lin Tong Zhang ReLM KELM LLMAG LRM 41 122 0 23 Feb 2023
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective Jindong Wang Xixu Hu Wenxin Hou Hao Chen Runkai Zheng ... Weirong Ye Xiubo Geng Binxing Jiao Yue Zhang Xingxu Xie AI4MH 52 221 0 22 Feb 2023
In-context Example Selection with Influences Nguyen Tai Eric Wong 29 48 0 21 Feb 2023
$k$NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models Yangsibo Huang Daogao Liu Zexuan Zhong Weijia Shi Y. Lee RALM ALM 43 14 0 21 Feb 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems Yihao Feng Shentao Yang Shujian Zhang Jianguo Zhang Caiming Xiong Mi Zhou Haiquan Wang OffRL 31 24 0 20 Feb 2023
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation Lorenz Kuhn Y. Gal Sebastian Farquhar UQLM 48 261 0 19 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT Qihuang Zhong Liang Ding Juhua Liu Bo Du Dacheng Tao AI4MH 66 238 0 19 Feb 2023
Complex QA and language models hybrid architectures, Survey Xavier Daull P. Bellot Emmanuel Bruno Vincent Martin Elisabeth Murisasco ELM 36 15 0 17 Feb 2023
Auditing large language models: a three-layered approach Jakob Mökander Jonas Schuett Hannah Rose Kirk Luciano Floridi AILaw MLAU 55 196 0 16 Feb 2023
Counting Carbon: A Survey of Factors Influencing the Emissions of Machine Learning A. Luccioni Alex Hernandez-Garcia 34 45 0 16 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training Hongzheng Chen Cody Hao Yu Shuai Zheng Zhen Zhang Zhiru Zhang Yida Wang 33 6 0 16 Feb 2023
Measuring the Instability of Fine-Tuning Yupei Du D. Nguyen 27 4 0 15 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation Lin Zheng Jianbo Yuan Lei Yu Lingpeng Kong DiffM 41 57 0 11 Feb 2023
Distillation of encoder-decoder transformers for sequence labelling M. Farina D. Pappadopulo Anant Gupta Leslie Huang Ozan Irsoy Thamar Solorio VLM 107 3 0 10 Feb 2023
Toolformer: Language Models Can Teach Themselves to Use Tools Timo Schick Jane Dwivedi-Yu Roberto Dessì Roberta Raileanu Maria Lomeli Luke Zettlemoyer Nicola Cancedda Thomas Scialom SyDa RALM 43 1,608 0 09 Feb 2023
Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models Mohammadreza Banaei Klaudia Bałazy Artur Kasymov R. Lebret Jacek Tabor Karl Aberer OffRL 21 0 0 08 Feb 2023
The Gradient of Generative AI Release: Methods and Considerations Irene Solaiman 36 98 0 05 Feb 2023
FineDeb: A Debiasing Framework for Language Models Akash Saravanan Dhruv Mullick Habibur Rahman Nidhi Hegde FedML AI4CE 26 4 0 05 Feb 2023
The Science of Detecting LLM-Generated Texts Ruixiang Tang Yu-Neng Chuang Xia Hu DeLMO 42 169 0 04 Feb 2023
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents Zihao Wang Shaofei Cai Guanzhou Chen Hoang Trung-Dung Xiaojian Ma Yitao Liang LM&Ro LLMAG 60 318 0 03 Feb 2023
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment Hao Liu Wilson Yan Pieter Abbeel 34 25 0 02 Feb 2023
Benchmarking Large Language Models for News Summarization Tianyi Zhang Faisal Ladhak Esin Durmus Percy Liang Kathleen McKeown Tatsunori B. Hashimoto ELM 43 487 0 31 Jan 2023
REPLUG: Retrieval-Augmented Black-Box Language Models Weijia Shi Sewon Min Michihiro Yasunaga Minjoon Seo Rich James M. Lewis Luke Zettlemoyer Wen-tau Yih RALM VLM KELM 85 586 0 30 Jan 2023
Large Language Models for Biomedical Knowledge Graph Construction: Information extraction from EMR notes Vahan Arsenyan Spartak Bughdaryan Fadi Shaya Kent Small Davit Shahnazaryan 35 10 0 29 Jan 2023
Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus Alex Warstadt Leshem Choshen Aaron Mueller Adina Williams Ethan Gotlieb Wilcox Chengxu Zhuang 30 54 0 27 Jan 2023
Affective Faces for Goal-Driven Dyadic Communication Scott Geng Revant Teotia Purva Tendulkar Sachit Menon Carl Vondrick VGen 34 19 0 26 Jan 2023
Explainable AI does not provide the explanations end-users are asking for Savio Rozario G. Cevora XAI 20 0 0 25 Jan 2023
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning Malte Ostendorff Georg Rehm CLIP VLM CLL 46 24 0 23 Jan 2023