Title
Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation Chengwei Dai Kun Li Wei Zhou Song Hu LRM 58 6 0 30 May 2024
Why Larger Language Models Do In-context Learning Differently? Zhenmei Shi Junyi Wei Zhuoyan Xu Yingyu Liang 37 23 0 30 May 2024
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions Zhe Hu Tuo Liang Jing Li Yiren Lu Yunlai Zhou Yiran Qiao Jing Ma Yu Yin 64 3 0 29 May 2024
Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI Wei-Bang Jiang Li-Ming Zhao Bao-Liang Lu 60 71 0 29 May 2024
Are PPO-ed Language Models Hackable? Suraj Anand David Getzen 37 0 0 28 May 2024
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Alexander Hägele Elie Bakouch Atli Kosson Loubna Ben Allal Leandro von Werra Martin Jaggi 46 37 0 28 May 2024
Towards a theory of how the structure of language is acquired by deep neural networks Francesco Cagnetta Matthieu Wyart 41 9 0 28 May 2024
FinerCut: Finer-grained Interpretable Layer Pruning for Large Language Models Yang Zhang Yawei Li Xinpeng Wang Qianli Shen Barbara Plank Bernd Bischl Mina Rezaei Kenji Kawaguchi 68 10 0 28 May 2024
Self-Guiding Exploration for Combinatorial Problems Zangir Iklassov Yali Du Farkhad Akimov Martin Takáč LRM 32 3 0 28 May 2024
Metaheuristics and Large Language Models Join Forces: Toward an Integrated Optimization Approach Camilo Chacón Sartori Christian Blum Filippo Bistaffa Guillem Rodríguez Corominas AIFin 61 4 0 28 May 2024
Phase Transitions in the Output Distribution of Large Language Models Julian Arnold Flemming Holtorf Frank Schafer Niels Lörch 51 1 0 27 May 2024
Saturn: Sample-efficient Generative Molecular Design using Memory Manipulation Jeff Guo Philippe Schwaller Mamba 61 7 0 27 May 2024
Assessing Empathy in Large Language Models with Real-World Physician-Patient Interactions Man Luo Christopher J. Warren Lu Cheng Haidar M Abdul-Muhsin Imon Banerjee LM&MA AI4MH 42 10 0 26 May 2024
The global landscape of academic guidelines for generative AI and Large Language Models Junfeng Jiao S. Afroogh Kevin Chen David Atkinson Amit Dhurandhar 92 4 0 26 May 2024
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models Xudong Lu Aojun Zhou Yuhui Xu Renrui Zhang Peng Gao Hongsheng Li 45 7 0 25 May 2024
Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention Andrew Li Xianle Feng Siddhant Narang Austin Peng Tianle Cai Raj Sanjay Shah Sashank Varma LRM 46 6 0 25 May 2024
Unsupervised Meta-Learning via In-Context Learning Anna Vettoruzzo Lorenzo Braccaioli Joaquin Vanschoren M. Nowaczyk SSL 78 0 0 25 May 2024
Open-Vocabulary SAM3D: Understand Any 3D Scene Hanchen Tai Qingdong He Jiangning Zhang Yijie Qian Zhenyu Zhang Xiaobin Hu Yabiao Wang Yong Liu VLM 71 1 0 24 May 2024
Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs Siyuan Guo Aniket Didolkar Nan Rosemary Ke Anirudh Goyal Ferenc Huszár Bernhard Schölkopf 59 4 0 24 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks Jacob Russin Sam Whitman McGrath Danielle J. Williams Lotem Elber-Dorozko AI4CE 110 3 0 24 May 2024
Quantifying the Gain in Weak-to-Strong Generalization Moses Charikar Chirag Pabbaraju Kirankumar Shiragur ELM 53 19 0 24 May 2024
AstroPT: Scaling Large Observation Models for Astronomy Michael J. Smith Ryan J. Roberts E. Angeloudi M. Huertas-Company 61 1 0 23 May 2024
Lessons from the Trenches on Reproducible Evaluation of Language Models Stella Biderman Hailey Schoelkopf Lintang Sutawika Leo Gao J. Tow ... Xiangru Tang Kevin A. Wang Genta Indra Winata Franccois Yvon Andy Zou ELM ALM 143 55 3 23 May 2024
Large language models can be zero-shot anomaly detectors for time series? Sarah Alnegheimish Linh Nguyen Laure Berti-Equille K. Veeramachaneni AI4TS 45 13 0 23 May 2024
MultiCast: Zero-Shot Multivariate Time Series Forecasting Using LLMs Georgios Chatzigeorgakidis Konstantinos Lentzos Dimitrios Skoutas AI4TS 48 3 0 23 May 2024
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs Jaewoo Yang Hayun Kim Younghoon Kim 52 12 0 23 May 2024
Explainable Few-shot Knowledge Tracing Haoxuan Li Jifan Yu Y. Ouyang Zhuang Liu Wenge Rong Juan-Zi Li Zhang Xiong 48 1 0 23 May 2024
Can Large Language Models Create New Knowledge for Spatial Reasoning Tasks? Thomas Greatrix Roger Whitaker Liam Turner Walter Colombo LRM 52 1 0 23 May 2024
Focus Anywhere for Fine-grained Multi-page Document Understanding Chenglong Liu Haoran Wei Jinyue Chen Lingyu Kong Zheng Ge Zining Zhu Liang Zhao Jian‐Yuan Sun Chunrui Han Xiangyu Zhang 46 22 0 23 May 2024
Large Language Models' Detection of Political Orientation in Newspapers Alessio Buscemi Daniele Proverbio 40 3 0 23 May 2024
How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator Subhash Kantamneni Ziming Liu Max Tegmark 26 2 0 23 May 2024
Implicit In-context Learning Zhuowei Li Zihao Xu Ligong Han Yunhe Gao Song Wen Di Liu Hao Wang Dimitris N. Metaxas 67 2 0 23 May 2024
Carbon Connect: An Ecosystem for Sustainable Computing Benjamin C. Lee David Brooks Arthur van Benthem Udit Gupta G. Hills ... Emma Strubell Gu-Yeon Wei Adam Wierman Yuan Yao Minlan Yu 30 2 0 22 May 2024
Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation Cyril Chhun Fabian M. Suchanek Chloé Clavel LRM 49 15 0 22 May 2024
A Survey of Robotic Language Grounding: Tradeoffs between Symbols and Embeddings Vanya Cohen J. Liu Raymond J. Mooney Stefanie Tellex David Watkins LM&Ro 62 12 0 21 May 2024
Securing the Future of GenAI: Policy and Technology Mihai Christodorescu Craven Soheil Feizi Neil Zhenqiang Gong Mia Hoffmann ... Jessica Newman Emelia Probasco Yanjun Qi Khawaja Shams Turek SILM 82 3 0 21 May 2024
EchoPT: A Pretrained Transformer Architecture that Predicts 2D In-Air Sonar Images for Mobile Robotics Jan Steckel W. Jansen Nico Huebel MDE 48 0 0 21 May 2024
Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning Guanglin Zhou Zhongyi Han Shiming Chen Erdun Gao Liming Zhu Salman Khan Xin Gao Lina Yao VLM 66 4 0 20 May 2024
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs Siyu Lou Yuntian Chen Xiaodan Liang Liang Lin Quanshi Zhang 74 2 0 20 May 2024
Asymptotic theory of in-context learning by linear attention Yue M. Lu Mary I. Letey Jacob A. Zavatone-Veth Anindita Maiti Cengiz Pehlevan 42 11 0 20 May 2024
Large Language Models are Biased Reinforcement Learners William M. Hayes Nicolas Yax Stefano Palminteri OffRL 50 1 0 19 May 2024
Mitigating Interpretation Bias in Rock Records with Large Language Models: Insights from Paleoenvironmental Analysis Luoqi Wang Haipeng Li Linshu Hu Jiarui Cai Zhenhong Du AI4CE 45 0 0 17 May 2024
Function Extrapolation with Neural Networks and Its Application for Manifolds Guy Hay N. Sharon 57 1 0 17 May 2024
Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting Xinzhe Li Ming Liu 59 0 0 17 May 2024
Can formal argumentative reasoning enhance LLMs performances? Federico Castagna I. Sassoon Simon Parsons LRM LLMAG 30 2 0 16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models Xianzheng Ma Yash Bhalgat Brandon Smart Shuai Chen Xinghui Li ... Matthias Nießner Ian D Reid Angel X. Chang Iro Laina V. Prisacariu LRM 42 14 0 16 May 2024
What is it for a Machine Learning Model to Have a Capability? Jacqueline Harding Nathaniel Sharadin ELM 45 3 0 14 May 2024
Hearing Touch: Audio-Visual Pretraining for Contact-Rich Manipulation Jared Mejia Victoria Dean Tess Hellebrekers Abhinav Gupta 62 13 0 14 May 2024
Improving Transformers with Dynamically Composable Multi-Head Attention Da Xiao Qingye Meng Shengping Li Xingyuan Yuan 40 3 0 14 May 2024
When Large Language Models Meet Optical Networks: Paving the Way for Automation Danshi Wang Yidi Wang Xiaotian Jiang Yao Zhang Yue Pang Min Zhang 42 5 0 14 May 2024