Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,925 papers shown

Title
RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning Yinpei Dai Jayjun Lee Nima Fazeli Joyce Chai 71 13 0 23 Sep 2024
Speechworthy Instruction-tuned Language Models Hyundong Justin Cho Nicolaas Jedema Leonardo F. R. Ribeiro Karishma Sharma Pedro Szekely Alessandro Moschitti Ruben Janssen Jonathan May ALM 85 1 0 23 Sep 2024
Multi-modal Generative AI: Multi-modal LLMs, Diffusions and the Unification X. Wang Yuwei Zhou Bin Huang Hong Chen Wenwu Zhu DiffM 158 9 0 23 Sep 2024
Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method Weichao Zhang Ruqing Zhang Jiafeng Guo Maarten de Rijke Yixing Fan Xueqi Cheng 156 16 0 23 Sep 2024
Can pre-trained language models generate titles for research papers? Tohida Rehman Debarshi Kumar Sanyal S. Chattopadhyay 99 3 0 22 Sep 2024
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment Yuxiao Chen Keqin Li Wentao Bao Deep Patel Yu Kong Martin Renqiang Min Dimitris N. Metaxas DiffM 93 1 0 22 Sep 2024
Work Smarter Not Harder: Simple Imitation Learning with CS-PIBT Outperforms Large Scale Imitation Learning for MAPF Rishi Veerapaneni Arthur Jakobsson Kevin Ren Samuel Kim Jiaoyang Li Maxim Likhachev 76 1 0 22 Sep 2024
Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization Minyi Zhao Jie Wang Zerui Li Jiyuan Zhang Zhenbang Sun Shuigeng Zhou MLLM VLM 138 0 0 22 Sep 2024
SAC-KG: Exploiting Large Language Models as Skilled Automatic Constructors for Domain Knowledge Graphs Hanzhu Chen Xu Shen Qitan Lv Jie Wang Xiaoqi Ni Jieping Ye 79 10 0 22 Sep 2024
Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses Hung-Ting Su Ya-Ching Hsu Xudong Lin Xiang Qian Shi Yulei Niu Han-Yuan Hsu Hung-yi Lee Winston H. Hsu LRM 55 1 0 22 Sep 2024
Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics Burooj Ghani Vincent J. Kalkman Bob Planqué Willem-Pier Vellinga L. Gill Dan Stowell VLM 69 6 0 21 Sep 2024
AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model Kazuma Komiya Yoshihisa Fukuhara 60 0 0 21 Sep 2024
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs Ehsan Kabir Md. Arafat Kabir Austin R. J. Downey Jason D. Bakos David Andrews Miaoqing Huang GNN 66 0 0 21 Sep 2024
One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks Sebastian Nehrdich Oliver Hellwig Kurt Keutzer 62 5 0 20 Sep 2024
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning Daniele Rege Cambrin Giuseppe Gallipoli Irene Benedetto Luca Cagliero Paolo Garza 55 0 0 20 Sep 2024
ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources Shuting Yang Zehui Liu Wolfgang Mayer RALM 56 3 0 20 Sep 2024
Towards Long-Context Time Series Foundation Models Nina Żukowska Mononito Goswami Michał Wiliński Willa Potosnak Artur Dubrawski AI4TS 61 3 0 20 Sep 2024
EMMeTT: Efficient Multimodal Machine Translation Training Piotr Żelasko Zhehuai Chen Mengru Wang Daniel Galvez Oleksii Hrinchuk Shuoyang Ding Ke Hu Jagadeesh Balam Vitaly Lavrukhin Boris Ginsburg 85 1 0 20 Sep 2024
Imagine yourself: Tuning-Free Personalized Image Generation Zecheng He Bo Sun Felix Juefei-Xu Haoyu Ma Ankit Ramchandani ... Ning Zhang Peizhao Zhang Roshan Sumbaly Peter Vajda Animesh Sinha DiffM 102 19 0 20 Sep 2024
Towards LifeSpan Cognitive Systems Yu Wang Chi Han Tongtong Wu Xiaoxin He Wangchunshu Zhou ... Zexue He Wei Wang Gholamreza Haffari Heng Ji Julian McAuley KELM CLL 486 2 0 20 Sep 2024
Exploring Scaling Laws for Local SGD in Large Language Model Training Qiaozhi He Xiaomin Zhuang Zhihua Wu 92 4 0 20 Sep 2024
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition Stephen Zhang Vardan Papyan VLM 166 3 0 20 Sep 2024
Cross-Domain Content Generation with Domain-Specific Small Language Models Ankit Maloo Abhinav Garg CLL 47 0 0 19 Sep 2024
Exploring Large Language Models for Product Attribute Value Identification Kassem Sabeh Mouna Kacimi Johann Gamper Robert Litschko Barbara Plank 75 2 0 19 Sep 2024
Text2Traj2Text: Learning-by-Synthesis Framework for Contextual Captioning of Human Movement Trajectories Hikaru Asano Ryo Yonetani Taiki Sekii Hiroki Ouchi 105 0 0 19 Sep 2024
Enhancing SLM via ChatGPT and Dataset Augmentation Tom Pieper Mohamad Ballout U. Krumnack Gunther Heidemann Kai-Uwe Kühnberger 97 0 0 19 Sep 2024
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights Mohamad Ballout U. Krumnack Gunther Heidemann Kai-Uwe Kühnberger 95 3 0 19 Sep 2024
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Xiaotian Han Yiren Jian Xuefeng Hu Haogeng Liu Yiqi Wang ... Yuang Ai Huaibo Huang Ran He Zhenheng Yang Quanzeng You LRM AI4CE 62 22 0 19 Sep 2024
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward Dongheng Li Yongchang Hao Lili Mou 114 2 0 19 Sep 2024
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models Shengsheng Qian Zuyi Zhou Dizhan Xue Bing Wang Changsheng Xu LRM 154 2 0 19 Sep 2024
Small Language Models are Equation Reasoners Bumjun Kim Kunha Lee Juyeon Kim Sangam Lee ReLM LRM 45 3 0 19 Sep 2024
AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions Yun Wang Hangting Chen Dongchao Yang Zhiyong Wu Xixin Wu DiffM 97 2 0 19 Sep 2024
Tokenization for Molecular Foundation Models Alexius Wadell Anoushka Bhutani Venkatasubramanian Viswanathan 476 1 0 19 Sep 2024
Ethical software requirements from user reviews: A systematic literature review Aakash Sorathiya Gouri Ginde 40 2 0 18 Sep 2024
Fine-Tuning a Time Series Foundation Model with Wasserstein Loss Andrei Chernov AI4TS 38 0 0 18 Sep 2024
Computational Imaging for Long-Term Prediction of Solar Irradiance Leron Julian Haejoon Lee S. Kar Aswin C. Sankaranarayanan 74 0 0 18 Sep 2024
FLARE: Fusing Language Models and Collaborative Architectures for Recommender Enhancement Liam Hebert Marialena Kyriakidi Hubert Pham Krishna Sayana James Pine Sukhdeep S. Sodhi Ambarish Jash VLM 101 4 0 18 Sep 2024
Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning Ilaria Manco Justin Salamon Oriol Nieto 62 2 0 17 Sep 2024
Enriching Datasets with Demographics through Large Language Models: What's in a Name? Khaled AlNuaimi Gautier Marti Mathieu Ravaut Abdulla Alketbi Andreas Henschel Raed Jaradat 68 1 0 17 Sep 2024
Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement Simon Yu Liangyu Chen Sara Ahmadian Marzieh Fadaee 80 7 0 17 Sep 2024
SOAP: Improving and Stabilizing Shampoo using Adam Nikhil Vyas Depen Morwani Rosie Zhao Itai Shapira David Brandfonbrener Lucas Janson Sham Kakade 169 38 0 17 Sep 2024
Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models Divij Gupta Anubhav Bhatti Surajsinh Parmar AI4TS 87 2 0 17 Sep 2024
Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5 Marcel Lamott Muhammad Armaghan Shakir 70 0 0 17 Sep 2024
Evaluating the Impact of Compression Techniques on Task-Specific Performance of Large Language Models Bishwash Khanal Jeffery M. Capone 94 1 0 17 Sep 2024
Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction Erwin D. López Z. Cheng Tang Atsushi Shimada 48 1 0 17 Sep 2024
Chain-of-Thought Prompting for Speech Translation Ke Hu Zhehuai Chen Chao-Han Huck Yang Piotr Żelasko Oleksii Hrinchuk Vitaly Lavrukhin Jagadeesh Balam Boris Ginsburg LRM 173 9 0 17 Sep 2024
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse Maojia Song Shang Hong Sim Rishabh Bhardwaj Hai Leong Chieu Navonil Majumder Soujanya Poria 132 12 0 17 Sep 2024
Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models Bingchen Liu Ehsan Akhgari Alexander Visheratin Aleks Kamko Linmiao Xu Shivam Shrirao Joao Souza Suhail Doshi Daiqing Li DiffM MLLM 111 60 0 16 Sep 2024
FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models Luca Comanducci Paolo Bestagini Stefano Tubaro 69 7 0 16 Sep 2024
Exploring Fine-tuned Generative Models for Keyphrase Selection: A Case Study for Russian Anna Glazkova Dmitry A. Morozov 59 1 0 16 Sep 2024