Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,973 papers shown

Title
Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming Tommaso Pasini Alejo López-Ávila Husam Quteineh Gerasimos Lampouras Jinhua Du Yubing Wang Ze Li Yusen Sun 70 0 0 08 May 2024
Critical Infrastructure Protection: Generative AI, Challenges, and Opportunities Yagmur Yigit M. Ferrag Iqbal H. Sarker Leandros A. Maglaras Christos Chrysoulas Naghmeh Moradpoor Helge Janicke 65 8 0 08 May 2024
APrompt4EM: Augmented Prompt Tuning for Generalized Entity Matching Yikuan Xia Jiazun Chen Xinchi Li Jun Gao VLM 120 3 0 08 May 2024
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models Prannay Kaul Zhizhong Li Hao Yang Yonatan Dukler Ashwin Swaminathan C. Taylor Stefano Soatto HILM 174 18 0 08 May 2024
Large Language Models for Cyber Security: A Systematic Literature Review HanXiang Xu Shenao Wang Ningke Li Kaidi Wang Yanjie Zhao Kai Chen Ting Yu Yang Liu Haoyu Wang 139 43 0 08 May 2024
Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking Emre Can Acikgoz Mete Erdogan Deniz Yuret 86 8 0 07 May 2024
Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense Siqi Shen Lajanugen Logeswaran Moontae Lee Honglak Lee Soujanya Poria Rada Mihalcea AI4MH LRM ELM 115 33 0 07 May 2024
Switchable Decision: Dynamic Neural Generation Networks Shujian Zhang Korawat Tanwisuth Chengyue Gong Pengcheng He Mi Zhou BDL 77 0 0 07 May 2024
Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks Georgios Pantazopoulos Amit Parekh Malvina Nikandrou Alessandro Suglia 115 5 0 07 May 2024
Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask Learning Sayantan Pal Souvik Das Rohini Srihari 63 1 0 07 May 2024
Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation Ryan Wong Necati Cihan Camgöz Richard Bowden SLR 104 26 0 07 May 2024
Evaluating Text Summaries Generated by Large Language Models Using OpenAI's GPT Hassan Shakil Atqiya Munawara Mahi Phuoc Nguyen Zeydy Ortiz M. Mardini ELM 62 6 0 07 May 2024
Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize Hallucinations Hassan Shakil Zeydy Ortiz Grant C. Forbes 101 3 0 07 May 2024
Long Context Alignment with Short Instructions and Synthesized Positions Wenhao Wu Yizhong Wang Yao Fu Xiang Yue Dawei Zhu Sujian Li SyDa 86 19 0 07 May 2024
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization Tianyi Zhang Jonah Yi Zhaozhuo Xu Anshumali Shrivastava MQ 68 32 0 07 May 2024
FlashBack: Efficient Retrieval-Augmented Language Modeling for Long Context Inference Runheng Liu Xingchen Xiao Heyan Huang Zewen Chi Zhijing Wu RALM KELM 89 0 0 07 May 2024
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore Junchao Wu Runzhe Zhan Derek F. Wong Shu Yang Xuebo Liu Lidia S. Chao Min Zhang DeLMO 125 5 0 07 May 2024
Self-Improving Customer Review Response Generation Based on LLMs Guy Azov Tatiana Pelc Adi Fledel Alon Gila Kamhi 76 2 0 06 May 2024
Position: Leverage Foundational Models for Black-Box Optimization Xingyou Song Yingtao Tian Robert Tjarko Lange Chansoo Lee Yujin Tang Yutian Chen 99 9 0 06 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond Zheng Zhu Xiaofeng Wang Wangbo Zhao Chen Min Nianchen Deng ... Dawei Zhao Liang Xiao Jian-jun Zhao Jiwen Lu Guan Huang VGen LM&Ro 187 48 0 06 May 2024
Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval Jiacheng Cheng Hijung Valentina Shin Nuno Vasconcelos Bryan C. Russell Fabian Caba Heilbron VLM 70 1 0 06 May 2024
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training Zexuan Zhong Mengzhou Xia Danqi Chen Mike Lewis MoE 110 19 0 06 May 2024
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform Ziqi Gao Qichao Wang Aochuan Chen Zijing Liu Bingzhe Wu Liang Chen Jia Li 105 35 0 05 May 2024
SkelCap: Automated Generation of Descriptive Text from Skeleton Keypoint Sequences Ali Emre Keskin H. Keles SLR 63 0 0 05 May 2024
Enabling Patient-side Disease Prediction via the Integration of Patient Narratives Zhixiang Su Yinan Zhang Jiazheng Jing Jie Xiao Zhiqi Shen 42 0 0 05 May 2024
Data-Efficient Molecular Generation with Hierarchical Textual Inversion Seojin Kim Jaehyun Nam Sihyun Yu Younghoon Shin Jinwoo Shin 137 3 0 05 May 2024
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization Hamed Zamani Michael Bendersky 120 29 0 05 May 2024
Assessing Adversarial Robustness of Large Language Models: An Empirical Study Zeyu Yang Zhao Meng Xiaochen Zheng Roger Wattenhofer ELM AAML 93 10 0 04 May 2024
Large Language Models estimate fine-grained human color-concept associations Kushin Mukherjee Timothy T. Rogers Karen B. Schloss VLM 106 4 0 04 May 2024
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records Gyubok Lee Sunjun Kweon Seongsu Bae Edward Choi 66 2 0 04 May 2024
CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation Yaoyiran Li Xiang Zhai M. Alzantot Keyi Yu Ivan Vulić Anna Korhonen Mohamed Hammad 90 16 0 03 May 2024
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models Piotr Padlewski Max Bain Matthew Henderson Zhongkai Zhu Nishant Relan ... Che Zheng Cyprien de Masson d'Autume Dani Yogatama Mikel Artetxe Yi Tay VLM 152 27 0 03 May 2024
Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling Subhendu Khatuya Rajdeep Mukherjee Akash Ghosh Manjunath Hegde Koustuv Dasgupta Niloy Ganguly Saptarshi Ghosh Pawan Goyal 74 3 0 03 May 2024
Instruction-Guided Bullet Point Summarization of Long Financial Earnings Call Transcripts Subhendu Khatuya Koushiki Sinha Niloy Ganguly Saptarshi Ghosh Pawan Goyal 63 4 0 03 May 2024
Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset Hsuvas Borkakoty Luis Espinosa-Anke 91 1 0 03 May 2024
SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India Salam Michael Singh Shubhmoy Kumar Garg Amitesh Misra Aaditeshwar Seth Tanmoy Chakraborty 73 0 0 03 May 2024
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model Weiqi Zhang Jiexia Ye Ke Yi Yongzi Yu Ziyue Li Jia Li Fugee Tsung AI4TS AI4CE 103 29 0 03 May 2024
Understanding Position Bias Effects on Fairness in Social Multi-Document Summarization Olubusayo Olabisi Ameeta Agrawal 92 2 0 03 May 2024
Large Language Models for UAVs: Current State and Pathways to the Future Shumaila Javaid Nasir Saeed Bin He 102 26 0 02 May 2024
COPAL: Continual Pruning in Large Language Generative Models Srikanth Malla Joon Hee Choi Chiho Choi VLM CLL 92 2 0 02 May 2024
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance Kelvin C. K. Chan Yang Zhao Xuhui Jia Ming-Hsuan Yang Huisheng Wang 123 3 0 02 May 2024
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian Zhen Jia Ziyue Luo Yida Wang Chuan Wu AI4CE 61 4 0 02 May 2024
Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts Lotem Golany Filippo Galgani Maya Mamo Nimrod Parasol Omer Vandsburger Nadav Bar Ido Dagan 90 2 0 02 May 2024
Modeling Empathetic Alignment in Conversation Jiamin Yang David Jurgens 72 0 0 02 May 2024
SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models Burak Can Biner Farrin Marouf Sofian Umur Berkay Karakaş Duygu Ceylan Erkut Erdem Aykut Erdem 77 9 0 01 May 2024
Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media Gregorios A. Katsios Ning Sa Ankita Bhaumik T. Strzalkowski 78 0 0 01 May 2024
When Quantization Affects Confidence of Large Language Models? Irina Proskurina Luc Brun Guillaume Metzler Julien Velcin MQ 131 2 0 01 May 2024
Investigating Automatic Scoring and Feedback using Large Language Models G. Katuka Alexander Gain Yen-Yun Yu AI4Ed ALM 66 3 0 01 May 2024
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions Donghee Choi Mogan Gim Donghyeon Park Mujeen Sung Hyunjae Kim Jaewoo Kang Jihun Choi 77 1 0 01 May 2024
Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning Lucas-Andrei Thil Mirela Popa Gerasimos Spanakis LLMAG 49 2 0 01 May 2024