ResearchTrend.AI
v1, v2, v3, v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,973 papers shown
Title
GOLD: Geometry Problem Solver with Natural Language Description
Jiaxin Zhang
Yashar Moshfeghi
ReLM
41
5
0
01 May 2024
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
Yuxi Xie
Anirudh Goyal
Wenyue Zheng
Min-Yen Kan
Timothy Lillicrap
Kenji Kawaguchi
Michael Shieh
ReLM, LRM
152
126
0
01 May 2024
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models
Hongzhan Lin
Zixin Chen
Ziyang Luo
Mingfei Cheng
Jing Ma
Guang Chen
103
6
0
01 May 2024
DFKI-NLP at SemEval-2024 Task 2: Towards Robust LLMs Using Data Perturbations and MinMax Training
Bhuvanesh Verma
Lisa Raithel
54
1
0
01 May 2024
Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language Models
Alireza Salemi
Hamed Zamani
79
5
0
30 Apr 2024
GUing: A Mobile GUI Search Engine using a Vision-Language Model
Jialiang Wei
A. Courbis
Thomas Lambolais
Binbin Xu
P. Bernard
Gérard Dray
Walid Maalej
DiffM, CLIP
82
6
0
30 Apr 2024
ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents
Hoang-Thang Ta
Abu Bakar Siddiqur Rahman
Lotfollah Najjar
Alexander Gelbukh
27
0
0
30 Apr 2024
Sõnajaht: Definition Embeddings and Semantic Search for Reverse Dictionary Creation
Aleksei Dorkin
Kairit Sirts
111
2
0
30 Apr 2024
Multi-hop Question Answering over Knowledge Graphs using Large Language Models
Abir Chakraborty
KELM, RALM
97
6
0
30 Apr 2024
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
Yucheng Hu
Yuxing Lu
RALM
138
25
0
30 Apr 2024
Q-Newton: Hybrid Quantum-Classical Scheduling for Accelerating Neural Network Training with Newton's Gradient Descent
Pingzhi Li
Junyu Liu
Hanrui Wang
Tianlong Chen
219
2
0
30 Apr 2024
Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism
Lei Kang
Rubèn Pérez Tito
Ernest Valveny
Dimosthenis Karatzas
71
5
0
29 Apr 2024
Spivavtor: An Instruction Tuned Ukrainian Text Editing Model
Aman Saini
Artem Chernodub
Vipul Raheja
Vivek Kulkarni
56
3
0
29 Apr 2024
It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments
Petter Maehlum
David Samuel
R. Norman
Elma Jelin
Oyvind Bjertnaes
Lilja Ovrelid
Erik Velldal
73
4
0
29 Apr 2024
Benchmarking Benchmark Leakage in Large Language Models
Ruijie Xu
Zengzhi Wang
Run-Ze Fan
Pengfei Liu
131
54
0
29 Apr 2024
Credible, Unreliable or Leaked?: Evidence Verification for Enhanced Automated Fact-checking
Zacharias Chrysidis
Stefanos-Iordanis Papadopoulos
Symeon Papadopoulos
P. Petrantonakis
81
9
0
29 Apr 2024
PoPE: Legendre Orthogonal Polynomials Based Position Encoding for Large Language Models
Arpit Aggarwal
55
0
0
29 Apr 2024
LangBiTe: A Platform for Testing Bias in Large Language Models
Sergio Morales
Robert Clarisó
Jordi Cabot
43
2
0
29 Apr 2024
Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
Donggyun Kim
Seongwoong Cho
Semin Kim
Chong Luo
Seunghoon Hong
VLM
90
3
0
29 Apr 2024
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
Ran Xu
Wenqi Shi
Yue Yu
Yuchen Zhuang
Yanqiao Zhu
M. D. Wang
Joyce C. Ho
Chao Zhang
Carl Yang
LM&MA
103
25
0
29 Apr 2024
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
Xinyu Ma
Xuebo Liu
Derek F. Wong
Jun Rao
Bei Li
Liang Ding
Lidia S. Chao
Dacheng Tao
Min Zhang
65
3
0
29 Apr 2024
ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images
Huy Quang Pham
Thang Kien-Bao Nguyen
Quan Van Nguyen
Dan Quang Tran
Nghia Hieu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
97
4
0
29 Apr 2024
Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions
Jordan Meadows
Tamsin James
André Freitas
ReLM, LRM, AI4CE
82
1
0
29 Apr 2024
Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Yongjin Yang
Sihyeon Kim
Sangmook Kim
Gyubok Lee
Se-Young Yun
Edward Choi
78
2
0
29 Apr 2024
ir_explain: a Python Library of Explainable IR Methods
Siyang Song
Harsh Agarwal
Venktesh V
Avishek Anand
Swastik Mohanty
Debapriyo Majumdar
Mandar Mitra
XAI
146
1
0
29 Apr 2024
Towards Incremental Learning in Large Language Models: A Critical Review
M. Jovanovic
Peter Voss
ELM, CLL, KELM
126
5
0
28 Apr 2024
Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin
Pin-Jie Lin
Merel C. J. Scholman
Muhammed Saeed
Vera Demberg
96
3
0
28 Apr 2024
SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning
Jinghan Jia
Yihua Zhang
Yimeng Zhang
Jiancheng Liu
Bharat Runwal
James Diffenderfer
B. Kailkhura
Sijia Liu
MU
204
50
0
28 Apr 2024
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model
Zhengpeng Shi
Haoran Luo
LRM, ALM
93
2
0
28 Apr 2024
LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing
Zeyang Ma
A. Chen
Dong Jae Kim
Tse-Hsun Chen
Shaowei Wang
90
56
0
27 Apr 2024
Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering
Chenhao Cui
Yufan Jiang
Shuangzhi Wu
Zhoujun Li
FaML
65
0
0
27 Apr 2024
Instance-free Text to Point Cloud Localization with Relative Position Awareness
Lichao Wang
Zhihao Yuan
Jinke Ren
Shuguang Cui
Zhen Li
123
0
0
27 Apr 2024
Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction
Guozheng Li
Peng Wang
Wenjun Ke
Yikai Guo
Ke Ji
Ziyu Shang
Jiajun Liu
Zijie Xu
LRM, ReLM
107
5
0
27 Apr 2024
Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors
Guozheng Li
Peng Wang
Jiajun Liu
Yikai Guo
Ke Ji
Ziyu Shang
Zijie Xu
LRM
106
10
0
27 Apr 2024
Empirical Analysis of Dialogue Relation Extraction with Large Language Models
Guozheng Li
Zijie Xu
Ziyu Shang
Jiajun Liu
Ke Ji
Yikai Guo
97
2
0
27 Apr 2024
Building a Large Japanese Web Corpus for Large Language Models
Naoaki Okazaki
Kakeru Hattori
Hirai Shota
Hiroki Iida
Masanari Ohi
Kazuki Fujii
Taishi Nakamura
Mengsay Loem
Rio Yokota
Sakae Mizuki
110
7
0
27 Apr 2024
From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets
Manuel Tonneau
Diyi Liu
Samuel Fraiberger
Ralph Schroeder
Scott A. Hale
Paul Röttger
121
7
0
27 Apr 2024
Temporal Scaling Law for Large Language Models
Yizhe Xiong
Xiansheng Chen
Xin Ye
Hui Chen
Zijia Lin
...
Zhenpeng Su
Wei Huang
Jianwei Niu
Jiawei Han
Guiguang Ding
131
10
0
27 Apr 2024
CEval: A Benchmark for Evaluating Counterfactual Text Generation
Van Bach Nguyen
Jorg Schlotterer
Christin Seifert
107
7
0
26 Apr 2024
Making Better Use of Unlabelled Data in Bayesian Active Learning
Freddie Bickford-Smith
Adam Foster
Tom Rainforth
110
4
0
26 Apr 2024
Neuro-Symbolic Embedding for Short and Effective Feature Selection via Autoregressive Generation
Nanxu Gong
Wangyang Ying
Dongjie Wang
Yanjie Fu
130
12
0
26 Apr 2024
Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study
Yang Wu
Yao Wan
Hongyu Zhang
Yulei Sui
Wucai Wei
Wei Zhao
Guandong Xu
Hai Jin
60
25
0
26 Apr 2024
A Survey of Generative Search and Recommendation in the Era of Large Language Models
Chak Tou Leong
Xinyu Lin
Wenjie Wang
Fuli Feng
Liang Pang
Wenjie Li
Liqiang Nie
Xiangnan He
Tat-Seng Chua
3DV, LRM
101
9
0
25 Apr 2024
Make Your LLM Fully Utilize the Context
Shengnan An
Zexiong Ma
Zeqi Lin
Nanning Zheng
Jian-Guang Lou
SyDa
178
67
0
25 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLL, KELM, LRM
167
88
0
25 Apr 2024
TELA: Text to Layer-wise 3D Clothed Human Generation
Junting Dong
Qi Fang
Zehuan Huang
Xudong Xu
Jingbo Wang
Sida Peng
Bo Dai
3DH
69
10
0
25 Apr 2024
ProbGate at EHRSQL 2024: Enhancing SQL Query Generation Accuracy through Probabilistic Threshold Filtering and Error Handling
Sangryul Kim
Donghee Han
Sehyun Kim
83
3
0
25 Apr 2024
Tele-FLM Technical Report
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Chao Wang
...
Yequan Wang
Zhongjiang He
Zhongyuan Wang
Xuelong Li
Tiejun Huang
83
4
0
25 Apr 2024
Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud
Ayumu Saito
Prachi Kudeshia
Jiju Poovvancheri
3DPC
174
9
0
25 Apr 2024
Asking and Answering Questions to Extract Event-Argument Structures
Md Nayem Uddin
Enfa Rose George
Eduardo Blanco
Steven Corman
76
3
0
25 Apr 2024