ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.05858
  4. Cited By
CTRL: A Conditional Transformer Language Model for Controllable
  Generation
v1v2 (latest)

CTRL: A Conditional Transformer Language Model for Controllable Generation

11 September 2019
N. Keskar
Bryan McCann
Lav Varshney
Caiming Xiong
R. Socher
    AI4CE
ArXiv (abs)PDFHTMLGithub (1885★)

Papers citing "CTRL: A Conditional Transformer Language Model for Controllable Generation"

50 / 85 papers shown
Title
Teaching Large Language Models to Reason through Learning and Forgetting
Teaching Large Language Models to Reason through Learning and Forgetting
Tianwei Ni
Allen Nie
Sapana Chaudhary
Yao Liu
Huzefa Rangwala
Rasool Fakoor
ReLMCLLLRM
468
0
0
15 Apr 2025
Looking beyond the next token
Looking beyond the next token
Abitha Thankaraj
Yiding Jiang
J. Zico Kolter
Yonatan Bisk
LRM
125
1
0
15 Apr 2025
SIFT-50M: A Large-Scale Multilingual Dataset for Speech Instruction Fine-Tuning
SIFT-50M: A Large-Scale Multilingual Dataset for Speech Instruction Fine-Tuning
Prabhat Pandey
Rupak Vignesh Swaminathan
K V Vijay Girish
Arunasish Sen
Jian Xie
Grant P. Strimel
Andreas Schwarz
442
2
0
12 Apr 2025
The Challenge of Achieving Attributability in Multilingual Table-to-Text Generation with Question-Answer Blueprints
The Challenge of Achieving Attributability in Multilingual Table-to-Text Generation with Question-Answer Blueprints
Aden Haussmann
LMTD
153
0
0
29 Mar 2025
Evolving Form and Function: Dual-Objective Optimization in Neural Symbolic Regression Networks
Amanda Bertschinger
James P. Bagrow
Joshua Bongard
146
1
0
24 Feb 2025
Repetition Neurons: How Do Language Models Produce Repetitions?
Repetition Neurons: How Do Language Models Produce Repetitions?
Tatsuya Hiraoka
Kentaro Inui
MILM
134
9
0
21 Feb 2025
Slamming: Training a Speech Language Model on One GPU in a Day
Slamming: Training a Speech Language Model on One GPU in a Day
Gallil Maimon
Avishai Elmakies
Yossi Adi
83
3
0
19 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
479
0
0
10 Feb 2025
High-Fidelity Simultaneous Speech-To-Speech Translation
High-Fidelity Simultaneous Speech-To-Speech Translation
Tom Labiausse
Laurent Mazaré
Edouard Grave
P. Pérez
Alexandre Défossez
Neil Zeghidour
480
1
0
05 Feb 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
138
4
0
03 Jan 2025
A Survey of Controllable Learning: Methods and Applications in Information Retrieval
A Survey of Controllable Learning: Methods and Applications in Information Retrieval
Chenglei Shen
Xiao Zhang
Teng Shi
Changshuo Zhang
Guofu Xie
Jun Xu
137
6
0
03 Jan 2025
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
Huy-Hien Vu
Huy Anh Nguyen
Adithya Ganesan
Swanie Juhng
Oscar Kjell
...
Margaret L. Kern
Ryan L. Boyd
L. Ungar
H. Andrew Schwartz
J. Eichstaedt
145
0
0
03 Jan 2025
Non-Halting Queries: Exploiting Fixed Points in LLMs
Non-Halting Queries: Exploiting Fixed Points in LLMs
Ghaith Hammouri
Kemal Derya
B. Sunar
68
0
0
08 Oct 2024
Selective Prompt Anchoring for Code Generation
Selective Prompt Anchoring for Code Generation
Yuan Tian
Tianyi Zhang
237
3
0
17 Aug 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
98
3
0
12 Jul 2024
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Maor Ivgi
Ori Yoran
Jonathan Berant
Mor Geva
HILM
118
8
0
08 Jul 2024
A Framework for Real-time Safeguarding the Text Generation of Large Language Model
A Framework for Real-time Safeguarding the Text Generation of Large Language Model
Ximing Dong
Dayi Lin
Shaowei Wang
Ahmed E. Hassan
123
1
0
29 Apr 2024
Constructing Highly Inductive Contexts for Dialogue Safety through
  Controllable Reverse Generation
Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation
Zhexin Zhang
Jiale Cheng
Hao Sun
Jiawen Deng
Fei Mi
Yasheng Wang
Lifeng Shang
Minlie Huang
SILM
141
9
0
04 Dec 2022
Why is constrained neural language generation particularly challenging?
Why is constrained neural language generation particularly challenging?
Cristina Garbacea
Qiaozhu Mei
121
15
0
11 Jun 2022
Computer-Generated Music for Tabletop Role-Playing Games
Computer-Generated Music for Tabletop Role-Playing Games
Lucas N. Ferreira
Levi H. S. Lelis
E. Whitehead
155
44
0
16 Aug 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
552
10,591
0
17 Feb 2020
How Decoding Strategies Affect the Verifiability of Generated Text
How Decoding Strategies Affect the Verifiability of Generated Text
Luca Massarelli
Fabio Petroni
Aleksandra Piktus
Myle Ott
Tim Rocktaschel
Vassilis Plachouras
Fabrizio Silvestri
Sebastian Riedel
118
50
0
09 Nov 2019
Pretrained AI Models: Performativity, Mobility, and Change
Pretrained AI Models: Performativity, Mobility, and Change
Lav Varshney
N. Keskar
R. Socher
60
20
0
07 Sep 2019
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models
Thomas Scialom
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
76
150
0
04 Sep 2019
SumQE: a BERT-based Summary Quality Estimation Model
SumQE: a BERT-based Summary Quality Estimation Model
Stratos Xenouleas
Prodromos Malakasiotis
Marianna Apidianaki
Ion Androutsopoulos
59
37
0
02 Sep 2019
Neural Text Summarization: A Critical Evaluation
Neural Text Summarization: A Critical Evaluation
Wojciech Kry'sciñski
N. Keskar
Bryan McCann
Caiming Xiong
R. Socher
90
367
0
23 Aug 2019
Neural Text Generation with Unlikelihood Training
Neural Text Generation with Unlikelihood Training
Sean Welleck
Ilia Kulikov
Stephen Roller
Emily Dinan
Kyunghyun Cho
Jason Weston
MU
68
583
0
12 Aug 2019
ELI5: Long Form Question Answering
ELI5: Long Form Question Answering
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MHELM
108
624
0
22 Jul 2019
Large Memory Layers with Product Keys
Large Memory Layers with Product Keys
Guillaume Lample
Alexandre Sablayrolles
MarcÁurelio Ranzato
Ludovic Denoyer
Hervé Jégou
MoE
67
135
0
10 Jul 2019
Tagged Back-Translation
Tagged Back-Translation
Isaac Caswell
Ciprian Chelba
David Grangier
123
220
0
15 Jun 2019
Unsupervised Question Answering by Cloze Translation
Unsupervised Question Answering by Cloze Translation
Patrick Lewis
Ludovic Denoyer
Sebastian Riedel
51
139
0
12 Jun 2019
Explain Yourself! Leveraging Language Models for Commonsense Reasoning
Explain Yourself! Leveraging Language Models for Commonsense Reasoning
Nazneen Rajani
Bryan McCann
Caiming Xiong
R. Socher
ReLMLRM
99
566
0
06 Jun 2019
Defending Against Neural Fake News
Defending Against Neural Fake News
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
140
1,032
0
29 May 2019
Generating Long Sequences with Sparse Transformers
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
136
1,919
0
23 Apr 2019
The Curious Case of Neural Text Degeneration
The Curious Case of Neural Text Degeneration
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
209
3,213
0
22 Apr 2019
Unifying Question Answering, Text Classification, and Regression via
  Span Extraction
Unifying Question Answering, Text Classification, and Regression via Span Extraction
N. Keskar
Bryan McCann
Caiming Xiong
R. Socher
BDL
62
21
0
19 Apr 2019
Filling Gender & Number Gaps in Neural Machine Translation with
  Black-box Context Injection
Filling Gender & Number Gaps in Neural Machine Translation with Black-box Context Injection
Amit Moryossef
Roee Aharoni
Yoav Goldberg
46
52
0
08 Mar 2019
Memory-Efficient Adaptive Optimization
Memory-Efficient Adaptive Optimization
Rohan Anil
Vineet Gupta
Tomer Koren
Y. Singer
ODL
79
49
0
30 Jan 2019
Cross-lingual Language Model Pretraining
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
116
2,751
0
22 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
263
3,747
0
09 Jan 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,324
0
11 Oct 2018
Model Cards for Model Reporting
Model Cards for Model Reporting
Margaret Mitchell
Simone Wu
Andrew Zaldivar
Parker Barnes
Lucy Vasserman
Ben Hutchinson
Elena Spitzer
Inioluwa Deborah Raji
Timnit Gebru
144
1,910
0
05 Oct 2018
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question
  Answering
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
215
2,703
0
25 Sep 2018
The Natural Language Decathlon: Multitask Learning as Question Answering
The Natural Language Decathlon: Multitask Learning as Question Answering
Bryan McCann
N. Keskar
Caiming Xiong
R. Socher
AIMatMLLMBDL
149
646
0
20 Jun 2018
A Simple Method for Commonsense Reasoning
A Simple Method for Commonsense Reasoning
Trieu H. Trinh
Quoc V. Le
LRMReLM
102
434
0
07 Jun 2018
Hierarchical Neural Story Generation
Hierarchical Neural Story Generation
Angela Fan
M. Lewis
Yann N. Dauphin
DiffM
183
1,631
0
13 May 2018
Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive
  Strategies
Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies
Max Grusky
Mor Naaman
Yoav Artzi
100
559
0
30 Apr 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
1.1K
7,201
0
20 Apr 2018
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Noam M. Shazeer
Mitchell Stern
ODL
86
1,053
0
11 Apr 2018
Fast Decoding in Sequence Models using Discrete Latent Variables
Fast Decoding in Sequence Models using Discrete Latent Variables
Łukasz Kaiser
Aurko Roy
Ashish Vaswani
Niki Parmar
Samy Bengio
Jakob Uszkoreit
Noam M. Shazeer
83
232
0
09 Mar 2018
12
Next