Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.04304
Cited By
A Deep Reinforced Model for Abstractive Summarization
11 May 2017
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Deep Reinforced Model for Abstractive Summarization"
50 / 713 papers shown
Title
Bridging Research and Readers: A Multi-Modal Automated Academic Papers Interpretation System
Feng Jiang
Kuang Wang
Haizhou Li
21
3
0
17 Jan 2024
Make Them Spill the Beans! Coercive Knowledge Extraction from (Production) LLMs
Zhuo Zhang
Guangyu Shen
Guanhong Tao
Shuyang Cheng
Xiangyu Zhang
41
13
0
08 Dec 2023
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Marwa Abdulhai
Isadora White
Charles Burton Snell
Charles Sun
Joey Hong
Yuexiang Zhai
Kelvin Xu
Sergey Levine
LLMAG
OffRL
LRM
39
31
0
30 Nov 2023
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
Swaroop Nath
H. Khadilkar
Pushpak Bhattacharyya
OffRL
23
0
0
29 Nov 2023
STEER: Unified Style Transfer with Expert Reinforcement
Skyler Hallinan
Faeze Brahman
Ximing Lu
Jaehun Jung
Sean Welleck
Yejin Choi
OffRL
13
14
0
13 Nov 2023
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations
Joey Hong
Sergey Levine
Anca Dragan
OffRL
LLMAG
42
24
0
09 Nov 2023
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
37
0
0
03 Nov 2023
Boosting Summarization with Normalizing Flows and Aggressive Training
Yu Yang
Xiaotong Shen
AI4CE
TPM
24
0
0
01 Nov 2023
Vanishing Gradients in Reinforcement Finetuning of Language Models
Noam Razin
Hattie Zhou
Omid Saremi
Vimal Thilak
Arwen Bradley
Preetum Nakkiran
Josh Susskind
Etai Littwin
18
7
0
31 Oct 2023
Beyond MLE: Convex Learning for Text Generation
Chenze Shao
Zhengrui Ma
Min Zhang
Yang Feng
30
3
0
26 Oct 2023
Follow-on Question Suggestion via Voice Hints for Voice Assistants
B. Fetahu
Pedro Faustini
Giuseppe Castellucci
Anjie Fang
Oleg Rokhlenko
S. Malmasi
20
2
0
25 Oct 2023
SuperHF: Supervised Iterative Learning from Human Feedback
Gabriel Mukobi
Peter Chatain
Su Fong
Robert Windesheim
Gitta Kutyniok
Kush S. Bhatia
Silas Alberti
ALM
42
6
0
25 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
27
4
0
24 Oct 2023
Enhancing Abstractiveness of Summarization Models through Calibrated Distillation
Hwanjun Song
Igor Shalyminov
Hang Su
Siffi Singh
Kaisheng Yao
Saab Mansour
30
6
0
20 Oct 2023
Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets
Han Jiang
Rui Wang
Zhihua Wei
Yu Li
Xinpeng Wang
37
4
0
20 Oct 2023
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS
AILaw
38
3
0
13 Oct 2023
Goodhart's Law in Reinforcement Learning
Jacek Karwowski
Oliver Hayman
Xingjian Bai
Klaus Kiendlhofer
Charlie Griffin
Joar Skalse
34
9
0
13 Oct 2023
Calibrating Likelihoods towards Consistency in Summarization Models
Polina Zablotskaia
Misha Khalman
Rishabh Joshi
Livio Baldini Soares
Shoshana Jakobovits
Joshua Maynez
Shashi Narayan
31
3
0
12 Oct 2023
A Brief History of Prompt: Leveraging Language Models. (Through Advanced Prompting)
G. Muktadir
SILM
34
8
0
30 Sep 2023
Unsupervised Multi-document Summarization with Holistic Inference
Haopeng Zhang
Sangwoo Cho
Kaiqiang Song
Xiaoyang Wang
Hongwei Wang
Jiawei Zhang
Dong Yu
21
3
0
08 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAI
OffRL
32
16
0
02 Sep 2023
Transformers as Support Vector Machines
Davoud Ataee Tarzanagh
Yingcong Li
Christos Thrampoulidis
Samet Oymak
48
43
0
31 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
53
10
0
28 Aug 2023
Inducing Causal Structure for Abstractive Text Summarization
Luyao Chen
Ruqing Zhang
Wei Huang
Wei Chen
J. Guo
Xueqi Cheng
CML
21
1
0
24 Aug 2023
Prompt-Based Length Controlled Generation with Reinforcement Learning
Renlong Jie
Xiaojun Meng
Lifeng Shang
Xin Jiang
Qun Liu
19
8
0
23 Aug 2023
Discrete Prompt Compression with Reinforcement Learning
Hoyoun Jung
Kyung-Joong Kim
32
24
0
17 Aug 2023
A new solution and concrete implementation steps for Artificial General Intelligence
Yong-Hua Chen
Ting Zeng
Jun Zhang
34
0
0
12 Aug 2023
Neural Conversation Models and How to Rein Them in: A Survey of Failures and Fixes
Fabian Galetzka
Anne Beyer
David Schlangen
AI4CE
32
1
0
11 Aug 2023
Redundancy Aware Multi-Reference Based Gainwise Evaluation of Extractive Summarization
Mousumi Akter
Shubhra (Santu) Karmaker
23
1
0
04 Aug 2023
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
40
20
0
31 Jul 2023
DRL4Route: A Deep Reinforcement Learning Framework for Pick-up and Delivery Route Prediction
Xiaowei Mao
Haomin Wen
Hengrui Zhang
Huaiyu Wan
Lixia Wu
Jianbin Zheng
Haoyuan Hu
Youfang Lin
AI4TS
72
12
0
30 Jul 2023
Uncertainty in Natural Language Generation: From Theory to Applications
Joris Baan
Nico Daheim
Evgenia Ilia
Dennis Ulmer
Haau-Sing Li
Raquel Fernández
Barbara Plank
Rico Sennrich
Chrysoula Zerva
Wilker Aziz
UQLM
34
40
0
28 Jul 2023
f-Divergence Minimization for Sequence-Level Knowledge Distillation
Yuqiao Wen
Zichao Li
Wenyu Du
Lili Mou
32
53
0
27 Jul 2023
On the Effectiveness of Offline RL for Dialogue Response Generation
Paloma Sodhi
Felix Wu
Ethan R. Elenberg
Kilian Q. Weinberger
Ryan T. McDonald
OffRL
19
5
0
23 Jul 2023
BUS:Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization
Chaoya Jiang
Haiyang Xu
Wei Ye
Qinghao Ye
Chenliang Li
Mingshi Yan
Bin Bi
Shikun Zhang
Fei Huang
Songfang Huang
VLM
34
9
0
17 Jul 2023
Advancements in Scientific Controllable Text Generation Methods
Arnav Goel
Medha Hira
Avinash Anand
Siddhesh Bangar
R. Shah
27
7
0
08 Jul 2023
Mitigating the Learning Bias towards Repetition by Self-Contrastive Training for Open-Ended Generation
Jian Guan
Minlie Huang
32
0
0
04 Jul 2023
Max-Margin Token Selection in Attention Mechanism
Davoud Ataee Tarzanagh
Yingcong Li
Xuechen Zhang
Samet Oymak
40
38
0
23 Jun 2023
Semi-Offline Reinforcement Learning for Optimized Text Generation
Changyu Chen
Xiting Wang
Yiqiao Jin
Victor Ye Dong
Li Dong
Jie Cao
Yi Liu
Rui Yan
OffRL
21
15
0
16 Jun 2023
CUED at ProbSum 2023: Hierarchical Ensemble of Summarization Models
Potsawee Manakul
Yassir Fathullah
Adian Liusie
Vyas Raina
Vatsal Raina
Mark Gales
29
12
0
08 Jun 2023
Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization
M. Trabelsi
H. Uzunalioglu
16
1
0
07 Jun 2023
Towards End-to-end Speech-to-text Summarization
Raul Monteiro
Diogo Pernes
9
1
0
06 Jun 2023
Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning
Peggy Tang
Junbin Gao
Lei Zhang
Zhiyong Wang
27
1
0
06 Jun 2023
Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Banghua Zhu
Hiteshi Sharma
Felipe Vieira Frujeri
Shi Dong
Chenguang Zhu
Michael I. Jordan
Jiantao Jiao
OSLM
28
39
0
04 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
29
23
0
01 Jun 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
72
3,398
0
29 May 2023
Zero-shot Visual Question Answering with Language Model Feedback
Yifan Du
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
21
13
0
26 May 2023
Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization
Pranav Ajit Nair
Sukomal Pal
Pradeepika Verm
MoMe
34
2
0
26 May 2023
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
Ximing Lu
Faeze Brahman
Peter West
Jaehun Jang
Khyathi Raghavi Chandu
...
Bill Yuchen Lin
Skyler Hallinan
Xiang Ren
Sean Welleck
Yejin Choi
28
26
0
24 May 2023
Alt-Text with Context: Improving Accessibility for Images on Twitter
Nikita Srivatsan
Sofia Samaniego
Omar U. Florez
Taylor Berg-Kirkpatrick
22
3
0
24 May 2023
Previous
1
2
3
4
5
...
13
14
15
Next