1801.10198
Cited By
Generating Wikipedia by Summarizing Long Sequences
30 January 2018
Peter J. Liu
Mohammad Saleh
Etienne Pot
Ben Goodrich
Ryan Sepassi
Lukasz Kaiser
Noam M. Shazeer
CVBM
Papers citing
"Generating Wikipedia by Summarizing Long Sequences"
50 / 199 papers shown
Title
Small Clips, Big Gains: Learning Long-Range Refocused Temporal Information for Video Super-Resolution
Xingyu Zhou
Wei Long
Jingbo Lu
Shiyin Jiang
Weiyi You
Haifeng Wu
Shuhang Gu
48
0
0
04 May 2025
AttentionDefense: Leveraging System Prompt Attention for Explainable Defense Against Novel Jailbreaks
Charlotte Siska
Anush Sankaran
AAML
50
0
0
10 Apr 2025
Advancements in Natural Language Processing for Automatic Text Summarization
Nevidu Jayatilleke
Ruvan Weerasinghe
Nipuna Senanayake
201
1
0
27 Feb 2025
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Zekun Xi
Wenbiao Yin
Jizhan Fang
Jialong Wu
Runnan Fang
N. Zhang
Jiang Yong
Pengjun Xie
Fei Huang
Huajun Chen
SyDa
LRM
114
7
0
21 Feb 2025
EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Mengna Zhu
Kaisheng Zeng
Mao Wang
Kaiming Xiao
Lei Hou
Hongbin Huang
Juanzi Li
271
1
0
16 Dec 2024
All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling
Emanuele Marconato
Sébastien Lachapelle
Sebastian Weichwald
Luigi Gresele
69
3
0
30 Oct 2024
More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding
Yuan Tang
Xu Han
Xianzhi Li
Qiao Yu
Jinfeng Xu
Yixue Hao
Long Hu
Min Chen
53
1
0
28 Aug 2024
Leveraging Entailment Judgements in Cross-Lingual Summarisation
Huajian Zhang
Laura Perez-Beltrachini
HILM
44
0
0
01 Aug 2024
VoCo-LLaMA: Towards Vision Compression with Large Language Models
Xubing Ye
Yukang Gan
Xiaoke Huang
Yixiao Ge
Yansong Tang
MLLM
VLM
43
23
0
18 Jun 2024
Scientific Hypothesis Generation by a Large Language Model: Laboratory Validation in Breast Cancer Treatment
A. Abdel-Rehim
Hector Zenil
Oghenejokpeme I. Orhobor
Marie Fisher
Ross J. Collins
...
Gareth W. Fearnley
Emma Tate
Holly X. Smith
Larisa B. Soldatova
Ross D. King
LM&MA
74
5
0
20 May 2024
Transformers as Transducers
Lena Strobl
Dana Angluin
David Chiang
Jonathan Rawski
Ashish Sabharwal
31
5
0
02 Apr 2024
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
Yijia Shao
Yucheng Jiang
Theodore A. Kanell
Peter Xu
Omar Khattab
Monica S. Lam
LLMAG
KELM
44
35
0
22 Feb 2024
How Smooth Is Attention?
Valérie Castin
Pierre Ablin
Gabriel Peyré
AAML
40
9
0
22 Dec 2023
Object Recognition as Next Token Prediction
Kaiyu Yue
Borchun Chen
Jonas Geiping
Hengduo Li
Tom Goldstein
Ser-Nam Lim
40
9
0
04 Dec 2023
PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization
Joseph Peper
Wenzhao Qiu
Lu Wang
30
0
0
16 Nov 2023
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS
AILaw
38
3
0
13 Oct 2023
Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation
Chen Dun
Mirian Hipolito Garcia
Guoqing Zheng
Ahmed Hassan Awadallah
Anastasios Kyrillidis
Robert Sim
90
6
0
04 Oct 2023
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Lucas D. Lingle
36
15
0
28 Sep 2023
Small-scale proxies for large-scale Transformer training instabilities
Mitchell Wortsman
Peter J. Liu
Lechao Xiao
Katie Everett
A. Alemi
...
Jascha Narain Sohl-Dickstein
Kelvin Xu
Jaehoon Lee
Justin Gilmer
Simon Kornblith
40
86
0
25 Sep 2023
Multi-document Summarization: A Comparative Evaluation
Kushan Hewapathirana
Nisansa de Silva
ELM
16
3
0
10 Sep 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Ming-Yu Liu
Peng Xu
M. Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
37
33
0
15 Aug 2023
Arithmetic with Language Models: from Memorization to Computation
Davide Maltoni
Matteo Ferrara
KELM
LRM
45
5
0
02 Aug 2023
In-Context Learning Learns Label Relationships but Is Not Conventional Learning
Jannik Kossen
Y. Gal
Tom Rainforth
44
30
0
23 Jul 2023
Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis
Zhiyu Jin
Xuli Shen
Bin Li
Xiangyang Xue
34
36
0
14 Jun 2023
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Kathleen McKeown
Noémie Elhadad
18
10
0
28 May 2023
A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation
Xipin Wei
Junhui Chen
Zirui Zheng
Li Guo
Lantian Li
Dong Wang
27
3
0
26 May 2023
Large Language Models are Not Yet Human-Level Evaluators for Abstractive Summarization
Chenhui Shen
Liying Cheng
Xuan-Phi Nguyen
Yang You
Lidong Bing
ELM
ALM
47
64
0
22 May 2023
A Hierarchical Encoding-Decoding Scheme for Abstractive Multi-document Summarization
Chenhui Shen
Liying Cheng
Xuan-Phi Nguyen
Yang You
Lidong Bing
30
10
0
15 May 2023
Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts
Haochen Tan
Han Wu
Wei Shao
Xinyun Zhang
Mingjie Zhan
Zhaohui Hou
Ding Liang
Linqi Song
47
0
0
13 May 2023
The Current State of Summarization
Fabian Retkowski
23
6
0
08 May 2023
Learning to Compress Prompts with Gist Tokens
Jesse Mu
Xiang Lisa Li
Noah D. Goodman
VLM
53
207
0
17 Apr 2023
Improving Autoregressive NLP Tasks via Modular Linearized Attention
Victor Agostinelli
Lizhong Chen
27
1
0
17 Apr 2023
An Iterative Optimizing Framework for Radiology Report Summarization with ChatGPT
Chong Ma
Zihao Wu
Jiaqi Wang
Shaochen Xu
Yaonai Wei
...
Tuo Zhang
Dajiang Zhu
Dinggang Shen
Tianming Liu
Xiang Li
MedIm
LM&MA
47
97
0
17 Apr 2023
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks
Weicheng Kuo
A. Piergiovanni
Dahun Kim
Xiyang Luo
Benjamin Caine
...
Luowei Zhou
Andrew M. Dai
Zhifeng Chen
Claire Cui
A. Angelova
MLLM
VLM
37
23
0
29 Mar 2023
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures
Zirui Fu
Aleksandre Avaliani
M. Donato
46
1
0
25 Mar 2023
XWikiGen: Cross-lingual Summarization for Encyclopedic Text Generation in Low Resource Languages
Dhaval Taunk
Shivprasad Sagare
Anupam Patil
Shivansh Subramanian
Manish Gupta
Vasudeva Varma
25
3
0
22 Mar 2023
RIOT: Recursive Inertial Odometry Transformer for Localisation from Low-Cost IMU Measurements
James Brotchie
Wenchao Li
A. Greentree
A. Kealy
30
8
0
03 Mar 2023
Elementwise Language Representation
Du-Yeong Kim
Jeeeun Kim
36
0
0
27 Feb 2023
ChatGPT: Jack of all trades, master of none
Jan Kocoń
Igor Cichecki
Oliwier Kaszyca
Mateusz Kochanek
Dominika Szydło
...
Maciej Piasecki
Lukasz Radliński
Konrad Wojtasik
Stanislaw Woźniak
Przemyslaw Kazienko
AI4MH
42
528
0
21 Feb 2023
Generating a Structured Summary of Numerous Academic Papers: Dataset and Method
Shuaiqi Liu
Jiannong Cao
Ruosong Yang
Zhiyuan Wen
51
16
0
09 Feb 2023
Efficient Attention via Control Variates
Lin Zheng
Jianbo Yuan
Chong-Jun Wang
Lingpeng Kong
34
18
0
09 Feb 2023
Long Text and Multi-Table Summarization: Dataset and Method
Shuaiqi Liu
Jiannong Cao
Ruosong Yang
Zhiyuan Wen
RALM
27
21
0
08 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViT
MedIm
AI4TS
AI4CE
50
10
0
01 Feb 2023
Do Multi-Document Summarization Models Synthesize?
Jay DeYoung
Stephanie C. Martinez
Iain J. Marshall
Byron C. Wallace
24
8
0
31 Jan 2023
Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing
Shruthi Bannur
Stephanie L. Hyland
Qianchu Liu
Fernando Pérez-García
Maximilian Ilse
...
Maria T. A. Wetscherek
M. Lungren
A. Nori
Javier Alvarez-Valle
Ozan Oktay
36
115
0
11 Jan 2023
OASum: Large-Scale Open Domain Aspect-based Summarization
Xianjun Yang
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
Xiaoman Pan
Linda R. Petzold
Dong Yu
RALM
26
24
0
19 Dec 2022
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem
Chenyi Yu
Weinan Zhang
H. Lai
Zheng Tian
L. Kneip
Jun Wang
33
15
0
18 Dec 2022
SumREN: Summarizing Reported Speech about Events in News
R. Reddy
Heba Elfardy
Hou Pong Chan
Kevin Small
Chenhui Xu
28
5
0
02 Dec 2022
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems
Kumar Shridhar
Jakub Macina
Mennatallah El-Assady
Tanmay Sinha
Manu Kapur
Mrinmaya Sachan
AIMat
41
46
0
23 Nov 2022
Exploring the Efficacy of Pre-trained Checkpoints in Text-to-Music Generation Task
Shangda Wu
Maosong Sun
17
20
0
21 Nov 2022