MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation
of VideosComputer Vision and Pattern Recognition (CVPR), 2023 |
TCR: Short Video Title Generation and Cover Selection with Attention
RefinementPacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2023 |
Align and Attend: Multimodal Summarization with Dual Contrastive LossesComputer Vision and Pattern Recognition (CVPR), 2023 |
Grafting Pre-trained Models for Multimodal Headline GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
Vision Guided Generative Pre-trained Language Models for Multimodal
Abstractive SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |