Attention-Guided Answer Distillation for Machine Reading Comprehension
arXiv:1808.07644, 23 August 2018
Minghao Hu, Yuxing Peng, Furu Wei, Zhen Huang, Dongsheng Li, Nan Yang, Ming Zhou
Tags: FaML

Papers citing "Attention-Guided Answer Distillation for Machine Reading Comprehension" (15 of 15 papers shown)

Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models (19 Aug 2024, 24 citations)
Aviv Bick, Kevin Y. Li, Eric P. Xing, J. Zico Kolter, Albert Gu
Tags: Mamba

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning (06 Mar 2023, 108 citations)
Zhen Wang, Yikang Shen, Leonid Karlinsky, Rogerio Feris, Huan Sun, Yoon Kim
Tags: VLM, VPVLM

Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework (16 Dec 2022, 3 citations)
Junzhuo Li, Xinwei Wu, Weilong Dong, Shuangzhi Wu, Chao Bian, Deyi Xiong

SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages (20 Oct 2022, 20 citations)
Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson, Laurent Besacier
Tags: VLM, MoE, LRM

Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing? (29 Jun 2022, 41 citations)
Keshigeyan Chandrasegaran, Ngoc-Trung Tran, Yunqing Zhao, Ngai-Man Cheung

Multilingual AMR Parsing with Noisy Knowledge Distillation (30 Sep 2021, 18 citations)
Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, W. Lam

Knowledge Distillation as Semiparametric Inference (20 Apr 2021, 31 citations)
Tri Dao, G. Kamath, Vasilis Syrgkanis, Lester W. Mackey

MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers (31 Dec 2020, 257 citations)
Wenhui Wang, Hangbo Bao, Shaohan Huang, Li Dong, Furu Wei
Tags: MQ

Knowledge Distillation: A Survey (09 Jun 2020, 2,851 citations)
Jianping Gou, B. Yu, Stephen J. Maybank, Dacheng Tao
Tags: VLM

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers (25 Feb 2020, 1,209 citations)
Wenhui Wang, Furu Wei, Li Dong, Hangbo Bao, Nan Yang, Ming Zhou
Tags: VLM

A Survey on Machine Reading Comprehension Systems (06 Jan 2020, 85 citations)
Razieh Baradaran, Razieh Ghiasi, Hossein Amirkhani
Tags: FaML

Hint-Based Training for Non-Autoregressive Machine Translation (15 Sep 2019, 72 citations)
Zhuohan Li, Zi Lin, Di He, Fei Tian, Tao Qin, Liwei Wang, Tie-Yan Liu

MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension (31 May 2019, 172 citations)
Alon Talmor, Jonathan Berant

Efficient Video Classification Using Fewer Frames (27 Feb 2019, 88 citations)
S. Bhardwaj, Mukundhan Srinivasan, Mitesh M. Khapra

Multi-style Generative Reading Comprehension (08 Jan 2019, 70 citations)
Kyosuke Nishida, Itsumi Saito, Kosuke Nishida, Kazutoshi Shinoda, Atsushi Otsuka, Hisako Asano, J. Tomita