
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Papers citing "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language"
50 / 557 papers shown
Title |
---|
Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition Shujie Hu Xurong Xie Mengzhe Geng Zengrui Jin Jiajun Deng ...Yi Wang Mingyu Cui Tianzi Wang Helen Meng Xunying Liu |
Towards Robust Speech Representation Learning for Thousands of Languages William Chen Wangyou Zhang Yifan Peng Xinjian Li Jinchuan Tian Jiatong Shi Xuankai Chang Soumi Maiti Karen Livescu Shinji Watanabe |
Siamese Vision Transformers are Scalable Audio-visual Learners Yan-Bo Lin Gedas Bertasius |