On the compression of shallow non-causal ASR models using knowledge
distillation and tied-and-reduced decoder for low-latency on-device speech
recognition
Papers citing "On the compression of shallow non-causal ASR models using knowledge
distillation and tied-and-reduced decoder for low-latency on-device speech
recognition"