Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal
  Selective Self-Training
v1v2 (latest)

Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training

    VLM

Papers citing "Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training"