ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.04198
109
48
v1v2v3v4 (latest)

Preech: A System for Privacy-Preserving Speech Transcription

9 September 2019
Shimaa Ahmed
Amrita Roy Chowdhury
Kassem Fawaz
P. Ramanathan
ArXiv (abs)PDFHTML
Abstract

New Advances in machine learning have made Automated Speech Recognition (ASR) systems practical. ASR systems can transcribe speech data at scale. Unfortunately, these systems pose serious privacy threats as speech is a rich source of sensitive acoustic and textual information. Although offline ASR eliminates the privacy risks, its transcription performance is inferior to that of cloud-based ASR systems, especially for real-world use cases. In this paper, we propose Prϵϵ\epsilon\epsilonϵϵch, an end-to-end speech transcription system which lies at an intermediate point in the privacy-utility spectrum of speech transcription. It protects the acoustic features of the speakers' voices and protects the privacy of the textual content at an improved performance relative to offline ASR. Additionally, Prϵϵ\epsilon\epsilonϵϵch provides several control knobs to allow customizable utility-usability-privacy trade-off. It relies on cloud-based services to transcribe a speech file after applying a series of privacy-preserving operations on the user's side. We perform a comprehensive evaluation of Prϵϵ\epsilon\epsilonϵϵch, using diverse real-world datasets, that demonstrates its effectiveness. Prϵϵ\epsilon\epsilonϵϵch provides transcriptions at a 2% to 32.25% (mean 17.34%) relative improvement in word error rate over Deep Speech, while fully obfuscating the speakers' voice biometrics and allowing only a differentially private view of the textual content.

View on arXiv
Comments on this paper