ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.00158
  4. Cited By
Speaker Recognition from Raw Waveform with SincNet

Speaker Recognition from Raw Waveform with SincNet

29 July 2018
Mirco Ravanelli
Yoshua Bengio
ArXivPDFHTML

Papers citing "Speaker Recognition from Raw Waveform with SincNet"

50 / 260 papers shown
Title
End-to-end spoofing detection with raw waveform CLDNNs
End-to-end spoofing detection with raw waveform CLDNNs
Heinrich Dinkel
Nanxin Chen
Y. Qian
Kai Yu
46
77
0
26 Jul 2020
Optimization of data-driven filterbank for automatic speaker
  verification
Optimization of data-driven filterbank for automatic speaker verification
S. K. Sarangi
Md. Sahidullah
G. Saha
23
38
0
21 Jul 2020
Memory based fusion for multi-modal deep learning
Memory based fusion for multi-modal deep learning
Darshana Priyasad
Tharindu Fernando
Simon Denman
Sridha Sridharan
Clinton Fookes
20
0
0
16 Jul 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6
  Challenge
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
39
9
0
14 Jun 2020
Uniphore's submission to Fearless Steps Challenge Phase-2
Uniphore's submission to Fearless Steps Challenge Phase-2
Karthik Pandia
C. Spera
22
0
0
10 Jun 2020
CSTNet: Contrastive Speech Translation Network for Self-Supervised
  Speech Representation Learning
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning
Sameer Khurana
Antoine Laurent
James R. Glass
SSL
19
12
0
04 Jun 2020
SNR-Based Teachers-Student Technique for Speech Enhancement
SNR-Based Teachers-Student Technique for Speech Enhancement
Xiang Hao
Xiangdong Su
Zhiyu Wang
Qiang Zhang
Huali Xu
Guanglai Gao
26
15
0
29 May 2020
End-to-End Auditory Object Recognition via Inception Nucleus
End-to-End Auditory Object Recognition via Inception Nucleus
M. K. Ebrahimpour
Timothy M. Shea
Andreea Danielescu
D. Noelle
Christopher T. Kello
8
8
0
25 May 2020
Identify Speakers in Cocktail Parties with End-to-End Attention
Identify Speakers in Cocktail Parties with End-to-End Attention
Junzhe Zhu
M. Hasegawa-Johnson
Leda Sari
14
2
0
22 May 2020
Active Speakers in Context
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Guohao Li
32
61
0
20 May 2020
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Won Ik Cho
Donghyun Kwak
J. Yoon
N. Kim
29
26
0
17 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for
  researchers
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
27
151
0
08 May 2020
Segment Aggregation for short utterances speaker verification using raw
  waveforms
Segment Aggregation for short utterances speaker verification using raw waveforms
Seung-bin Kim
Jee-weon Jung
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
6
5
0
07 May 2020
Cross-modal Speaker Verification and Recognition: A Multilingual
  Perspective
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
26
25
0
28 Apr 2020
From Inference to Generation: End-to-end Fully Self-supervised
  Generation of Human Face from Speech
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Hyeong-Seok Choi
Changdae Park
Kyogu Lee
CVBM
17
29
0
13 Apr 2020
Learning to fool the speaker recognition
Learning to fool the speaker recognition
Jiguo Li
Xinfeng Zhang
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
AAML
30
21
0
07 Apr 2020
Universal Adversarial Perturbations Generative Network for Speaker
  Recognition
Universal Adversarial Perturbations Generative Network for Speaker Recognition
Jiguo Li
Xinfeng Zhang
Chuanmin Jia
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
AAML
20
45
0
07 Apr 2020
Speaker Recognition using SincNet and X-Vector Fusion
Speaker Recognition using SincNet and X-Vector Fusion
Mayank Tripathi
Divyanshu Singh
Seba Susan
23
7
0
05 Apr 2020
Improved RawNet with Feature Map Scaling for Text-independent Speaker
  Verification using Raw Waveforms
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms
Jee-weon Jung
Seung-bin Kim
Hye-jin Shim
Ju-ho Kim
Ha-Jin Yu
18
60
0
01 Apr 2020
AM-MobileNet1D: A Portable Model for Speaker Recognition
AM-MobileNet1D: A Portable Model for Speaker Recognition
João Antônio Chagas Nunes
David Macêdo
Cleber Zanchettin
20
22
0
31 Mar 2020
A Comparison of Metric Learning Loss Functions for End-To-End Speaker
  Verification
A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification
Juan Manuel Coria
H. Bredin
Sahar Ghannay
S. Rosset
23
15
0
31 Mar 2020
In defence of metric learning for speaker recognition
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
32
432
0
26 Mar 2020
Speaker Identification using EEG
Speaker Identification using EEG
G. Krishna
Co Tran
Mason Carnahan
Ahmed H. Tewfik
19
0
0
07 Mar 2020
CGCNN: Complex Gabor Convolutional Neural Network on raw speech
CGCNN: Complex Gabor Convolutional Neural Network on raw speech
Paul-Gauthier Noé
Titouan Parcollet
Mohamed Morchid
22
29
0
11 Feb 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
32
81
0
02 Jan 2020
Large-scale Multi-modal Person Identification in Real Unconstrained
  Environments
Large-scale Multi-modal Person Identification in Real Unconstrained Environments
Jiajie Ye
Y. Guan
Junfa Liu
Xinghong Huang
Hong Zhang
18
1
0
17 Dec 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Speaker detection in the wild: Lessons learned from JSALT 2019
Leibny Paola García-Perera
Jesus Villalba
H. Bredin
Jun Du
Diego Castán
...
Wassim Bouaziz
Hadrien Titeux
Emmanuel Dupoux
Kong Aik Lee
Najim Dehak
16
29
0
02 Dec 2019
Deep learning methods in speaker recognition: a review
Deep learning methods in speaker recognition: a review
Dávid Sztahó
György Szaszák
A. Beke
VLM
23
46
0
14 Nov 2019
WaveletKernelNet: An Interpretable Deep Neural Network for Industrial
  Intelligent Diagnosis
WaveletKernelNet: An Interpretable Deep Neural Network for Industrial Intelligent Diagnosis
Tianfu Li
Zhibin Zhao
Chuang Sun
Li Cheng
Xuefeng Chen
Ruqaing Yan
Ruize Gao
27
316
0
12 Nov 2019
Small-Footprint Keyword Spotting on Raw Audio Data with
  Sinc-Convolutions
Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions
Simon Mittermaier
Ludwig Kurzinger
Bernd Waschneck
Gerhard Rigoll
22
57
0
05 Nov 2019
pyannote.audio: neural building blocks for speaker diarization
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
200
313
0
04 Nov 2019
Sum-Product Networks for Robust Automatic Speaker Identification
Sum-Product Networks for Robust Automatic Speaker Identification
Aaron Nicolson
K. Paliwal
TPM
19
1
0
26 Oct 2019
Overlap-aware diarization: resegmentation using neural end-to-end
  overlapped speech detection
Overlap-aware diarization: resegmentation using neural end-to-end overlapped speech detection
Latané Bullock
H. Bredin
Leibny Paola García-Perera
22
94
0
25 Oct 2019
Filterbank design for end-to-end speech separation
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
26
69
0
23 Oct 2019
Cross-Representation Transferability of Adversarial Attacks: From
  Spectrograms to Audio Waveforms
Cross-Representation Transferability of Adversarial Attacks: From Spectrograms to Audio Waveforms
K. M. Koerich
M. Esmailpour
Sajjad Abdoli
A. Britto
Alessandro Lameiras Koerich
AAML
30
1
0
22 Oct 2019
Acoustic Model Adaptation from Raw Waveforms with SincNet
Acoustic Model Adaptation from Raw Waveforms with SincNet
Joachim Fainberg
Ondˇrej Klejch
Erfan Loweimi
P. Bell
Steve Renals
11
14
0
30 Sep 2019
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully
  Convolutional Networks
Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks
Changle Liu
Sze-Wei Fu
You-Jin Li
Jen-Wei Huang
Hsin-Min Wang
Yu Tsao
30
50
0
26 Sep 2019
Understanding Semantics from Speech Through Pre-training
Understanding Semantics from Speech Through Pre-training
P. Wang
Liangchen Wei
Yong Cao
Jinghui Xie
Yuji Cao
Zaiqing Nie
SSL
VLM
8
6
0
24 Sep 2019
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice
  Frequency for Text-to-Speech Synthesis
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis
Xin Wang
Junichi Yamagishi
14
31
0
27 Aug 2019
Universal Adversarial Audio Perturbations
Universal Adversarial Audio Perturbations
Sajjad Abdoli
L. G. Hafemann
Jérôme Rony
Ismail Ben Ayed
P. Cardinal
Alessandro Lameiras Koerich
AAML
25
51
0
08 Aug 2019
Sound source detection, localization and classification using
  consecutive ensemble of CRNN models
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
16
66
0
02 Aug 2019
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission
  to ASVspoof 2019 Challenge
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Hossein Zeinali
Themos Stafylakis
Georgia Athanasopoulou
Johan Rohdin
Ioannis Gkinis
L. Burget
J. Černocký
34
66
0
13 Jul 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion
  Recognition
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
Björn W. Schuller
36
99
0
13 Jul 2019
Towards Explainable Music Emotion Recognition: The Route via Mid-level
  Features
Towards Explainable Music Emotion Recognition: The Route via Mid-level Features
Shreyan Chowdhury
Andreu Vall
Verena Haunschmid
Gerhard Widmer
14
35
0
08 Jul 2019
Spatial Pyramid Encoding with Convex Length Normalization for
  Text-Independent Speaker Verification
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Youngmoon Jung
Younggwan Kim
Hyungjun Lim
Yeunju Choi
Hoirin Kim
21
32
0
19 Jun 2019
Deep Learning for Audio Signal Processing
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
24
586
0
30 Apr 2019
Improving Deep Speech Denoising by Noisy2Noisy Signal Mapping
Improving Deep Speech Denoising by Noisy2Noisy Signal Mapping
N. Alamdari
A. Azarang
N. Kehtarnavaz
30
42
0
26 Apr 2019
End-to-End Environmental Sound Classification using a 1D Convolutional
  Neural Network
End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network
Sajjad Abdoli
P. Cardinal
Alessandro Lameiras Koerich
39
270
0
18 Apr 2019
RawNet: Advanced end-to-end deep neural network using raw waveforms for
  text-independent speaker verification
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Jee-weon Jung
Hee-Soo Heo
Ju-ho Kim
Hye-jin Shim
Ha-Jin Yu
17
140
0
17 Apr 2019
Audio-Visual Model Distillation Using Acoustic Images
Audio-Visual Model Distillation Using Acoustic Images
Andrés F. Pérez
Valentina Sanguineti
Pietro Morerio
Vittorio Murino
VLM
15
27
0
16 Apr 2019
Previous
123456
Next