1808.00158
Cited By
Speaker Recognition from Raw Waveform with SincNet
29 July 2018
Mirco Ravanelli
Yoshua Bengio
Papers citing
"Speaker Recognition from Raw Waveform with SincNet"
50 / 260 papers shown
Interpretable SincNet-based Deep Learning for Emotion Recognition from EEG brain activity
J. M. M. Torres
Mirco Ravanelli
Sara E. Medina-DeVilliers
M. Lerner
Giuseppe Riccardi
11
21
0
18 Jul 2021
Representation based meta-learning for few-shot spoken intent recognition
Ashish R. Mittal
Samarth Bharadwaj
Shreya Khare
Saneem A. Chemmengath
Karthik Sankaranarayanan
Brian Kingsbury
20
12
0
29 Jun 2021
SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform
Yuhang He
A. Trigoni
Andrew Markham
31
19
0
13 Jun 2021
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild
Okan Kopuklu
Maja Taseska
Gerhard Rigoll
3DV
29
45
0
07 Jun 2021
PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform
Wencheng Li
Zhenhua Tan
Jingyu Ning
Zhenche Xia
Danke Wu
16
1
0
31 May 2021
EEG-based Cross-Subject Driver Drowsiness Recognition with an Interpretable Convolutional Neural Network
Jian Cui
Zirui Lan
O. Sourina
W. Müller-Wittig
27
102
0
30 May 2021
A Modulation Front-End for Music Audio Tagging
Cyrus Vahidi
C. Saitis
Gyorgy Fazekas
27
2
0
25 May 2021
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
44
22
0
27 Apr 2021
Voice2Mesh: Cross-Modal 3D Face Model Generation from Voices
Cho-Ying Wu
Ke Xu
Chin-Cheng Hsu
Ulrich Neumann
CVBM
3DH
50
4
0
21 Apr 2021
End-to-end Keyword Spotting using Neural Architecture Search and Quantization
David Peter
Wolfgang Roth
Franz Pernkopf
MQ
27
14
0
14 Apr 2021
Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System
Ju-ho Kim
Hye-jin Shim
Jee-weon Jung
Ha-Jin Yu
20
1
0
14 Apr 2021
On Architectures and Training for Raw Waveform Feature Extraction in ASR
Peter Vieting
Christoph Luscher
Wilfried Michel
Ralf Schluter
Hermann Ney
30
9
0
09 Apr 2021
End-to-end speaker segmentation for overlap-aware resegmentation
H. Bredin
Antoine Laurent
VLM
211
163
0
08 Apr 2021
Graph Attention Networks for Anti-Spoofing
Hemlata Tak
Jee-weon Jung
J. Patino
Massimiliano Todisco
Nicholas W. D. Evans
44
66
0
08 Apr 2021
Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing Detection
W. Ge
Michele Panariello
J. Patino
Massimiliano Todisco
Nicholas W. D. Evans
3DPC
60
30
0
07 Apr 2021
Learning spectro-temporal representations of complex sounds with parameterized neural networks
Rachid Riad
Julien Karadayi
Anne-Catherine Bachoud-Lévi
Emmanuel Dupoux
29
7
0
12 Mar 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
22
6
0
02 Mar 2021
Contrastive Separative Coding for Self-supervised Representation Learning
Jun Wang
Max W. Y. Lam
Dan Su
Dong Yu
SSL
16
3
0
01 Mar 2021
Learnable MFCCs for Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
32
17
0
20 Feb 2021
U-vectors: Generating clusterable speaker embedding from unlabeled data
M. F. Mridha
Abu Quwsar Ohi
M. Monowar
Md. Abdul Hamid
Md. Rashedul Islam
Yutaka Watanobe
SSL
25
6
0
07 Feb 2021
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
22
13
0
07 Feb 2021
Multi-Task Self-Supervised Pre-Training for Music Classification
Ho-Hsiang Wu
Chieh-Chi Kao
Qingming Tang
Ming Sun
Brian McFee
J. P. Bello
Chao Wang
SSL
39
37
0
05 Feb 2021
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap
Shota Horiguchi
Nelson Yalta
Leibny Paola García-Perera
Yuki Takashima
Yawen Xue
Desh Raj
Zili Huang
Yusuke Fujita
Shinji Watanabe
Sanjeev Khudanpur
BDL
27
36
0
02 Feb 2021
Curriculum Learning: A Survey
Petru Soviany
Radu Tudor Ionescu
Paolo Rota
N. Sebe
ODL
79
342
0
25 Jan 2021
LEAF: A Learnable Frontend for Audio Classification
Neil Zeghidour
O. Teboul
Félix de Chaumont Quitry
Marco Tagliasacchi
VLM
AAML
85
144
0
21 Jan 2021
MAAS: Multi-modal Assignation for Active Speaker Detection
Juan Carlos León Alcázar
Fabian Caba Heilbron
Ali K. Thabet
Guohao Li
65
51
0
11 Jan 2021
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps
Tri Dao
N. Sohoni
Albert Gu
Matthew Eichhorn
Amit Blonder
Megan Leszczynski
Atri Rudra
Christopher Ré
25
43
0
29 Dec 2020
A Study of Few-Shot Audio Classification
Piper Wolters
Chris Careaga
Brian Hutchinson
Lauren A. Phillips
19
10
0
02 Dec 2020
A comparison of handcrafted, parameterized, and learnable features for speech separation
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
27
4
0
29 Nov 2020
Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer
M. Soltanian
Junaid Malik
Jenni Raitoharju
Alexandros Iosifidis
S. Kiranyaz
Denmark
22
11
0
23 Nov 2020
Deep Learning in EEG: Advance of the Last Ten-Year Critical Period
Shu Gong
Kaibo Xing
A. Cichocki
Junhua Li
VLM
32
64
0
22 Nov 2020
Recognizing More Emotions with Less Data Using Self-supervised Transfer Learning
Jonathan Boigne
Biman Liyanage
Ted Östrem
24
20
0
11 Nov 2020
A Comparison Study on Infant-Parent Voice Diarization
Junzhe Zhu
M. Hasegawa-Johnson
Nancy L. McElwain
20
1
0
05 Nov 2020
Comparison of Speaker Role Recognition and Speaker Enrollment Protocol for conversational Clinical Interviews
Rachid Riad
Hadrien Titeux
Laurie Lemoine
Justine Montillot
A. Sliwinski
J. Bagnou
Xuan-Nga Cao
Anne-Catherine Bachoud-Lévi
Emmanuel Dupoux
15
0
0
30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020
Yoohwan Kwon
Hee-Soo Heo
Bong-Jin Lee
Joon Son Chung
21
59
0
29 Oct 2020
Y-Vector: Multiscale Waveform Encoder for Speaker Embedding
Ge Zhu
Fei Jiang
Z. Duan
11
25
0
24 Oct 2020
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding
Minjeong Kim
Gyuwan Kim
Sang-Woo Lee
Jung-Woo Ha
VLM
24
34
0
23 Oct 2020
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models
Saurabh Kataria
Jesús Villalba
Najim Dehak
VLM
SSL
26
34
0
22 Oct 2020
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers
Zeqian Li
Jacob Whitehill
19
11
0
22 Oct 2020
Graph Attention Networks for Speaker Verification
Jee-weon Jung
Hee-Soo Heo
Ha-Jin Yu
Joon Son Chung
25
26
0
22 Oct 2020
Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark
Bhusan Chettri
Emmanouil Benetos
Bob L. T. Sturm
34
27
0
15 Oct 2020
Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions
Ludwig Kurzinger
Nicolas Lindae
Palle Klewitz
Gerhard Rigoll
27
5
0
15 Oct 2020
A Lightweight Speaker Recognition System Using Timbre Properties
Abu Quwsar Ohi
M. F. Mridha
Md. Abdul Hamid
M. Monowar
Dongsu Lee
Jinsul Kim
18
2
0
12 Oct 2020
Attention Driven Fusion for Multi-Modal Emotion Recognition
Darshana Priyasad
Tharindu Fernando
Simon Denman
Clinton Fookes
Sridha Sridharan
11
68
0
23 Sep 2020
TRIER: Template-Guided Neural Networks for Robust and Interpretable Sleep Stage Identification from EEG Recordings
Taeheon Lee
Jeonghwan Hwang
Honggu Lee
35
7
0
10 Sep 2020
DeepVOX: Discovering Features from Raw Audio for Speaker Recognition in Non-ideal Audio Signals
Anurag Chowdhury
Arun Ross
14
2
0
26 Aug 2020
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data
Aditya Sunil Joglekar
John H. L. Hansen
M. C. Shekhar
A. Sangwan
32
24
0
15 Aug 2020
End-to-End Neural Transformer Based Spoken Language Understanding
Martin H. Radfar
Athanasios Mouchtaris
Siegfried Kunzmann
44
61
0
12 Aug 2020
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
28
9
0
30 Jul 2020
Double Multi-Head Attention for Speaker Verification
Miquel India
Pooyan Safari
Javier Hernando
28
18
0
26 Jul 2020