ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.09509
  4. Cited By
Learning Multiscale Features Directly From Waveforms

Learning Multiscale Features Directly From Waveforms

31 March 2016
Zhenyao Zhu
Jesse Engel
Awni Y. Hannun
ArXivPDFHTML

Papers citing "Learning Multiscale Features Directly From Waveforms"

12 / 12 papers shown
Title
Deep Neural Network for Automatic Assessment of Dysphonia
Deep Neural Network for Automatic Assessment of Dysphonia
Mario Alejandro García
Ana Lorena Rosset
16
5
0
25 Feb 2022
On the limit of English conversational speech recognition
On the limit of English conversational speech recognition
Zoltán Tüske
G. Saon
Brian Kingsbury
22
50
0
03 May 2021
Rethinking CNN Models for Audio Classification
Rethinking CNN Models for Audio Classification
Kamalesh Palanisamy
Dipika Singhania
Angela Yao
SSL
33
144
0
22 Jul 2020
J-Net: Randomly weighted U-Net for audio source separation
J-Net: Randomly weighted U-Net for audio source separation
Bo Chen
Yen-Min Hsu
Hung-yi Lee
25
2
0
29 Nov 2019
End-to-End Environmental Sound Classification using a 1D Convolutional
  Neural Network
End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network
Sajjad Abdoli
P. Cardinal
Alessandro Lameiras Koerich
39
270
0
18 Apr 2019
GANSynth: Adversarial Neural Audio Synthesis
GANSynth: Adversarial Neural Audio Synthesis
Jesse Engel
Kumar Krishna Agrawal
Shuo Chen
Ishaan Gulrajani
Chris Donahue
Adam Roberts
49
385
0
23 Feb 2019
Randomly weighted CNNs for (music) audio classification
Randomly weighted CNNs for (music) audio classification
Jordi Pons
Xavier Serra
19
85
0
01 May 2018
End-to-end learning for music audio tagging at scale
End-to-end learning for music audio tagging at scale
Jordi Pons
Oriol Nieto
Matthew Prockup
Erik M. Schmidt
Andreas F. Ehmann
Xavier Serra
30
176
0
07 Nov 2017
Speaker Diarization using Deep Recurrent Convolutional Neural Networks
  for Speaker Embeddings
Speaker Diarization using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings
Pawel Cyrta
Tomasz Trzciñski
Wojciech Stokowiec
19
33
0
09 Aug 2017
Reducing Bias in Production Speech Models
Reducing Bias in Production Speech Models
Eric Battenberg
R. Child
Adam Coates
Christopher Fougner
Yashesh Gaur
...
Vinay Rao
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
38
10
0
11 May 2017
Temporal Segment Networks for Action Recognition in Videos
Temporal Segment Networks for Action Recognition in Videos
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
37
803
0
08 May 2017
CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016
CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016
Yuanjun Xiong
Limin Wang
Zhe Wang
Bowen Zhang
Hang Song
Wei Li
Dahua Lin
Yu Qiao
Luc Van Gool
Xiaoou Tang
35
146
0
02 Aug 2016
1