ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00927
  4. Cited By
Audio Set classification with attention model: A probabilistic
  perspective

Audio Set classification with attention model: A probabilistic perspective

2 November 2017
Qiuqiang Kong
Yong-mei Xu
Wenwu Wang
Mark D. Plumbley
    BDL
ArXivPDFHTML

Papers citing "Audio Set classification with attention model: A probabilistic perspective"

14 / 14 papers shown
Title
ICSD: An Open-source Dataset for Infant Cry and Snoring Detection
ICSD: An Open-source Dataset for Infant Cry and Snoring Detection
Qingyu Liu
Longfei Song
Dongxing Xu
Yanhua Long
45
0
0
20 Aug 2024
Event-related data conditioning for acoustic event classification
Event-related data conditioning for acoustic event classification
Yuanbo Hou
Dick Botteldooren
28
3
0
16 Jun 2022
Audio-Visual Transformer Based Crowd Counting
Audio-Visual Transformer Based Crowd Counting
Usman Sajid
Xiangyu Chen
Hasan Sajid
Taejoon Kim
Guanghui Wang
ViT
48
22
0
04 Sep 2021
Voice activity detection in the wild: A data-driven approach using
  teacher-student training
Voice activity detection in the wild: A data-driven approach using teacher-student training
Heinrich Dinkel
Shuai Wang
Xuenan Xu
Mengyue Wu
K. Yu
VLM
11
32
0
10 May 2021
Audio Retrieval with Natural Language Queries
Audio Retrieval with Natural Language Queries
Andreea-Maria Oncescu
A. Sophia Koepke
João F. Henriques
Zeynep Akata
Samuel Albanie
21
77
0
05 May 2021
Perceiver: General Perception with Iterative Attention
Perceiver: General Perception with Iterative Attention
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
91
976
0
04 Mar 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and
  Aggregation
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
104
144
0
02 Feb 2021
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video
  Parsing
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing
Yapeng Tian
Dingzeyu Li
Chenliang Xu
34
180
0
21 Jul 2020
Implicit Neural Representations with Periodic Activation Functions
Implicit Neural Representations with Periodic Activation Functions
Vincent Sitzmann
Julien N. P. Martel
Alexander W. Bergman
David B. Lindell
Gordon Wetzstein
AI4TS
47
2,489
0
17 Jun 2020
Visual Attention for Musical Instrument Recognition
Visual Attention for Musical Instrument Recognition
Karn N. Watcharasupat
Francesco Ferroni
Alexander Lerch
24
3
0
17 Jun 2020
What Makes Training Multi-Modal Classification Networks Hard?
What Makes Training Multi-Modal Classification Networks Hard?
Weiyao Wang
Du Tran
Matt Feiszli
28
442
0
29 May 2019
Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection
Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection
Ke-Xin He
Yuhan Shen
Weiqiang Zhang
25
6
0
28 Mar 2019
Weakly Labelled AudioSet Tagging with Attention Neural Networks
Weakly Labelled AudioSet Tagging with Attention Neural Networks
Qiuqiang Kong
Changsong Yu
Turab Iqbal
Yong-mei Xu
Wenwu Wang
Mark D. Plumbley
NoLa
24
78
0
02 Mar 2019
Audio-Based Activities of Daily Living (ADL) Recognition with
  Large-Scale Acoustic Embeddings from Online Videos
Audio-Based Activities of Daily Living (ADL) Recognition with Large-Scale Acoustic Embeddings from Online Videos
Dawei Liang
Edison Thomaz
14
80
0
19 Oct 2018
1