ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.10666
  4. Cited By
VarietySound: Timbre-Controllable Video to Sound Generation via
  Unsupervised Information Disentanglement

VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement

19 November 2022
Chenye Cui
Yi Ren
Jinglin Liu
Rongjie Huang
Zhou Zhao
    VGen
ArXivPDFHTML

Papers citing "VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement"

12 / 12 papers shown
Title
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation
  Under Semantic Guidance
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance
Yaoyun Zhang
Xuenan Xu
Mengyue Wu
VGen
36
0
0
24 Dec 2024
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Junwon Lee
Jaekwon Im
Dabin Kim
Juhan Nam
VGen
40
9
0
21 Aug 2024
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for
  Temporal-Event-Guided Foley Sound Synthesis
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis
Yoonjin Chung
Junwon Lee
Juhan Nam
48
13
0
17 Jan 2024
SonicVisionLM: Playing Sound with Vision Language Models
SonicVisionLM: Playing Sound with Vision Language Models
Zhifeng Xie
Shengye Yu
Qile He
Mengtian Li
VLM
VGen
28
2
0
09 Jan 2024
A Demand-Driven Perspective on Generative Audio AI
A Demand-Driven Perspective on Generative Audio AI
Sangshin Oh
Minsung Kang
Hyeongi Moon
Keunwoo Choi
Ben Sangbae Chon
33
3
0
10 Jul 2023
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality
  Alignment
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment
Xize Cheng
Tao Jin
Lin Li
Wang Lin
Xinyu Duan
Zhou Zhao
VLM
18
15
0
10 Jun 2023
Conditional Generation of Audio from Video via Foley Analogies
Conditional Generation of Audio from Video via Foley Analogies
Yuexi Du
Ziyang Chen
Justin Salamon
Bryan C. Russell
Andrew Owens
VGen
17
38
0
17 Apr 2023
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain
  Text-to-Speech
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
Rongjie Huang
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
OODD
VLM
115
34
0
15 May 2022
Adversarial Examples for Good: Adversarial Examples Guided Imbalanced
  Learning
Adversarial Examples for Good: Adversarial Examples Guided Imbalanced Learning
Jie Zhang
Lei Zhang
Gang Li
Chao Wu
GAN
AAML
47
10
0
28 Jan 2022
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice
  Generation
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Rongjie Huang
Chenye Cui
Feiyang Chen
Yi Ren
Jinglin Liu
Zhou Zhao
Baoxing Huai
N. Yuan
GAN
105
62
0
14 Oct 2021
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
303
10,368
0
12 Dec 2018
Learning to Decompose and Disentangle Representations for Video
  Prediction
Learning to Decompose and Disentangle Representations for Video Prediction
Jun-Ting Hsieh
Bingbin Liu
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
DRL
138
305
0
11 Jun 2018
1