ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.10009
  4. Cited By
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion

Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion

15 February 2024
Hila Manor
T. Michaeli
    DiffM
ArXivPDFHTML

Papers citing "Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion"

18 / 18 papers shown
Title
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
Sifei Li
Mining Tan
Feier Shen
Minyan Luo
Zijiao Yin
Fan Tang
W. Dong
Changsheng Xu
69
0
0
17 Apr 2025
A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising
A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising
Jonas Dornbusch
Emanuel Pfarr
Florin-Alexandru Vasluianu
Frank Werner
Radu Timofte
DiffM
55
0
0
18 Mar 2025
XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Yong-Jin Liu
Lie Lu
Jihui Jin
Lichao Sun
Andrea Fanelli
98
1
0
06 Feb 2025
Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer
Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer
Michele Mancusi
Yurii Halychanskyi
K. Cheuk
Eloi Moliner
Chieh-Hsin Lai
...
Junghyun Koo
Marco A. Martínez-Ramírez
Wei-Hsiang Liao
Giorgio Fabbro
Yuki Mitsufuji
DiffM
85
2
0
08 Jan 2025
Shallow Diffuse: Robust and Invisible Watermarking through
  Low-Dimensional Subspaces in Diffusion Models
Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models
Wenda Li
Huijie Zhang
Qing Qu
WIGM
49
2
0
28 Oct 2024
Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and
  Generative Refinement
Annotation-Free MIDI-to-Audio Synthesis via Concatenative Synthesis and Generative Refinement
Osamu Take
Taketo Akama
26
0
0
22 Oct 2024
Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs
Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
122
0
0
15 Oct 2024
SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
Xinlei Niu
Jing Zhang
Charles Patrick Martin
25
1
0
03 Oct 2024
AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework
AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework
Yuhang Jia
Yang Chen
Jinghua Zhao
Shiwan Zhao
Wenjia Zeng
Yong Chen
Yong Qin
DiffM
36
1
0
19 Sep 2024
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
Huadai Liu
Jialei Wang
Rongjie Huang
Yang Liu
Jiayang Xu
Zhou Zhao
31
4
0
18 Jul 2024
Audio Conditioning for Music Generation via Discrete Bottleneck Features
Audio Conditioning for Music Generation via Discrete Bottleneck Features
Simon Rouard
Yossi Adi
Jade Copet
Axel Roebel
Alexandre Défossez
MGen
54
1
0
17 Jul 2024
Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling
Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling
Noam Elata
T. Michaeli
Michael Elad
DiffM
MedIm
29
9
0
11 Jul 2024
Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion
  Models
Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models
J. Nistal
Marco Pasini
Cyran Aouameur
M. Grachten
Stefan Lattner
DiffM
47
16
0
12 Jun 2024
Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data
Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data
Eloi Moliner
Sebastian Braun
H. Gamper
OT
47
2
0
29 May 2024
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language
  Models via Instruction Tuning
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning
Yixiao Zhang
Yukara Ikemiya
Woosung Choi
Naoki Murata
Marco A. Martínez-Ramírez
Liwei Lin
Gus Xia
Wei-Hsiang Liao
Yuki Mitsufuji
Simon Dixon
57
10
0
28 May 2024
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
Yixiao Zhang
Yukara Ikemiya
Gus Xia
Naoki Murata
Marco A. Martínez-Ramírez
Wei-Hsiang Liao
Yuki Mitsufuji
Simon Dixon
47
20
0
09 Feb 2024
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound
  Classification and Detection
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
121
264
0
02 Feb 2022
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,781
0
24 Feb 2021
1