A Survey on World Models Grounded in Acoustic Physical Information

A Survey on World Models Grounded in Acoustic Physical Information

16 June 2025

ArXiv (abs)PDF HTML

Papers citing "A Survey on World Models Grounded in Acoustic Physical Information"

13 / 13 papers shown

Title
A Synergistic Framework of Nonlinear Acoustic Computing and Reinforcement Learning for Real-World Human-Robot Interaction Xiaoliang Chen Xin Yu Le Chang Yunhe Huang Jiashuai He ... Jin Li Likai Lin Ziyu Zeng Xianling Tu Shuyu Zhang 100 1 0 04 May 2025
Flamingo: a Visual Language Model for Few-Shot Learning Jean-Baptiste Alayrac Jeff Donahue Pauline Luc Antoine Miech Iain Barr ... Mikolaj Binkowski Ricardo Barreira Oriol Vinyals Andrew Zisserman Karen Simonyan MLLM VLM 418 3,602 0 29 Apr 2022
Learning Transferable Visual Models From Natural Language Supervision Alec Radford Jong Wook Kim Chris Hallacy Aditya A. Ramesh Gabriel Goh ... Amanda Askell Pamela Mishkin Jack Clark Gretchen Krueger Ilya Sutskever CLIP VLM 967 29,810 0 26 Feb 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai ... Matthias Minderer G. Heigold Sylvain Gelly Jakob Uszkoreit N. Houlsby ViT 670 41,369 0 22 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis Zhifeng Kong Ming-Yu Liu Jiaji Huang Kexin Zhao Bryan Catanzaro DiffM BDL 158 1,468 0 21 Sep 2020
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation Chuang Gan Jeremy Schwartz S. Alter Damian Mrowca Martin Schrimpf ... Antonio Torralba J. DiCarlo J. Tenenbaum Josh H. McDermott Daniel L. K. Yamins VGen 143 314 0 09 Jul 2020
Equivariant Neural Rendering Emilien Dupont Miguel Angel Bautista Alex Colburn Aditya Sankar Carlos Guestrin J. Susskind Qi Shan 3DH 86 64 0 13 Jun 2020
Dream to Control: Learning Behaviors by Latent Imagination Danijar Hafner Timothy Lillicrap Jimmy Ba Mohammad Norouzi VLM 126 1,371 0 03 Dec 2019
WHAM!: Extending Speech Separation to Noisy Environments Gordon Wichern J. Antognini Michael Flynn Licheng Richard Zhu E. McQuinn Dwight Crow Ethan Manilow Jonathan Le Roux 84 351 0 02 Jul 2019
Co-Separating Sounds of Visual Objects Ruohan Gao Kristen Grauman 131 210 0 16 Apr 2019
World Models David R Ha Jürgen Schmidhuber SyDa 143 1,098 0 27 Mar 2018
Supervised Speech Separation Based on Deep Learning: An Overview DeLiang Wang Jitong Chen SSL 77 1,374 0 24 Aug 2017
Federated Learning: Strategies for Improving Communication Efficiency Jakub Konecný H. B. McMahan Felix X. Yu Peter Richtárik A. Suresh Dave Bacon FedML 309 4,649 0 18 Oct 2016