Aligned Better, Listen Better for Audio-Visual Large Language Models


2 April 2025
Yuxin Guo
Shuailei Ma
Shijie Ma
Xiaoyi Bao
Chen-Wei Xie
Kecheng Zheng
Tingyu Weng
Siyang Sun
Yun Zheng
Wei Zou
    MLLM
    AuLLM
arXiv: 2504.02061

Papers citing "Aligned Better, Listen Better for Audio-Visual Large Language Models"

2 / 2 papers shown
EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning
Zhenghao Xing
Xiaowei Hu
Chi-Wing Fu
Wei Wang
Jifeng Dai
Pheng-Ann Heng
MLLM
OffRL
VLM
LRM
50
0
0
07 May 2025
ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery
Shijie Ma
Fei Zhu
Xu-Yao Zhang
Cheng-Lin Liu
37
1
0
02 Apr 2025