Needle: A Generative-AI Powered Monte Carlo Method for Answering Complex Natural Language Queries on Multi-modal Data
- SyDaMedIm

Main:10 Pages
14 Figures
Bibliography:6 Pages
3 Tables
Appendix:17 Pages
Abstract
Multi-modal data, such as image data sets, often miss the detailed descriptions that properly capture the rich information encoded in them. This makes answering complex natural language queries a major challenge in these domains. In particular, unlike the traditional nearest-neighbor search, where the tuples and the query are modeled as points in a data cube, the query and the tuples are of different natures, making the traditional query answering solutions not directly applicable for such settings. Existing literature addresses this challenge for image data through vector representations jointly trained on natural language and images. This technique, however, underperforms for complex queries due to various reasons.
View on arXiv@article{erfanian2025_2412.00639, title={ Needle: A Generative AI-Powered Multi-modal Database for Answering Complex Natural Language Queries }, author={ Mahdi Erfanian and Mohsen Dehghankar and Abolfazl Asudeh }, journal={arXiv preprint arXiv:2412.00639}, year={ 2025 } }
Comments on this paper