ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.09268
  4. Cited By
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
v1v2v3 (latest)

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

28 November 2016
Payal Bajaj
Daniel Fernando Campos
Nick Craswell
Li Deng
Jianfeng Gao
Xiaodong Liu
Rangan Majumder
Andrew McNamara
Bhaskar Mitra
Tri Nguyen
Mir Rosenberg
Xia Song
Alina Stoica
Saurabh Tiwary
Tong Wang
    RALM
ArXiv (abs)PDFHTML

Papers citing "MS MARCO: A Human Generated MAchine Reading COmprehension Dataset"

50 / 1,372 papers shown
Title
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
Adithya Bhaskar
Alexander Wettig
Tianyu Gao
Yihe Dong
Danqi Chen
15
0
0
20 Jun 2025
Approximating Language Model Training Data from Weights
Approximating Language Model Training Data from Weights
John X. Morris
Junjie Oscar Yin
Woojeong Kim
Vitaly Shmatikov
Alexander M. Rush
31
0
0
18 Jun 2025
Intended Target Identification for Anomia Patients with Gradient-based Selective Augmentation
Intended Target Identification for Anomia Patients with Gradient-based Selective Augmentation
Jongho Kim
Romain Storaï
Seung-won Hwang
26
1
0
17 Jun 2025
Maximally-Informative Retrieval for State Space Model Generation
Maximally-Informative Retrieval for State Space Model Generation
Evan Becker
Benjamin Bowman
Matthew Trager
Tian Yu Liu
Luca Zancato
Wei Xia
Stefano Soatto
RALM
22
0
0
13 Jun 2025
ThinkQE: Query Expansion via an Evolving Thinking Process
ThinkQE: Query Expansion via an Evolving Thinking Process
Yibin Lei
Tao Shen
Andrew Yates
ReLMLRM
38
0
0
10 Jun 2025
Brevity is the soul of sustainability: Characterizing LLM response lengths
S. Poddar
Paramita Koley
Janardan Misra
Sanjay Podder
Navveen Balani
Niloy Ganguly
Saptarshi Ghosh
32
0
0
10 Jun 2025
LGAI-EMBEDDING-Preview Technical Report
LGAI-EMBEDDING-Preview Technical Report
Jooyoung Choi
H. Kim
Hansol Jang
Changwook Jun
Kyunghoon Bae
Hyewon Choi
Stanley Jungkyu Choi
Honglak Lee
Chulmin Yun
12
0
0
09 Jun 2025
On the Merits of LLM-Based Corpus Enrichment
On the Merits of LLM-Based Corpus Enrichment
Gal Zur
Tommy Mordo
Moshe Tennenholtz
Oren Kurland
53
0
0
06 Jun 2025
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Haowei Wang
Rupeng Zhang
Junjie Wang
Mingyang Li
Yuekai Huang
Dandan Wang
Qing Wang
SILMAAML
51
0
0
06 Jun 2025
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table
Yusuke Matsui
118
0
0
05 Jun 2025
GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval
GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval
Lingyuan Liu
Mengxiang Zhang
150
0
0
05 Jun 2025
Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion
Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion
Lingyuan Liu
Mengxiang Zhang
116
0
0
05 Jun 2025
TracLLM: A Generic Framework for Attributing Long Context LLMs
TracLLM: A Generic Framework for Attributing Long Context LLMs
Yanting Wang
Wei Zou
Runpeng Geng
Jinyuan Jia
LLMAG
126
0
0
04 Jun 2025
EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models
EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models
Mingzhe Li
Gehao Zhang
Zhenting Wang
Shiqing Ma
Siqi Pan
Richard Cartwright
Juan Zhai
DiffM
52
0
0
03 Jun 2025
When Should Dense Retrievers Be Updated in Evolving Corpora? Detecting Out-of-Distribution Corpora Using GradNormIR
When Should Dense Retrievers Be Updated in Evolving Corpora? Detecting Out-of-Distribution Corpora Using GradNormIR
Dayoon Ko
Jinyoung Kim
Sohyeon Kim
Jinhyuk Kim
Jaehoon Lee
Seonghak Song
Minyoung Lee
Gunhee Kim
55
0
0
02 Jun 2025
IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
Pasunuti Prasanjith
Prathmesh B More
Anoop Kunchukuttan
Raj Dabre
RALM
49
0
0
02 Jun 2025
Entity Image and Mixed-Modal Image Retrieval Datasets
Entity Image and Mixed-Modal Image Retrieval Datasets
Cristian-Ioan Blaga
Paul Suganthan
Sahil Dua
Krishna Srinivasan
Enrique Alfonseca
Peter Dornbach
Tom Duerig
I. Zitouni
Zhe Dong
VLM
20
0
0
02 Jun 2025
CiteEval: Principle-Driven Citation Evaluation for Source Attribution
CiteEval: Principle-Driven Citation Evaluation for Source Attribution
Yumo Xu
Peng Qi
Jifan Chen
Kunlun Liu
Rujun Han
Lan Liu
Bonan Min
Vittorio Castelli
Arshit Gupta
Zhiguo Wang
HILM
52
0
0
02 Jun 2025
LaMP-QA: A Benchmark for Personalized Long-form Question Answering
LaMP-QA: A Benchmark for Personalized Long-form Question Answering
Alireza Salemi
Hamed Zamani
13
0
0
30 May 2025
On the Scaling of Robustness and Effectiveness in Dense Retrieval
On the Scaling of Robustness and Effectiveness in Dense Retrieval
Yu-an Liu
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Yixing Fan
Xueqi Cheng
22
0
0
30 May 2025
Who is in the Spotlight: The Hidden Bias Undermining Multimodal Retrieval-Augmented Generation
Who is in the Spotlight: The Hidden Bias Undermining Multimodal Retrieval-Augmented Generation
Jiayu Yao
Shenghua Liu
Yiwei Wang
Lingrui Mei
Baolong Bi
Yuyao Ge
Zhecheng Li
Xueqi Cheng
13
0
0
30 May 2025
REIC: RAG-Enhanced Intent Classification at Scale
REIC: RAG-Enhanced Intent Classification at Scale
Ziji Zhang
Michael Yang
Zhiyu Chen
Yingying Zhuang
S. Pi
Qun Liu
Rajashekar Maragoud
Vy Nguyen
Anurag Beniwal
23
0
0
30 May 2025
Uncovering Visual-Semantic Psycholinguistic Properties from the Distributional Structure of Text Embedding Space
Uncovering Visual-Semantic Psycholinguistic Properties from the Distributional Structure of Text Embedding Space
Si Wu
Sebastian Bruch
46
0
0
29 May 2025
Retrieval-Augmented Generation: A Comprehensive Survey of Architectures, Enhancements, and Robustness Frontiers
Retrieval-Augmented Generation: A Comprehensive Survey of Architectures, Enhancements, and Robustness Frontiers
Chaitanya Sharma
RALM3DV
30
0
0
28 May 2025
Xinyu AI Search: Enhanced Relevance and Comprehensive Results with Rich Answer Presentations
Xinyu AI Search: Enhanced Relevance and Comprehensive Results with Rich Answer Presentations
Bo Tang
Junyi Zhu
Chenyang Xi
Yunhang Ge
Jiahao Wu
...
Yebin Yang
Jiajia Wang
Zhiyu Li
Feiyu Xiong
Jingrun Chen
42
0
0
28 May 2025
ChatPD: An LLM-driven Paper-Dataset Networking System
ChatPD: An LLM-driven Paper-Dataset Networking System
Anjie Xu
Ruiqing Ding
Leye Wang
44
0
0
28 May 2025
Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval
Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval
Seongwan Park
Taeklim Kim
Youngjoong Ko
23
0
0
28 May 2025
Disentangling Locality and Entropy in Ranking Distillation
Disentangling Locality and Entropy in Ranking Distillation
Andrew Parry
Debasis Ganguly
Sean MacAvaney
51
0
0
27 May 2025
Query Drift Compensation: Enabling Compatibility in Continual Learning of Retrieval Embedding Models
Query Drift Compensation: Enabling Compatibility in Continual Learning of Retrieval Embedding Models
Dipam Goswami
Liying Wang
Bartłomiej Twardowski
Joost van de Weijer
33
0
0
27 May 2025
ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision
ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision
Dosung Lee
Wonjun Oh
Boyoung Kim
Minyoung Kim
Joonsuk Park
Paul Hongsuck Seo
LRM
15
0
0
27 May 2025
CPA-RAG:Covert Poisoning Attacks on Retrieval-Augmented Generation in Large Language Models
CPA-RAG:Covert Poisoning Attacks on Retrieval-Augmented Generation in Large Language Models
Chunyang Li
Junwei Zhang
Anda Cheng
Zhuo Ma
Xinghua Li
Jianfeng Ma
SILMAAML
39
0
0
26 May 2025
GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models
GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models
Tingjia Shen
Hao Wang
Chuan Qin
Ruijun Sun
Yang Song
Defu Lian
Hengshu Zhu
Enhong Chen
49
0
0
26 May 2025
POQD: Performance-Oriented Query Decomposer for Multi-vector retrieval
POQD: Performance-Oriented Query Decomposer for Multi-vector retrieval
Yaoyang Liu
Junlin Li
Yinjun Wu
Zhen Chen
67
0
0
25 May 2025
RankLLM: A Python Package for Reranking with LLMs
RankLLM: A Python Package for Reranking with LLMs
Sahel Sharifymoghaddam
Ronak Pradeep
Andre Slavescu
Ryan Nguyen
Andrew Xu
Zijian Chen
Yilin Zhang
Yidi Chen
Jasper Xian
Jimmy Lin
KELMALMLRM
35
1
0
25 May 2025
Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval
Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval
Kidist Amde Mekonnen
Yosef Worku Alemneh
Maarten de Rijke
RALM
44
0
0
25 May 2025
Conventional Contrastive Learning Often Falls Short: Improving Dense Retrieval with Cross-Encoder Listwise Distillation and Synthetic Data
Conventional Contrastive Learning Often Falls Short: Improving Dense Retrieval with Cross-Encoder Listwise Distillation and Synthetic Data
Manveer Singh Tamber
Suleman Kazi
Vivek Sourabh
Jimmy Lin
63
0
0
25 May 2025
Aligning Web Query Generation with Ranking Objectives via Direct Preference Optimization
Aligning Web Query Generation with Ranking Objectives via Direct Preference Optimization
Joao Coelho
Bruno Martins
João Magalhães
Chenyan Xiong
SyDa
45
0
0
25 May 2025
The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems
The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems
Hongru Song
Yu-an Liu
Ruqing Zhang
Jiafeng Guo
Jianming Lv
Maarten de Rijke
Xueqi Cheng
AAML
35
0
0
24 May 2025
Modeling Ranking Properties with In-Context Learning
Modeling Ranking Properties with In-Context Learning
Nilanjan Sinhababu
Andrew Parry
Debasis Ganguly
Pabitra Mitra
50
0
0
23 May 2025
RaDeR: Reasoning-aware Dense Retrieval Models
RaDeR: Reasoning-aware Dense Retrieval Models
Debrup Das
Sam O' Nuallain
Razieh Rahimi
RALMLRM
69
1
0
23 May 2025
Locality-Sensitive Hashing for Efficient Hard Negative Sampling in Contrastive Learning
Locality-Sensitive Hashing for Efficient Hard Negative Sampling in Contrastive Learning
Fabian Deuser
Philipp Hausenblas
Hannah Schieber
Daniel Roth
Martin Werner
Norbert Oswald
205
0
0
23 May 2025
Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems
Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems
Hongru Song
Yu-an Liu
Ruqing Zhang
Jiafeng Guo
Yixing Fan
AAMLSILMLRM
49
0
0
22 May 2025
Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval
Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval
Nandan Thakur
Crystina Zhang
Xueguang Ma
Jimmy Lin
160
0
0
22 May 2025
Don't "Overthink" Passage Reranking: Is Reasoning Truly Necessary?
Don't "Overthink" Passage Reranking: Is Reasoning Truly Necessary?
Nour Jedidi
Yung-Sung Chuang
James R. Glass
Jimmy Lin
ReLMLRM
84
0
0
22 May 2025
MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries
MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries
Jonghwi Kim
Deokhyung Kang
Seonjeong Hwang
Yunsu Kim
Jungseul Ok
Gary Lee
22
0
0
22 May 2025
InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation
InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation
Yunjia Xi
Jianghao Lin
Menghui Zhu
Yongzhao Xiao
Zhuoying Ou
...
Weiwen Liu
Yasheng Wang
Ruiming Tang
Weinan Zhang
Yong Yu
114
1
0
21 May 2025
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning
Changtai Zhu
Siyin Wang
Ruijun Feng
Kai Song
Xipeng Qiu
LRM
86
0
0
21 May 2025
Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
Siddhant Bhambri
Upasana Biswas
Subbarao Kambhampati
137
1
0
20 May 2025
Rank-K: Test-Time Reasoning for Listwise Reranking
Rank-K: Test-Time Reasoning for Listwise Reranking
Eugene Yang
Andrew Yates
Kathryn Ricci
Orion Weller
Vivek Chari
Benjamin Van Durme
Dawn J Lawrie
LRM
69
2
0
20 May 2025
Benchmarking the Myopic Trap: Positional Bias in Information Retrieval
Benchmarking the Myopic Trap: Positional Bias in Information Retrieval
Ziyang Zeng
Dun Zhang
Jiacheng Li
Panxiang Zou
Yuqing Yang
78
0
0
20 May 2025
1234...262728
Next