ResearchTrend.AI
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
Versions: v1, v2, v3 (latest)

28 November 2016
Payal Bajaj
Daniel Fernando Campos
Nick Craswell
Li Deng
Jianfeng Gao
Xiaodong Liu
Rangan Majumder
Andrew McNamara
Bhaskar Mitra
Tri Nguyen
Mir Rosenberg
Xia Song
Alina Stoica
Saurabh Tiwary
Tong Wang
    RALM
arXiv:1611.09268

Papers citing "MS MARCO: A Human Generated MAchine Reading COmprehension Dataset"

50 / 1,372 papers shown
Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval
Ben Chen
Huangyu Dai
Xiang Ma
Wen Jiang
Wei Ning
32
0
0
04 Jun 2024
A Survey of Generative Information Retrieval
Tzu-Lin Kuo
Tzu-Wei Chiu
Tzung-Sheng Lin
Sheng-Yang Wu
Chao-Wei Huang
Yun-Nung Chen
SyDa
128
2
0
03 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILM RALM
113
7
0
03 Jun 2024
BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Jiaqi Xue
Meng Zheng
Yebowen Hu
Fei Liu
Xun Chen
Qian Lou
AAML SILM
97
38
0
03 Jun 2024
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking
Yifan Zeng
Ojas Tendolkar
Raymond Baartmans
Qingyun Wu
Huazheng Wang
Lizhong Chen
84
0
0
31 May 2024
CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking
Josef Vonásek
Milan Straka
Rostislav Krc
Lenka Lasonová
Ekaterina Egorova
Jana Straková
Jakub Náplava
53
2
0
31 May 2024
Passage-specific Prompt Tuning for Passage Reranking in Question Answering with Large Language Models
Xuyang Wu
Zhiyuan Peng
Sravanthi Rajanala
Hsin-Tai Wu
Yi Fang
LRM RALM
84
0
0
31 May 2024
Phantom: General Trigger Attacks on Retrieval Augmented Language Generation
Harsh Chaudhari
Giorgio Severi
John Abascal
Matthew Jagielski
Christopher A. Choquette-Choo
Milad Nasr
Cristina Nita-Rotaru
Alina Oprea
SILM AAML
126
40
0
30 May 2024
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Yutao Zhu
Zhaoheng Huang
Zhicheng Dou
Ji-Rong Wen
RALM
96
6
0
30 May 2024
Quriosity: Analyzing Human Questioning Behavior and Causal Inquiry through Curiosity-Driven Queries
Roberto Ceraolo
Dmitrii Kharlapenko
Amélie Reymond
Rada Mihalcea
Mrinmaya Sachan
Bernhard Schölkopf
Zhijing Jin
CML
100
2
0
30 May 2024
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension
Shubham Vatsal
Ayush Singh
LM&MA RALM
86
0
0
29 May 2024
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
Junda Zhu
Lingyong Yan
Haibo Shi
Dawei Yin
Lei Sha
RALM
80
8
0
28 May 2024
Generative Query Reformulation Using Ensemble Prompting, Document Fusion, and Relevance Feedback
Kaustubh D. Dhole
Ramraj Chandradevan
Eugene Agichtein
66
1
0
27 May 2024
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
104
15
0
27 May 2024
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee
Rajarshi Roy
Mengyao Xu
Jonathan Raiman
Mohammad Shoeybi
Bryan Catanzaro
Ming-Yu Liu
RALM
304
205
0
27 May 2024
Crafting Interpretable Embeddings by Asking LLMs Questions
Vinamra Benara
Chandan Singh
John X. Morris
Richard Antonello
Ion Stoica
Alexander G. Huth
Jianfeng Gao
69
6
0
26 May 2024
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
Sunhao Dai
Weihao Liu
Yuqi Zhou
Liang Pang
Rongju Ruan
Gang Wang
Zhenhua Dong
Jun Xu
Jirong Wen
129
12
0
26 May 2024
AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
R. Reddy
Omar Attia
Yunyao Li
Heng Ji
Saloni Potdar
61
1
0
23 May 2024
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models
Xiangkun Hu
Dongyu Ru
Lin Qiu
Qipeng Guo
Tianhang Zhang
Yang Xu
Yun Luo
Pengfei Liu
Yue Zhang
Zheng Zhang
HILM LRM
98
9
0
23 May 2024
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
Xin Cheng
Xun Wang
Xingxing Zhang
Tao Ge
Si-Qing Chen
Furu Wei
Huishuai Zhang
Dongyan Zhao
102
36
0
22 May 2024
TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models
Pengzhou Cheng
Yidong Ding
Tianjie Ju
Zongru Wu
Wei Du
Ping Yi
Zhuosheng Zhang
Gongshen Liu
SILM AAML
90
29
0
22 May 2024
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Jiajie Jin
Yutao Zhu
Xinyu Yang
Chenghao Zhang
Tong Zhao
Zhao Yang
Zhicheng Dou
Ji-Rong Wen
VLM
165
72
0
22 May 2024
Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval
Jonghyun Song
Cheyon Jin
Wenlong Zhao
Jay Yoon Lee
84
0
0
21 May 2024
Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search
Sebastian Bruch
Aditya Krishnan
F. M. Nardini
56
3
0
20 May 2024
DocReLM: Mastering Document Retrieval with Language Model
Gengchen Wei
Xinle Pang
Tianning Zhang
Yu Sun
Xun Qian
Chen Lin
Han-Sen Zhong
Wanli Ouyang
RALM
69
0
0
19 May 2024
Jill Watson: A Virtual Teaching Assistant powered by ChatGPT
Karan Taneja
Pratyusha Maiti
Sandeep Kakar
P. Guruprasad
Sanjeev Rao
Ashok K. Goel
77
23
0
17 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
111
79
0
17 May 2024
INDUS: Effective and Efficient Language Models for Scientific Applications
Bishwaranjan Bhattacharjee
Aashka Trivedi
Masayasu Muraoka
Muthukumaran Ramasubramanian
Takuma Udagawa
...
Peter W. J. Staar
S. Vahidinia
Ryan McGranaghan
A. Mehrabian
Tsendgar Lee
AI4CE
97
6
0
17 May 2024
LFED: A Literary Fiction Evaluation Dataset for Large Language Models
Linhao Yu
Qun Liu
Deyi Xiong
97
1
0
16 May 2024
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Andong Wang
Bo Wu
Sunli Chen
Zhenfang Chen
Haotian Guan
Wei-Ning Lee
Li Erran Li
Chuang Gan
LRM RALM
103
19
0
15 May 2024
Words Blending Boxes. Obfuscating Queries in Information Retrieval using Differential Privacy
Francesco Luigi De Faveri
G. Faggioli
Nicola Ferro
AAML
73
0
0
15 May 2024
Span-Aggregatable, Contextualized Word Embeddings for Effective Phrase Mining
Eyal Orbach
Lev Haikin
Nelly David
Avi Faizakof
52
2
0
12 May 2024
Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts
Wenyu Huang
Guancheng Zhou
Mirella Lapata
Pavlos Vougiouklis
Sébastien Montella
Jeff Z. Pan
KELM
75
5
0
10 May 2024
DOLOMITES: Domain-Specific Long-Form Methodical Tasks
Chaitanya Malaviya
Priyanka Agrawal
Kuzman Ganchev
Pranesh Srinivasan
Fantine Huot
Jonathan Berant
Mark Yatskar
Dipanjan Das
Mirella Lapata
Chris Alberti
71
6
0
09 May 2024
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models
Luke Merrick
Danmei Xu
Gaurav Nuti
Daniel Campos
73
27
0
08 May 2024
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization
Hamed Zamani
Michael Bendersky
118
29
0
05 May 2024
Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding
Zheng Zhao
Emilio Monti
Jens Lehmann
H. Assem
97
33
0
04 May 2024
Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness
Xinran Zhao
Tong Chen
Sihao Chen
Hongming Zhang
Tongshuang Wu
78
9
0
04 May 2024
Learning label-label correlations in Extreme Multi-label Classification via Label Features
Siddhant Kharbanda
Devaansh Gupta
Erik Schultheis
Atmadeep Banerjee
Cho-Jui Hsieh
Rohit Babbar
55
4
0
03 May 2024
Language Fairness in Multilingual Information Retrieval
Eugene Yang
Thomas Janich
James Mayfield
Dawn J Lawrie
66
5
0
02 May 2024
When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively
Tiziano Labruna
Jon Ander Campos
Gorka Azkune
75
13
0
30 Apr 2024
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
Ran Xu
Wenqi Shi
Yue Yu
Yuchen Zhuang
Yanqiao Zhu
M. D. Wang
Joyce C. Ho
Chao Zhang
Carl Yang
LM&MA
97
25
0
29 Apr 2024
Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension
Lin Ai
Zheng Hui
Zizhou Liu
Julia Hirschberg
60
1
0
27 Apr 2024
TIGQA:An Expert Annotated Question Answering Dataset in Tigrinya
Hailay Teklehaymanot
Dren Fazlija
Niloy Ganguly
Gourab K. Patro
Wolfgang Nejdl
96
0
0
26 Apr 2024
XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
João Monteiro
Étienne Marcotte
Pierre-Andre Noel
Valentina Zantedeschi
David Vázquez
Nicolas Chapados
Christopher Pal
Perouz Taslakian
77
5
0
23 Apr 2024
Multi-view Content-aware Indexing for Long Document Retrieval
Kuicai Dong
Derrick-Goh-Xin Deik
Yi Quan Lee
Hao Zhang
Xiangyang Li
Cong Zhang
Yong Liu
78
3
0
23 Apr 2024
A Reproducibility Study of PLAID
Sean MacAvaney
Nicola Tonellotto
71
8
0
23 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
208
61
0
23 Apr 2024
ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval
Kelong Mao
Chenlong Deng
Haonan Chen
Fengran Mo
Zheng Liu
Tetsuya Sakai
Zhicheng Dou
KELM
102
15
0
21 Apr 2024
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases
Shirley Wu
Shiyu Zhao
Michihiro Yasunaga
Kexin Huang
Kaidi Cao
Qian Huang
V. Ioannidis
Karthik Subbian
James Zou
J. Leskovec
96
20
0
19 Apr 2024