Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval


19 August 2021
Xinyu Crystina Zhang
Xueguang Ma
Peng Shi
Jimmy J. Lin

Papers citing "Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval"

26 / 26 papers shown
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Pengjie Ren
Zhumin Chen
Dawei Yin
Z. Ren
RALM
ALM
ELM
LRM
LM&MA
76
285
0
31 Dec 2024
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design
Artem Snegirev
Maria Tikhonova
Anna Maksimova
Alena Fenogenova
Alexander Abramov
31
4
0
22 Aug 2024
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Guangyuan Ma
Yongliang Ma
Xing Wu
Zhenpeng Su
Ming Zhou
Songlin Hu
OOD
41
2
0
20 Aug 2024
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee
Rajarshi Roy
Mengyao Xu
Jonathan Raiman
M. Shoeybi
Bryan Catanzaro
Ming-Yu Liu
RALM
56
145
0
27 May 2024
Domain Adaptation of Multilingual Semantic Search -- Literature Review
Anna Bringmann
Anastasia Zhukova
VLM
41
0
0
05 Feb 2024
IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages
Saiful Haq
Ashutosh Sharma
Pushpak Bhattacharyya
26
2
0
15 Dec 2023
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval
Nandan Thakur
Jianmo Ni
Gustavo Hernández Ábrego
John Wieting
Jimmy J. Lin
Daniel Cer
RALM
31
12
0
10 Nov 2023
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Viet Dac Lai
Chien Van Nguyen
Nghia Trung Ngo
Thuat Nguyen
Franck Dernoncourt
Ryan A. Rossi
Thien Huu Nguyen
ALM
42
130
0
29 Jul 2023
Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval
Shunyu Zhang
Yaobo Liang
Ming Gong
Daxin Jiang
Nan Duan
25
4
0
03 Feb 2023
An Experimental Study on Pretraining Transformers from Scratch for IR
Carlos Lassance
Hervé Déjean
S. Clinchant
28
11
0
25 Jan 2023
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Keshav Santhanam
Jon Saad-Falcon
M. Franz
Omar Khattab
Avirup Sil
Radu Florian
Md Arafat Sultan
Salim Roukos
Matei A. Zaharia
Christopher Potts
OffRL
26
10
0
02 Dec 2022
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing
Peng Shi
Rui Zhang
Richard He Bai
Jimmy J. Lin
RALM
36
42
0
25 Oct 2022
Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking
Tim Baumgärtner
Leonardo F. R. Ribeiro
Nils Reimers
Iryna Gurevych
32
6
0
19 Oct 2022
Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers
Odunayo Ogundepo
Xinyu Crystina Zhang
Jimmy J. Lin
18
2
0
11 Oct 2022
mRobust04: A Multilingual Version of the TREC Robust 2004 Benchmark
Vitor Jeronymo
Mauricio Nascimento
R. Lotufo
Rodrigo Nogueira
23
3
0
27 Sep 2022
RealTime QA: What's the Answer Right Now?
Jungo Kasai
Keisuke Sakaguchi
Yoichi Takahashi
Ronan Le Bras
Akari Asai
Xinyan Velocity Yu
Dragomir R. Radev
Noah A. Smith
Yejin Choi
Kentaro Inui
KELM
45
167
0
27 Jul 2022
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval
Luyu Gao
Xueguang Ma
Jimmy J. Lin
Jamie Callan
32
76
0
11 Mar 2022
HC4: A New Suite of Test Collections for Ad Hoc CLIR
Dawn J Lawrie
J. Mayfield
Douglas W. Oard
Eugene Yang
VLM
AILaw
38
30
0
24 Jan 2022
Unsupervised Dense Information Retrieval with Contrastive Learning
Gautier Izacard
Mathilde Caron
Lucas Hosseini
Sebastian Riedel
Piotr Bojanowski
Armand Joulin
Edouard Grave
RALM
38
813
0
16 Dec 2021
Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations
Ji Xin
Chenyan Xiong
A. Srinivasan
Ankita Sharma
Damien Jose
Paul N. Bennett
VLM
83
41
0
14 Oct 2021
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering
Minghan Li
Jimmy J. Lin
AI4CE
25
9
0
04 Oct 2021
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
L. Bonifacio
Vitor Jeronymo
Hugo Queiroz Abonizio
Israel Campiotti
Marzieh Fadaee
R. Lotufo
Rodrigo Nogueira
40
108
0
31 Aug 2021
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
237
971
0
17 Apr 2021
Overview of the TREC 2020 deep learning track
Nick Craswell
Bhaskar Mitra
Emine Yilmaz
Daniel Fernando Campos
54
368
0
15 Feb 2021
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
239
611
0
13 Oct 2020
Overview of the TREC 2019 deep learning track
Nick Craswell
Bhaskar Mitra
Emine Yilmaz
Daniel Fernando Campos
E. Voorhees
180
466
0
17 Mar 2020