Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.08787
Cited By
Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval
19 August 2021
Xinyu Crystina Zhang
Xueguang Ma
Peng Shi
Jimmy J. Lin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval"
25 / 25 papers shown
Title
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Pengjie Ren
Zhumin Chen
Dawei Yin
Z. Ren
RALM
ALM
ELM
LRM
LM&MA
76
285
0
31 Dec 2024
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design
Artem Snegirev
Maria Tikhonova
Anna Maksimova
Alena Fenogenova
Alexander Abramov
31
4
0
22 Aug 2024
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Guangyuan Ma
Yongliang Ma
Xing Wu
Zhenpeng Su
Ming Zhou
Songlin Hu
OOD
41
2
0
20 Aug 2024
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee
Rajarshi Roy
Mengyao Xu
Jonathan Raiman
M. Shoeybi
Bryan Catanzaro
Ming-Yu Liu
RALM
54
139
0
27 May 2024
Domain Adaptation of Multilingual Semantic Search -- Literature Review
Anna Bringmann
Anastasia Zhukova
VLM
41
0
0
05 Feb 2024
IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages
Saiful Haq
Ashutosh Sharma
Pushpak Bhattacharyya
26
2
0
15 Dec 2023
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval
Nandan Thakur
Jianmo Ni
Gustavo Hernández Ábrego
John Wieting
Jimmy J. Lin
Daniel Cer
RALM
31
12
0
10 Nov 2023
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Viet Dac Lai
Chien Van Nguyen
Nghia Trung Ngo
Thuat Nguyen
Franck Dernoncourt
Ryan A. Rossi
Thien Huu Nguyen
ALM
42
130
0
29 Jul 2023
Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval
Shunyu Zhang
Yaobo Liang
Ming Gong
Daxin Jiang
Nan Duan
25
4
0
03 Feb 2023
An Experimental Study on Pretraining Transformers from Scratch for IR
Carlos Lassance
Hervé Déjean
S. Clinchant
28
11
0
25 Jan 2023
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Keshav Santhanam
Jon Saad-Falcon
M. Franz
Omar Khattab
Avirup Sil
Radu Florian
Md Arafat Sultan
Salim Roukos
Matei A. Zaharia
Christopher Potts
OffRL
26
10
0
02 Dec 2022
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing
Peng Shi
Rui Zhang
Richard He Bai
Jimmy J. Lin
RALM
30
42
0
25 Oct 2022
Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking
Tim Baumgärtner
Leonardo F. R. Ribeiro
Nils Reimers
Iryna Gurevych
29
6
0
19 Oct 2022
Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers
Odunayo Ogundepo
Xinyu Crystina Zhang
Jimmy J. Lin
18
2
0
11 Oct 2022
mRobust04: A Multilingual Version of the TREC Robust 2004 Benchmark
Vitor Jeronymo
Mauricio Nascimento
R. Lotufo
Rodrigo Nogueira
17
3
0
27 Sep 2022
RealTime QA: What's the Answer Right Now?
Jungo Kasai
Keisuke Sakaguchi
Yoichi Takahashi
Ronan Le Bras
Akari Asai
Xinyan Velocity Yu
Dragomir R. Radev
Noah A. Smith
Yejin Choi
Kentaro Inui
KELM
45
165
0
27 Jul 2022
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval
Luyu Gao
Xueguang Ma
Jimmy J. Lin
Jamie Callan
32
76
0
11 Mar 2022
Unsupervised Dense Information Retrieval with Contrastive Learning
Gautier Izacard
Mathilde Caron
Lucas Hosseini
Sebastian Riedel
Piotr Bojanowski
Armand Joulin
Edouard Grave
RALM
38
808
0
16 Dec 2021
Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations
Ji Xin
Chenyan Xiong
A. Srinivasan
Ankita Sharma
Damien Jose
Paul N. Bennett
VLM
83
41
0
14 Oct 2021
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering
Minghan Li
Jimmy J. Lin
AI4CE
25
9
0
04 Oct 2021
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
L. Bonifacio
Vitor Jeronymo
Hugo Queiroz Abonizio
Israel Campiotti
Marzieh Fadaee
R. Lotufo
Rodrigo Nogueira
37
108
0
31 Aug 2021
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
234
971
0
17 Apr 2021
Overview of the TREC 2020 deep learning track
Nick Craswell
Bhaskar Mitra
Emine Yilmaz
Daniel Fernando Campos
54
368
0
15 Feb 2021
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
239
611
0
13 Oct 2020
Overview of the TREC 2019 deep learning track
Nick Craswell
Bhaskar Mitra
Emine Yilmaz
Daniel Fernando Campos
E. Voorhees
180
465
0
17 Mar 2020
1