Embedding And Clustering Your Data Can Improve Contrastive Pretraining
26 July 2024
Luke Merrick
Papers citing "Embedding And Clustering Your Data Can Improve Contrastive Pretraining"
5 / 5 papers shown
Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
Alex Laitenberger
Christopher D. Manning
Nelson F. Liu
RALM
67
0
0
04 Jun 2025
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management
Kai Yin
Xiangjue Dong
Chengkai Liu
Lipai Huang
Yiming Xiao
Zhewei Liu
Ali Mostafavi
James Caverlee
93
0
0
20 May 2025
Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation
Alireza Salemi
Chris Samarinas
Hamed Zamani
78
0
0
10 Apr 2025
Improved Large Language Model Jailbreak Detection via Pretrained Embeddings
Erick Galinkin
Martin Sablotny
116
3
0
02 Dec 2024
OnlySportsLM: Optimizing Sports-Domain Language Models with SOTA Performance under Billion Parameters
Zexin Chen
Chengxi Li
Xiangyu Xie
Parijat Dube
ALM
64
2
0
30 Aug 2024