ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.03281
  4. Cited By
Towards General Text Embeddings with Multi-stage Contrastive Learning

Towards General Text Embeddings with Multi-stage Contrastive Learning

7 August 2023
Zehan Li
Xin Zhang
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
ArXivPDFHTML

Papers citing "Towards General Text Embeddings with Multi-stage Contrastive Learning"

50 / 225 papers shown
Title
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
David Grangier
Simin Fan
Skyler Seto
Pierre Ablin
44
3
0
30 Sep 2024
Do We Need Domain-Specific Embedding Models? An Empirical Investigation
Do We Need Domain-Specific Embedding Models? An Empirical Investigation
Yixuan Tang
Yi Yang
AIFin
50
3
0
27 Sep 2024
IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through
  Semantic Comprehension in Retrieval-Augmented Generation Scenarios
IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios
Hai Lin
Shaoxiong Zhan
Junyou Su
Haitao Zheng
Hui Wang
RALM
37
1
0
24 Sep 2024
Making Text Embedders Few-Shot Learners
Making Text Embedders Few-Shot Learners
Chaofan Li
Minghao Qin
Shitao Xiao
Jianlyu Chen
Kun Luo
Yingxia Shao
Defu Lian
Zheng Liu
35
23
0
24 Sep 2024
Lessons Learned on Information Retrieval in Electronic Health Records: A
  Comparison of Embedding Models and Pooling Strategies
Lessons Learned on Information Retrieval in Electronic Health Records: A Comparison of Embedding Models and Pooling Strategies
Skatje Myers
Timothy A. Miller
Yanjun Gao
M. Churpek
Anoop Mayampurath
Dmitriy Dligach
Majid Afshar
28
3
0
23 Sep 2024
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining
  for Clinical LLMs
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Clément Christophe
Tathagata Raha
Svetlana Maslenkova
Muhammad Umar Salman
Praveen K Kanithi
Marco AF Pimentel
Shadab Khan
LM&MA
41
2
0
23 Sep 2024
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling
Blessed Guda
Gabrial Zencha A.
Lawrence Francis
Carlee Joe-Wong
28
1
0
21 Sep 2024
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like
  Language Models
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
Orion Weller
Benjamin Van Durme
Dawn J Lawrie
Ashwin Paranjape
Yuhao Zhang
Jack Hessel
LRM
RALM
57
17
0
17 Sep 2024
LLMs4OL 2024 Overview: The 1st Large Language Models for Ontology
  Learning Challenge
LLMs4OL 2024 Overview: The 1st Large Language Models for Ontology Learning Challenge
Hamed Babaei Giglou
Jennifer D'Souza
Sören Auer
51
8
0
16 Sep 2024
Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks
  against RAG-based Inference in Scale and Severity Using Jailbreaking
Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks against RAG-based Inference in Scale and Severity Using Jailbreaking
Stav Cohen
Ron Bitton
Ben Nassi
44
4
0
12 Sep 2024
E2LLM: Encoder Elongated Large Language Models for Long-Context
  Understanding and Reasoning
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning
Zihan Liao
Jun Wang
Hang Yu
Lingxiao Wei
Jianguo Li
Jun Wang
Wei Zhang
24
2
0
10 Sep 2024
Rx Strategist: Prescription Verification using LLM Agents System
Rx Strategist: Prescription Verification using LLM Agents System
Phuc Phan Van
Dat Nguyen Minh
An Dinh Ngoc
Huy-Phan Thanh
OffRL
36
1
0
05 Sep 2024
Pooling And Attention: What Are Effective Designs For LLM-Based
  Embedding Models?
Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?
Yixuan Tang
Yi Yang
33
3
0
04 Sep 2024
Prompt Compression with Context-Aware Sentence Encoding for Fast and
  Improved LLM Inference
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference
Barys Liskavets
Maxim Ushakov
Shuvendu Roy
Mark Klibanov
Ali Etemad
Shane Luke
46
6
0
02 Sep 2024
Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction
  Retriever
Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever
Rohan Jha
Bo Wang
Michael Gunther
Georgios Mastrapas
Saba Sturua
Isabelle Mohr
Andreas Koukounas
Mohammad Kalim Akram
Nan Wang
Han Xiao
34
2
0
29 Aug 2024
Conan-embedding: General Text Embedding with More and Better Negative
  Samples
Conan-embedding: General Text Embedding with More and Better Negative Samples
Shiyu Li
Yang Tang
Shizhe Chen
Xi Chen
18
3
0
28 Aug 2024
DSTI at LLMs4OL 2024 Task A: Intrinsic versus extrinsic knowledge for
  type classification
DSTI at LLMs4OL 2024 Task A: Intrinsic versus extrinsic knowledge for type classification
Hanna Abi Akl
24
1
0
26 Aug 2024
IntelliCare: Improving Healthcare Analysis with Variance-Controlled
  Patient-Level Knowledge from Large Language Models
IntelliCare: Improving Healthcare Analysis with Variance-Controlled Patient-Level Knowledge from Large Language Models
Zhihao Yu
Yujie Jin
Yongxin Xu
Xu Chu
Yasha Wang
Junfeng Zhao
32
0
0
23 Aug 2024
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design
The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design
Artem Snegirev
Maria Tikhonova
Anna Maksimova
Alena Fenogenova
Alexander Abramov
34
4
0
22 Aug 2024
Mistral-SPLADE: LLMs for better Learned Sparse Retrieval
Mistral-SPLADE: LLMs for better Learned Sparse Retrieval
Meet Doshi
Vishwajeet Kumar
Rudra Murthy
Vignesh P
Jaydeep Sen
RALM
39
2
0
20 Aug 2024
ColBERT Retrieval and Ensemble Response Scoring for Language Model
  Question Answering
ColBERT Retrieval and Ensemble Response Scoring for Language Model Question Answering
Alex Gichamba
Tewodros Kederalah Idris
Brian Ebiyau
Eric Nyberg
Teruko Mitamura
23
0
0
20 Aug 2024
Improving embedding with contrastive fine-tuning on small datasets with
  expert-augmented scores
Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores
Jun Lu
David Li
Bill Ding
Yu Kang
64
3
0
19 Aug 2024
Understanding Generative AI Content with Embedding Models
Understanding Generative AI Content with Embedding Models
Max Vargas
Reilly Cannon
A. Engel
Anand D. Sarwate
Tony Chiang
54
3
0
19 Aug 2024
Moonshine: Distilling Game Content Generators into Steerable Generative Models
Moonshine: Distilling Game Content Generators into Steerable Generative Models
Yuhe Nie
Michael Middleton
Tim Merino
Nidhushan Kanagaraja
Ashutosh Kumar
Zhan Zhuang
Julian Togelius
53
0
0
18 Aug 2024
wav2graph: A Framework for Supervised Learning Knowledge Graph from
  Speech
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech
Khai Le-Duc
Quy-Anh Dang
Tan-Hanh Pham
Truong-Son Hy
32
0
0
08 Aug 2024
DebateQA: Evaluating Question Answering on Debatable Knowledge
DebateQA: Evaluating Question Answering on Debatable Knowledge
Rongwu Xu
Xuan Qi
Zehan Qi
Wei Xu
Zhijiang Guo
ELM
53
5
0
02 Aug 2024
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Kunlun Zhu
Yifan Luo
Dingling Xu
Ruobing Wang
Shi Yu
...
Yishan Li
Zhiyuan Liu
Xu Han
Zhiyuan Liu
Maosong Sun
34
17
0
02 Aug 2024
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods
Gabriel Loiseau
Damien Sileo
Damien Riquet
Maxime Meyer
Marc Tommasi
46
0
0
31 Jul 2024
Language-Conditioned Offline RL for Multi-Robot Navigation
Language-Conditioned Offline RL for Multi-Robot Navigation
Steven D. Morad
Ajay Shankar
J. Blumenkamp
Amanda Prorok
LM&Ro
OffRL
48
6
0
29 Jul 2024
Motion Manifold Flow Primitives for Task-Conditioned Trajectory Generation under Complex Task-Motion Dependencies
Motion Manifold Flow Primitives for Task-Conditioned Trajectory Generation under Complex Task-Motion Dependencies
Yonghyeon Lee
Byeongho Lee
Seungyeon Kim
Frank C. Park
36
1
0
29 Jul 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models
  for Multilingual Text Retrieval
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Xin Zhang
Yanzhao Zhang
Dingkun Long
Wen Xie
Ziqi Dai
...
Pengjun Xie
Fei Huang
Meishan Zhang
Wenjie Li
Min Zhang
42
78
0
29 Jul 2024
Open Sentence Embeddings for Portuguese with the Serafim PT* encoders
  family
Open Sentence Embeddings for Portuguese with the Serafim PT* encoders family
Luís Gomes
António Branco
Joao Silva
João Rodrigues
Rodrigo Santos
3DV
26
0
0
28 Jul 2024
Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable
  Frameworks
Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable Frameworks
Yunfan Gao
Yun Xiong
Meng Wang
Haofen Wang
39
17
0
26 Jul 2024
Revolutionizing Undergraduate Learning: CourseGPT and Its Generative AI
  Advancements
Revolutionizing Undergraduate Learning: CourseGPT and Its Generative AI Advancements
Ahmad M. Nazar
Mohamed Y. Selim
Ashraf Gaffar
Shakil Ahmed
37
2
0
25 Jul 2024
Exploring Description-Augmented Dataless Intent Classification
Exploring Description-Augmented Dataless Intent Classification
Ruoyu Hu
Foaad Khosmood
Abbas Edalat
AI4TS
45
0
0
25 Jul 2024
UniMEL: A Unified Framework for Multimodal Entity Linking with Large
  Language Models
UniMEL: A Unified Framework for Multimodal Entity Linking with Large Language Models
Liu Qi
Yongyi He
Lian Defu
Zhi Zheng
Tong Xu
Liu Che
Chen Enhong
MLLM
39
1
0
23 Jul 2024
NV-Retriever: Improving text embedding models with effective hard-negative mining
NV-Retriever: Improving text embedding models with effective hard-negative mining
Gabriel de Souza P. Moreira
Radek Osmulski
Mengyao Xu
Ronay Ak
Benedikt Schifferer
Even Oldridge
RALM
49
31
0
22 Jul 2024
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Peng Xu
Ming-Yu Liu
Xianchao Wu
Zihan Liu
M. Shoeybi
Mohammad Shoeybi
Bryan Catanzaro
RALM
52
14
0
19 Jul 2024
Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller
  Embedding Dimensions
Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions
Jinsung Yoon
Raj Sinha
Sercan Ö. Arik
Tomas Pfister
24
1
0
17 Jul 2024
$\textit{GeoHard}$: Towards Measuring Class-wise Hardness through
  Modelling Class Semantics
GeoHard\textit{GeoHard}GeoHard: Towards Measuring Class-wise Hardness through Modelling Class Semantics
Fengyu Cai
Xinran Zhao
Hongming Zhang
Iryna Gurevych
Heinz Koeppl
34
0
0
17 Jul 2024
ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context
ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context
Victoria R. Li
Yida Chen
Naomi Saphra
43
3
0
09 Jul 2024
LETS-C: Leveraging Language Embedding for Time Series Classification
LETS-C: Leveraging Language Embedding for Time Series Classification
Rachneet Kaur
Zhen Zeng
T. Balch
Manuela Veloso
AI4TS
41
0
0
09 Jul 2024
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models
Xiangyang Li
Kuicai Dong
Yi Quan Lee
Wei Xia
Yichun Yin
Xinyi Dai
Yasheng Wang
Ruiming Tang
65
15
0
03 Jul 2024
MeMemo: On-device Retrieval Augmentation for Private and Personalized
  Text Generation
MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation
Zijie J. Wang
Duen Horng Chau
51
4
0
02 Jul 2024
Searching for Best Practices in Retrieval-Augmented Generation
Searching for Best Practices in Retrieval-Augmented Generation
Xiaohua Wang
Zhenghua Wang
Xuan Gao
Feiran Zhang
Yixin Wu
...
Qi Qian
Ruicheng Yin
Changze Lv
Xiaoqing Zheng
Xuanjing Huang
60
41
0
01 Jul 2024
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
David Rau
Hervé Déjean
Nadezhda Chirkova
Thibault Formal
Shuai Wang
Vassilina Nikoulina
S. Clinchant
45
12
0
01 Jul 2024
ProductAgent: Benchmarking Conversational Product Search Agent with
  Asking Clarification Questions
ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions
Jingheng Ye
Yong Jiang
Xiaobin Wang
Hai-Tao Zheng
Yangning Li
Hai-Tao Zheng
Pengjun Xie
Fei Huang
46
2
0
01 Jul 2024
PFME: A Modular Approach for Fine-grained Hallucination Detection and
  Editing of Large Language Models
PFME: A Modular Approach for Fine-grained Hallucination Detection and Editing of Large Language Models
Kunquan Deng
Zeyu Huang
Chen Li
Chenghua Lin
Min Gao
Wenge Rong
KELM
36
0
0
29 Jun 2024
Retrieval Augmented Instruction Tuning for Open NER with Large Language
  Models
Retrieval Augmented Instruction Tuning for Open NER with Large Language Models
Tingyu Xie
Jian Zhang
Yan Zhang
Yuanyuan Liang
Qi Li
Hongwei Wang
RALM
40
0
0
25 Jun 2024
D2LLM: Decomposed and Distilled Large Language Models for Semantic
  Search
D2LLM: Decomposed and Distilled Large Language Models for Semantic Search
Zihan Liao
Hang Yu
Jianguo Li
Jun Wang
Wei Zhang
36
3
0
25 Jun 2024
Previous
12345
Next