ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.03281
  4. Cited By
Towards General Text Embeddings with Multi-stage Contrastive Learning

Towards General Text Embeddings with Multi-stage Contrastive Learning

7 August 2023
Zehan Li
Xin Zhang
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
ArXiv (abs)PDFHTML

Papers citing "Towards General Text Embeddings with Multi-stage Contrastive Learning"

50 / 260 papers shown
Title
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
Xin Zhang
Yanzhao Zhang
Wen Xie
Mingxin Li
Ziqi Dai
Dingkun Long
Pengjun Xie
Meishan Zhang
Wenjie Li
Hao Fei
235
20
0
22 Dec 2024
LLMs are Also Effective Embedding Models: An In-depth Overview
LLMs are Also Effective Embedding Models: An In-depth Overview
Chongyang Tao
Tao Shen
Shen Gao
Junshuo Zhang
Zhen Li
Zhengwei Tao
Shuai Ma
143
11
0
17 Dec 2024
Persona-SQ: A Personalized Suggested Question Generation Framework For
  Real-world Documents
Persona-SQ: A Personalized Suggested Question Generation Framework For Real-world Documents
Zihao Lin
Ziyi Wang
Yuanting Pan
Varun Manjunatha
Ryan Rossi
Angela Lau
Lifu Huang
Tong Sun
RALM
158
0
0
17 Dec 2024
GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for
  Unsupervised Reverse Dictionary
GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary
F. Almeman
Luis Espinosa-Anke
104
0
0
09 Dec 2024
Enhancing LLMs for Impression Generation in Radiology Reports through a
  Multi-Agent System
Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System
Fang Zeng
Zhiliang Lyu
Quanzheng Li
Xiang Li
90
4
0
06 Dec 2024
DoubleCCA: Improving Foundation Model Group Robustness with Random
  Sentence Embeddings
DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings
Hong Liu
Yitong Lu
198
0
0
25 Nov 2024
From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive
  Grammars
From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars
Albert Kornilov
Tatiana Shavrina
106
1
0
23 Nov 2024
Locating the Leading Edge of Cultural Change
Locating the Leading Edge of Cultural Change
Sarah Griebel
Becca Cohen
Lucian Li
Jaihyun Park
Jiayu Liu
Jana Perkins
Ted Underwood
93
1
0
22 Nov 2024
Efficient Aspect-Based Summarization of Climate Change Reports with
  Small Language Models
Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models
Iacopo Ghinassi
Leonardo Catalano
Tommaso Colella
105
1
0
21 Nov 2024
Writing Style Matters: An Examination of Bias and Fairness in
  Information Retrieval Systems
Writing Style Matters: An Examination of Bias and Fairness in Information Retrieval Systems
Hongliu Cao
127
4
0
20 Nov 2024
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and
  Multi-task Code Retrieval
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval
Yongxu Liu
Rui Meng
Shafiq Joty
Silvio Savarese
Caiming Xiong
Yingbo Zhou
Semih Yavuz
165
8
0
19 Nov 2024
CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal
  Large Language Models
CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models
Junho Kim
Hyungjin Chung
Byung-Hoon Kim
VLM
117
0
0
11 Nov 2024
Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers
Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers
Zhichao Geng
Y. Wang
Dongyu Ru
Yang Yang
62
2
0
07 Nov 2024
Enhancing Table Representations with LLM-powered Synthetic Data
  Generation
Enhancing Table Representations with LLM-powered Synthetic Data Generation
Dayu Yang
Natawut Monaikul
Amanda Ding
Bozhao Tan
Kishore Mosaliganti
Giri Iyengar
49
2
0
04 Nov 2024
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic
  Vision-Language Negatives
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Maitreya Patel
Abhiram Kusumba
Sheng Cheng
Changhoon Kim
Tejas Gokhale
Chitta Baral
Yezhou Yang
CoGeCLIP
143
14
0
04 Nov 2024
Evaluating Creative Short Story Generation in Humans and Large Language Models
Evaluating Creative Short Story Generation in Humans and Large Language Models
Mete Ismayilzada
Claire Stevenson
Lonneke van der Plas
LM&MALRM
138
5
0
04 Nov 2024
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset
  for Security Research
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research
Sian-Yao Huang
Cheng-Lin Yang
Hongpeng Zhou
Chun-Ying Huang
83
2
0
02 Nov 2024
The Automated Verification of Textual Claims (AVeriTeC) Shared Task
The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Michael Schlichtkrull
Yulong Chen
Chenxi Whitehouse
Zhenyun Deng
Mubashara Akhtar
...
Christos Christodoulopoulos
O. Cocarascu
Arpit Mittal
James Thorne
Andreas Vlachos
100
9
0
31 Oct 2024
From Context to Action: Analysis of the Impact of State Representation
  and Context on the Generalization of Multi-Turn Web Navigation Agents
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents
Nalin Tiwary
Vardhan Dongre
Sanil Arun Chawla
Ashwin Lamani
Dilek Hakkani-Tur
LLMAG
63
1
0
31 Oct 2024
Synergizing LLM Agents and Knowledge Graph for Socioeconomic Prediction
  in LBSN
Synergizing LLM Agents and Knowledge Graph for Socioeconomic Prediction in LBSN
Zhilun Zhou
Jingyang Fan
Yu Liu
Fengli Xu
Depeng Jin
Yong Li
47
1
0
29 Oct 2024
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive
  Learning
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Xun Guo
Shan Zhang
Yongxin He
Ting Zhang
Wanquan Feng
Haibin Huang
Chongyang Ma
DeLMO
86
10
0
28 Oct 2024
Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation
Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation
Mufei Li
Siqi Miao
Pan Li
RALM
203
17
0
28 Oct 2024
AutoMIR: Effective Zero-Shot Medical Information Retrieval without Relevance Labels
AutoMIR: Effective Zero-Shot Medical Information Retrieval without Relevance Labels
Lei Li
Xiangxu Zhang
Xiao Zhou
Zheng Liu
VLMRALM
121
3
0
26 Oct 2024
Little Giants: Synthesizing High-Quality Embedding Data at Scale
Little Giants: Synthesizing High-Quality Embedding Data at Scale
Haonan Chen
Liang Wang
Nan Yang
Yinlin Zhu
Ziliang Zhao
Furu Wei
Zhicheng Dou
SyDa
74
2
0
24 Oct 2024
An Adaptive Framework for Generating Systematic Explanatory Answer in
  Online Q&A Platforms
An Adaptive Framework for Generating Systematic Explanatory Answer in Online Q&A Platforms
Ziyang Chen
Xiaobin Wang
Yong Jiang
Jinzhi Liao
Pengjun Xie
Fei Huang
Xiang Zhao
450
0
0
23 Oct 2024
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
Chen Yang
Chenyang Zhao
Q. Gu
Dongruo Zhou
LRM
76
0
0
22 Oct 2024
Large Language Models Are Overparameterized Text Encoders
Large Language Models Are Overparameterized Text Encoders
Thennal D K
Tim Fischer
Chris Biemann
85
2
0
18 Oct 2024
CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for
  Retrieval-Augmented Generation with Enhanced Data Diversity
CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for Retrieval-Augmented Generation with Enhanced Data Diversity
Jintao Liu
Ruixue Ding
Linhao Zhang
Pengjun Xie
Fie Huang
54
5
0
16 Oct 2024
On Debiasing Text Embeddings Through Context Injection
On Debiasing Text Embeddings Through Context Injection
Thomas Uriot
67
0
0
14 Oct 2024
EasyRAG: Efficient Retrieval-Augmented Generation Framework for
  Automated Network Operations
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations
Zhangchi Feng
Dongdong Kuang
Zhongyuan Wang
Zhijie Nie
Yaowei Zheng
Richong Zhang
43
2
0
14 Oct 2024
HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific
  Citation Prediction
HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction
Qianyue Hao
Jingyang Fan
Fengli Xu
Jian Yuan
Yong Li
67
9
0
10 Oct 2024
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed
  KV Caches for Chunked Text
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
Songshuo Lu
Hua Wang
Yutian Rong
Zhi Chen
Yaohua Tang
VLM
88
18
0
10 Oct 2024
Exploring the Meaningfulness of Nearest Neighbor Search in
  High-Dimensional Space
Exploring the Meaningfulness of Nearest Neighbor Search in High-Dimensional Space
Zhonghan Chen
Ruiyuan Zhang
Xi Zhao
Xiaojun Cheng
Xiaofang Zhou
85
0
0
08 Oct 2024
SAG: Style-Aligned Article Generation via Model Collaboration
SAG: Style-Aligned Article Generation via Model Collaboration
Chenning Xu
Fangxun Shu
Dian Jin
Jinghao Wei
Hao Jiang
ALMSyDa
63
0
0
04 Oct 2024
Reward-RAG: Enhancing RAG with Reward Driven Supervision
Reward-RAG: Enhancing RAG with Reward Driven Supervision
Thang Nguyen
Peter Chin
Yu-Wing Tai
RALM
130
5
0
03 Oct 2024
Thinking Outside of the Differential Privacy Box: A Case Study in Text
  Privatization with Language Model Prompting
Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting
Stephen Meisenbacher
Florian Matthes
64
3
0
01 Oct 2024
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge
  Distillation for Large Language Models in Code Generation
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation
Ziyang Luo
Xin Li
Hongzhan Lin
Jing Ma
Lidong Bing
VLM
65
0
0
01 Oct 2024
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling
David Grangier
Simin Fan
Skyler Seto
Pierre Ablin
207
5
0
30 Sep 2024
Do We Need Domain-Specific Embedding Models? An Empirical Investigation
Do We Need Domain-Specific Embedding Models? An Empirical Investigation
Yixuan Tang
Yi Yang
AIFin
195
6
0
27 Sep 2024
IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through
  Semantic Comprehension in Retrieval-Augmented Generation Scenarios
IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios
Hai Lin
Shaoxiong Zhan
Junyou Su
Haitao Zheng
Hui Wang
RALM
58
1
0
24 Sep 2024
Making Text Embedders Few-Shot Learners
Making Text Embedders Few-Shot Learners
Chaofan Li
Minghao Qin
Shitao Xiao
Jianlyu Chen
Kun Luo
Yingxia Shao
Defu Lian
Zheng Liu
111
37
0
24 Sep 2024
Lessons Learned on Information Retrieval in Electronic Health Records: A
  Comparison of Embedding Models and Pooling Strategies
Lessons Learned on Information Retrieval in Electronic Health Records: A Comparison of Embedding Models and Pooling Strategies
Skatje Myers
Timothy A. Miller
Yanjun Gao
M. Churpek
Anoop Mayampurath
Dmitriy Dligach
Majid Afshar
81
3
0
23 Sep 2024
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining
  for Clinical LLMs
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Clément Christophe
Tathagata Raha
Svetlana Maslenkova
Muhammad Umar Salman
Praveen K Kanithi
Marco AF Pimentel
Shadab Khan
LM&MA
68
2
0
23 Sep 2024
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling
Blessed Guda
Gabrial Zencha A.
Lawrence Francis
Carlee Joe-Wong
97
1
0
21 Sep 2024
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like
  Language Models
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
Orion Weller
Benjamin Van Durme
Dawn J Lawrie
Ashwin Paranjape
Yuhao Zhang
Jack Hessel
LRMRALM
97
25
0
17 Sep 2024
LLMs4OL 2024 Overview: The 1st Large Language Models for Ontology
  Learning Challenge
LLMs4OL 2024 Overview: The 1st Large Language Models for Ontology Learning Challenge
Hamed Babaei Giglou
Jennifer D'Souza
Sören Auer
69
8
0
16 Sep 2024
Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks
  against RAG-based Inference in Scale and Severity Using Jailbreaking
Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks against RAG-based Inference in Scale and Severity Using Jailbreaking
Stav Cohen
Ron Bitton
Ben Nassi
85
7
0
12 Sep 2024
E2LLM: Encoder Elongated Large Language Models for Long-Context
  Understanding and Reasoning
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning
Zihan Liao
Jun Wang
Hang Yu
Lingxiao Wei
Jianguo Li
Jun Wang
Wei Zhang
64
3
0
10 Sep 2024
Rx Strategist: Prescription Verification using LLM Agents System
Rx Strategist: Prescription Verification using LLM Agents System
Phuc Phan Van
Dat Nguyen Minh
An Dinh Ngoc
Huy-Phan Thanh
OffRL
60
2
0
05 Sep 2024
Pooling And Attention: What Are Effective Designs For LLM-Based
  Embedding Models?
Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?
Yixuan Tang
Yi Yang
76
4
0
04 Sep 2024
Previous
123456
Next