ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.03281
  4. Cited By
Towards General Text Embeddings with Multi-stage Contrastive Learning

Towards General Text Embeddings with Multi-stage Contrastive Learning

7 August 2023
Zehan Li
Xin Zhang
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
ArXivPDFHTML

Papers citing "Towards General Text Embeddings with Multi-stage Contrastive Learning"

50 / 224 papers shown
Title
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs
Chuan Xu
Qiaosheng Chen
Yutong Feng
Gong Cheng
RALM
3DV
VLM
36
0
0
16 May 2025
Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation
Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation
Sheng Liang
Hang Lv
Zhihao Wen
Yaxiong Wu
Yuhang Zhang
Hao Wang
Yong-Jin Liu
18
0
0
13 May 2025
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
Shunyao Wang
Ming Cheng
Christina Dan Wang
AIFin
25
0
0
11 May 2025
Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy
Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy
Haoqi Wu
Wei Dai
Li Wang
Qiang Yan
SILM
35
0
0
09 May 2025
UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections
UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections
Fatima Haouari
Carolina Scarton
Nicolò Faggiani
Nikolaos Nikolaidis
Bonka Kotseva
Ibrahim Abu Farha
Jens Linge
Kalina Bontcheva
46
0
0
08 May 2025
SweRank: Software Issue Localization with Code Ranking
SweRank: Software Issue Localization with Code Ranking
R. Reddy
Tarun Suresh
JaeHyeok Doo
Yong-Jin Liu
Xuan-Phi Nguyen
Yingbo Zhou
Semih Yavuz
Caiming Xiong
Heng Ji
Shafiq R. Joty
29
0
0
07 May 2025
SimAug: Enhancing Recommendation with Pretrained Language Models for Dense and Balanced Data Augmentation
SimAug: Enhancing Recommendation with Pretrained Language Models for Dense and Balanced Data Augmentation
Yuying Zhao
Xiaodong Yang
Huiyuan Chen
Xiran Fan
Yu-Chiang Frank Wang
Y. Cai
Tyler Derr
27
0
0
03 May 2025
PropRAG: Guiding Retrieval with Beam Search over Proposition Paths
PropRAG: Guiding Retrieval with Beam Search over Proposition Paths
Jingjin Wang
LRM
143
0
0
25 Apr 2025
Out-of-the-Box Conditional Text Embeddings from Large Language Models
Out-of-the-Box Conditional Text Embeddings from Large Language Models
Kosuke Yamada
Peinan Zhang
27
0
0
23 Apr 2025
FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation
FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation
Chanyeol Choi
Jihoon Kwon
Jaeseon Ha
Hojun Choi
Chaewoon Kim
Yongjae Lee
Jy-yong Sohn
Alejandro Lopez-Lira
RALM
58
0
0
22 Apr 2025
POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications
POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications
Chunjing Gan
Dan Yang
Binbin Hu
Ziqi Liu
Yue Shen
Qing Cui
J. Wang
Jun Zhou
30
0
0
21 Apr 2025
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search
Quy-Anh Dang
Chris Ngo
Truong Son-Hy
AAML
SyDa
33
0
0
21 Apr 2025
Don't Retrieve, Generate: Prompting LLMs for Synthetic Training Data in Dense Retrieval
Don't Retrieve, Generate: Prompting LLMs for Synthetic Training Data in Dense Retrieval
Aarush Sinha
RALM
73
0
0
20 Apr 2025
Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts
Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts
Zhenkui Yang
Z. Huang
Ge Wang
H. Ding
Tony Xiao Han
Fei-Yue Wang
32
0
0
20 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
37
0
0
18 Apr 2025
Estimating Optimal Context Length for Hybrid Retrieval-augmented Multi-document Summarization
Estimating Optimal Context Length for Hybrid Retrieval-augmented Multi-document Summarization
Adithya Pratapa
Teruko Mitamura
RALM
34
0
0
17 Apr 2025
A Survey of Personalization: From RAG to Agent
A Survey of Personalization: From RAG to Agent
Xiaopeng Li
Pengyue Jia
Derong Xu
Yi Wen
Yingyi Zhang
...
X. Li
Y. Liu
Huifeng Guo
Ruiming Tang
Xiangyu Zhao
3DV
AI4TS
AI4CE
44
0
0
14 Apr 2025
VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
Ryota Tanaka
Taichi Iki
Taku Hasegawa
Kyosuke Nishida
Kuniko Saito
Jun Suzuki
VLM
52
1
0
14 Apr 2025
Scholar Inbox: Personalized Paper Recommendations for Scientists
Scholar Inbox: Personalized Paper Recommendations for Scientists
Markus Flicke
Glenn Angrabeit
Madhav Iyengar
Vitalii Protsenko
Illia Shakun
...
Lukas Schuler
Lewin Scholz
Kavyanjali Agnihotri
Yong Cao
Andreas Geiger
33
0
0
11 Apr 2025
PathGPT: Leveraging Large Language Models for Personalized Route Generation
PathGPT: Leveraging Large Language Models for Personalized Route Generation
Steeve Cuthbert Marcelyn
Yucen Gao
Yuzhe Zhang
Xiaofeng Gao
Guihai Chen
30
0
0
08 Apr 2025
GOLLuM: Gaussian Process Optimized LLMs -- Reframing LLM Finetuning through Bayesian Optimization
GOLLuM: Gaussian Process Optimized LLMs -- Reframing LLM Finetuning through Bayesian Optimization
Bojana Ranković
P. Schwaller
BDL
172
0
0
08 Apr 2025
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration
Ran Xu
W. Shi
Yuchen Zhuang
Yue Yu
Joyce C. Ho
Haoyu Wang
Carl Yang
26
1
0
07 Apr 2025
The Use of Gaze-Derived Confidence of Inferred Operator Intent in Adjusting Safety-Conscious Haptic Assistance
The Use of Gaze-Derived Confidence of Inferred Operator Intent in Adjusting Safety-Conscious Haptic Assistance
Jeremy D. Webb
Michael Bowman
Songpo Li
Xiaoli Zhang
34
0
0
04 Apr 2025
Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking
Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking
Chris Samarinas
Hamed Zamani
ALM
LRM
74
0
0
04 Apr 2025
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data
Waris Gill
Justin Cechmanek
Tyler Hutcherson
Srijith Rajamohan
Jen Agarwal
Muhammad Ali Gulzar
Manvinder Singh
Benoit Dion
35
0
0
03 Apr 2025
Context-Aware Human Behavior Prediction Using Multimodal Large Language Models: Challenges and Insights
Context-Aware Human Behavior Prediction Using Multimodal Large Language Models: Challenges and Insights
Yuchen Liu
Lino Lerch
Luigi Palmieri
Andrey Rudenko
Sebastian Koch
Timo Ropinski
Marco Aiello
60
0
0
01 Apr 2025
Enhancing Negation Awareness in Universal Text Embeddings: A Data-efficient and Computational-efficient Approach
Enhancing Negation Awareness in Universal Text Embeddings: A Data-efficient and Computational-efficient Approach
Hongliu Cao
67
0
0
01 Apr 2025
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Junyu Xie
Tengda Han
Max Bain
Arsha Nagrani
Eshika Khandelwal
Gül Varol
Weidi Xie
Andrew Zisserman
DiffM
VGen
59
0
0
01 Apr 2025
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations
Yubo Wang
Xueguang Ma
Ping Nie
Huaye Zeng
Zhiheng Lyu
Yuyao Zhang
Benjamin Schneider
Yi Lu
Xiang Yue
Wenhu Chen
RALM
48
0
0
01 Apr 2025
Universal Zero-shot Embedding Inversion
Universal Zero-shot Embedding Inversion
Collin Zhang
John X. Morris
Vitaly Shmatikov
53
1
0
31 Mar 2025
Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval
Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval
Sangam Lee
Ryang Heo
SeongKu Kang
Dongha Lee
RALM
56
1
0
29 Mar 2025
Improving the Context Length and Efficiency of Code Retrieval for Tracing Security Vulnerability Fixes
Improving the Context Length and Efficiency of Code Retrieval for Tracing Security Vulnerability Fixes
Xueqing Liu
Jiangrui Zheng
Guanqun Yang
Siyan Wen
Qiushi Liu
48
0
0
29 Mar 2025
Spend Your Budget Wisely: Towards an Intelligent Distribution of the Privacy Budget in Differentially Private Text Rewriting
Spend Your Budget Wisely: Towards an Intelligent Distribution of the Privacy Budget in Differentially Private Text Rewriting
Stephen Meisenbacher
Chaeeun Joy Lee
Florian Matthes
46
0
0
28 Mar 2025
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
Gaifan Zhang
Yi Zhou
Danushka Bollegala
153
0
0
21 Mar 2025
MultiConIR: Towards multi-condition Information Retrieval
Xuan Lu
Sifan Liu
Bochao Yin
Y. K. Li
Xinghao Chen
Hui Su
Yaohui Jin
Wenjun Zeng
Xiaoyu Shen
74
0
0
13 Mar 2025
Advancing Vietnamese Information Retrieval with Learning Objective and Benchmark
Phu-Vinh Nguyen
Minh-Nam Tran
Long H. B. Nguyen
D. Dinh
61
0
0
10 Mar 2025
Enhancing Retrieval for ESGLLM via ESG-CID -- A Disclosure Content Index Finetuning Dataset for Mapping GRI and ESRS
Shafiuddin Rehan Ahmed
A. Shah
Quan Hung Tran
Vivek Khetan
Sukryool Kang
Ankit Mehta
Yujia Bao
Wei Wei
41
0
0
10 Mar 2025
LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue
Sangyeop Kim
S. Park
Jaewon Jung
Jinseok Kim
Sungzoon Cho
45
0
0
06 Mar 2025
SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing
Xiangchao Yan
Shiyang Feng
Jiakang Yuan
Renqiu Xia
Bin Wang
Bo Zhang
Junlin Wu
60
2
0
06 Mar 2025
Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization
Yilun Qiu
Xiaoyan Zhao
Yang Zhang
Yimeng Bai
Luu Anh Tuan
Hong Cheng
Fuli Feng
Tat-Seng Chua
55
1
0
04 Mar 2025
An Efficient Plugin Method for Metric Optimization of Black-Box Models
Siddartha Devic
Nurendra Choudhary
Anirudh Srinivasan
Sahika Genc
B. Kveton
G. Hiranandani
41
0
0
03 Mar 2025
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking
Zhuoqun Li
Haiyang Yu
Xuanang Chen
Hongyu Lin
Yaojie Lu
Fei Huang
Xianpei Han
Heng Chang
Le Sun
59
4
0
28 Feb 2025
Granite Embedding Models
Granite Embedding Models
Parul Awasthy
Aashka Trivedi
Yulong Li
Mihaela A. Bornea
David D. Cox
...
Sukriti Sharma
Avirup Sil
Kate Soule
Arafat Sultan
Radu Florian
RALM
70
1
0
27 Feb 2025
Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data Augmentation
Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data Augmentation
Manveer Singh Tamber
Suleman Kazi
Vivek Sourabh
Jimmy J. Lin
71
1
0
27 Feb 2025
NeoBERT: A Next-Generation BERT
NeoBERT: A Next-Generation BERT
Lola Le Breton
Quentin Fournier
Mariam El Mezouar
Sarath Chandar
AI4TS
68
1
0
26 Feb 2025
On Synthetic Data Strategies for Domain-Specific Generative Retrieval
On Synthetic Data Strategies for Domain-Specific Generative Retrieval
Haoyang Wen
Jiang Guo
Yi Zhang
Jiarong Jiang
Zhilin Wang
SyDa
74
0
0
25 Feb 2025
DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers
DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers
Xueguang Ma
Xi Lin
Barlas Oğuz
Jimmy Lin
Wen-tau Yih
Xilun Chen
RALM
85
3
0
25 Feb 2025
Large Language Models are Powerful EHR Encoders
Large Language Models are Powerful EHR Encoders
S. Hegselmann
Georg von Arnim
Tillmann Rheude
Noel Kronenberg
David Sontag
Gerhard Hindricks
R. Eils
Benjamin Wild
LM&MA
49
1
0
24 Feb 2025
Mitigating Bias in RAG: Controlling the Embedder
Mitigating Bias in RAG: Controlling the Embedder
Taeyoun Kim
Jacob Mitchell Springer
Aditi Raghunathan
Maarten Sap
58
1
0
24 Feb 2025
Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment
Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment
Haoyuan Wu
Haisheng Zheng
Yuan Pu
Bei Yu
61
1
0
18 Feb 2025
12345
Next