ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.03281
  4. Cited By
Towards General Text Embeddings with Multi-stage Contrastive Learning

Towards General Text Embeddings with Multi-stage Contrastive Learning

7 August 2023
Zehan Li
Xin Zhang
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
ArXivPDFHTML

Papers citing "Towards General Text Embeddings with Multi-stage Contrastive Learning"

50 / 227 papers shown
Title
Retrieval Augmented Instruction Tuning for Open NER with Large Language
  Models
Retrieval Augmented Instruction Tuning for Open NER with Large Language Models
Tingyu Xie
Jian Zhang
Yan Zhang
Yuanyuan Liang
Qi Li
Hongwei Wang
RALM
40
0
0
25 Jun 2024
D2LLM: Decomposed and Distilled Large Language Models for Semantic
  Search
D2LLM: Decomposed and Distilled Large Language Models for Semantic Search
Zihan Liao
Hang Yu
Jianguo Li
Jun Wang
Wei Zhang
36
3
0
25 Jun 2024
Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024
  Retrieval-Augmented Generation Track
Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track
Ronak Pradeep
Nandan Thakur
Sahel Sharifymoghaddam
Eric Zhang
Ryan Nguyen
Daniel Campos
Nick Craswell
Jimmy Lin
42
12
0
24 Jun 2024
Enhancing Idiomatic Representation in Multiple Languages via an Adaptive
  Contrastive Triplet Loss
Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss
Wei He
M. Idiart
Carolina Scarton
Aline Villavicencio
42
2
0
21 Jun 2024
Text Serialization and Their Relationship with the Conventional
  Paradigms of Tabular Machine Learning
Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning
Kyoka Ono
Simon A. Lee
LMTD
24
7
0
19 Jun 2024
SparseCL: Sparse Contrastive Learning for Contradiction Retrieval
SparseCL: Sparse Contrastive Learning for Contradiction Retrieval
Haike Xu
Zongyu Lin
Ningyu Zhang
Kai-Wei Chang
Piotr Indyk
45
0
0
15 Jun 2024
Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic
  Textual Similarity
Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity
Bowen Zhang
Chunping Li
50
0
0
14 Jun 2024
Joint Learning of Context and Feedback Embeddings in Spoken Dialogue
Joint Learning of Context and Feedback Embeddings in Spoken Dialogue
Livia Qian
Gabriel Skantze
23
0
0
11 Jun 2024
FoodSky: A Food-oriented Large Language Model that Passes the Chef and
  Dietetic Examination
FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Pengfei Zhou
Weiqing Min
Chaoran Fu
Ying Jin
Mingyu Huang
Xiangyang Li
Shuhuan Mei
Shuqiang Jiang
38
8
0
11 Jun 2024
Curating Grounded Synthetic Data with Global Perspectives for Equitable
  AI
Curating Grounded Synthetic Data with Global Perspectives for Equitable AI
Elin Törnquist
R. Caulk
SyDa
44
4
0
10 Jun 2024
Advancing Semantic Textual Similarity Modeling: A Regression Framework
  with Translated ReLU and Smooth K2 Loss
Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss
Bowen Zhang
Chunping Li
50
2
0
08 Jun 2024
Text-Guided Alternative Image Clustering
Text-Guided Alternative Image Clustering
Andreas Stephan
Lukas Miklautz
Collin Leiber
Pedro Henrique Luz de Araujo
Dominik Répás
Claudia Plant
Benjamin Roth
VLM
32
0
0
07 Jun 2024
CTSyn: A Foundational Model for Cross Tabular Data Generation
CTSyn: A Foundational Model for Cross Tabular Data Generation
Xiaofeng Lin
Chenheng Xu
Matthew Yang
Guang Cheng
43
3
0
07 Jun 2024
Repurposing Language Models into Embedding Models: Finding the
  Compute-Optimal Recipe
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Alicja Ziarko
Albert Q. Jiang
Bartosz Piotrowski
Wenda Li
M. Jamnik
Piotr Miłoś
40
0
0
06 Jun 2024
A Bi-metric Framework for Fast Similarity Search
A Bi-metric Framework for Fast Similarity Search
Haike Xu
Sandeep Silwal
Piotr Indyk
FedML
31
0
0
05 Jun 2024
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
Maciej Besta
Lorenzo Paleari
Aleš Kubíček
Piotr Nyczyk
Robert Gerstenberger
Patrick Iff
Tomasz Lehmann
H. Niewiadomski
Torsten Hoefler
75
5
0
04 Jun 2024
M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose
  Audio-Language Representation
M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
Masahiro Yasuda
Shunsuke Tsubaki
Keisuke Imoto
VLM
38
5
0
04 Jun 2024
CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation
CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation
M. Rusanovsky
Or Hirschorn
S. Avidan
31
3
0
01 Jun 2024
Towards Ontology-Enhanced Representation Learning for Large Language
  Models
Towards Ontology-Enhanced Representation Learning for Large Language Models
Francesco Ronzano
Jay Nanavati
31
4
0
30 May 2024
From Zero to Hero: Cold-Start Anomaly Detection
From Zero to Hero: Cold-Start Anomaly Detection
Tal Reiss
George Kour
Naama Zwerdling
Ateret Anaby-Tavor
Yedid Hoshen
39
0
0
30 May 2024
Don't Forget to Connect! Improving RAG with Graph-based Reranking
Don't Forget to Connect! Improving RAG with Graph-based Reranking
Jialin Dong
Bahare Fatemi
Bryan Perozzi
Lin F. Yang
Anton Tsitsulin
58
25
0
28 May 2024
Recent advances in text embedding: A Comprehensive Review of
  Top-Performing Methods on the MTEB Benchmark
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
35
11
0
27 May 2024
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee
Rajarshi Roy
Mengyao Xu
Jonathan Raiman
M. Shoeybi
Bryan Catanzaro
Ming-Yu Liu
RALM
68
145
0
27 May 2024
Crafting Interpretable Embeddings by Asking LLMs Questions
Crafting Interpretable Embeddings by Asking LLMs Questions
Vinamra Benara
Chandan Singh
John X. Morris
Richard Antonello
Ion Stoica
Alexander G. Huth
Jianfeng Gao
26
5
0
26 May 2024
The correlation between nativelike selection and prototypicality: a
  multilingual onomasiological case study using semantic embedding
The correlation between nativelike selection and prototypicality: a multilingual onomasiological case study using semantic embedding
Huasheng Zhang
35
0
0
22 May 2024
TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in
  Large Language Models
TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models
Pengzhou Cheng
Yidong Ding
Tianjie Ju
Zongru Wu
Wei Du
Ping Yi
ZhuoSheng Zhang
Gongshen Liu
SILM
AAML
40
20
0
22 May 2024
Question-Based Retrieval using Atomic Units for Enterprise RAG
Question-Based Retrieval using Atomic Units for Enterprise RAG
Vatsal Raina
Mark J. F. Gales
35
7
0
20 May 2024
INDUS: Effective and Efficient Language Models for Scientific
  Applications
INDUS: Effective and Efficient Language Models for Scientific Applications
Bishwaranjan Bhattacharjee
Aashka Trivedi
Masayasu Muraoka
Muthukumaran Ramasubramanian
Takuma Udagawa
...
Peter W. J. Staar
S. Vahidinia
Ryan McGranaghan
A. Mehrabian
Tsendgar Lee
AI4CE
33
5
0
17 May 2024
FinTextQA: A Dataset for Long-form Financial Question Answering
FinTextQA: A Dataset for Long-form Financial Question Answering
Jian Chen
Peilin Zhou
Yining Hua
Yingxin Loh
Kehui Chen
Ziyuan Li
Bing Zhu
Junwei Liang
37
12
0
16 May 2024
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training
Junqin Huang
Zhongjie Hu
Zihao Jing
Mengya Gao
Yichao Wu
MoE
VLM
41
4
0
11 May 2024
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models
Luke Merrick
Danmei Xu
Gaurav Nuti
Daniel Campos
17
24
0
08 May 2024
URL: Universal Referential Knowledge Linking via Task-instructed
  Representation Compression
URL: Universal Referential Knowledge Linking via Task-instructed Representation Compression
Zhuoqun Li
Hongyu Lin
Tianshu Wang
Boxi Cao
Yaojie Lu
Weixiang Zhou
Hao Wang
Zhenyu Zeng
Le Sun
Xianpei Han
51
1
0
24 Apr 2024
Multi-view Content-aware Indexing for Long Document Retrieval
Multi-view Content-aware Indexing for Long Document Retrieval
Kuicai Dong
Derrick-Goh-Xin Deik
Yi Quan Lee
Hao Zhang
Xiangyang Li
Cong Zhang
Yong-jin Liu
36
3
0
23 Apr 2024
LongEmbed: Extending Embedding Models for Long Context Retrieval
LongEmbed: Extending Embedding Models for Long Context Retrieval
Dawei Zhu
Liang Wang
Nan Yang
Yifan Song
Wenhao Wu
Furu Wei
Sujian Li
RALM
43
21
0
18 Apr 2024
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through
  Direct Preference Optimization
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Navonil Majumder
Chia-Yu Hung
Deepanway Ghosal
Wei-Ning Hsu
Rada Mihalcea
Soujanya Poria
47
52
0
15 Apr 2024
ToNER: Type-oriented Named Entity Recognition with Generative Language
  Model
ToNER: Type-oriented Named Entity Recognition with Generative Language Model
Guochao Jiang
Ziqin Luo
Yuchen Shi
Dixuan Wang
Jiaqing Liang
Deqing Yang
49
8
0
14 Apr 2024
Event-enhanced Retrieval in Real-time Search
Event-enhanced Retrieval in Real-time Search
Yanan Zhang
Xiaoling Bai
Tianhua Zhou
46
1
0
09 Apr 2024
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Parishad BehnamGhader
Vaibhav Adlakha
Marius Mosbach
Dzmitry Bahdanau
Nicolas Chapados
Siva Reddy
53
182
0
09 Apr 2024
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Jinhyuk Lee
Zhuyun Dai
Xiaoqi Ren
Blair Chen
Daniel Cer
...
Aditya Kusupati
Prateek Jain
Siddhartha Reddy Jonnalagadda
Ming-Wei Chang
Iftekhar Naim
RALM
VLM
SyDa
48
41
0
29 Mar 2024
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using
  Representative Data
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
Manuel Tonneau
Pedro Vitor Quinta de Castro
Karim Lasri
I. Farouq
Lakshminarayanan Subramanian
Victor Orozco-Olvera
Samuel Fraiberger
44
10
0
28 Mar 2024
BLADE: Enhancing Black-box Large Language Models with Small
  Domain-Specific Models
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Haitao Li
Qingyao Ai
Jia Chen
Qian Dong
Zhijing Wu
Yiqun Liu
Chong Chen
Qi Tian
AILaw
62
13
0
27 Mar 2024
SMART: Submodular Data Mixture Strategy for Instruction Tuning
SMART: Submodular Data Mixture Strategy for Instruction Tuning
Kowndinya Renduchintala
S. Bhatia
Ganesh Ramakrishnan
46
3
0
13 Mar 2024
OffensiveLang: A Community Based Implicit Offensive Language Dataset
OffensiveLang: A Community Based Implicit Offensive Language Dataset
Amit Das
Mostafa Rahgouy
Dongji Feng
Zheng Zhang
Tathagata Bhattacharya
...
Aman Chadha
Mary J. Sandage
Lauramarie Pope
Gerry V. Dozier
Cheryl Seals
34
1
0
04 Mar 2024
GISTEmbed: Guided In-sample Selection of Training Negatives for Text
  Embedding Fine-tuning
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning
Aivin V. Solatorio
43
18
0
26 Feb 2024
CLAP: Learning Transferable Binary Code Representations with Natural
  Language Supervision
CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision
Hao Wang
Zeyu Gao
Chao Zhang
Zihan Sha
Mingyang Sun
Yuchen Zhou
Wenyu Zhu
Wenju Sun
Han Qiu
Xiangwei Xiao
38
17
0
26 Feb 2024
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
Fanjin Zhang
Shijie Shi
Yifan Zhu
Bo Chen
Yukuo Cen
...
Huihui Yuan
Jian Song
Xiaoyan Li
Yuxiao Dong
Jie Tang
42
16
0
24 Feb 2024
Repetition Improves Language Model Embeddings
Repetition Improves Language Model Embeddings
Jacob Mitchell Springer
Suhas Kotha
Daniel Fried
Graham Neubig
Aditi Raghunathan
48
29
0
23 Feb 2024
Triple-Encoders: Representations That Fire Together, Wire Together
Triple-Encoders: Representations That Fire Together, Wire Together
Justus-Jonas Erker
Florian Mai
Nils Reimers
Gerasimos Spanakis
Iryna Gurevych
22
2
0
19 Feb 2024
FeB4RAG: Evaluating Federated Search in the Context of Retrieval
  Augmented Generation
FeB4RAG: Evaluating Federated Search in the Context of Retrieval Augmented Generation
Shuai Wang
Ekaterina Khramtsova
Shengyao Zhuang
Guido Zuccon
34
11
0
19 Feb 2024
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Xing Han Lù
Zdeněk Kasner
Siva Reddy
34
60
0
08 Feb 2024
Previous
12345
Next