Text and Code Embeddings by Contrastive Pre-Training

24 January 2022
Arvind Neelakantan
Tao Xu
Raul Puri
Alec Radford
Jesse Michael Han
Jerry Tworek
Qiming Yuan
Nikolas Tezak
Jong Wook Kim
Chris Hallacy
Johannes Heidecke
Pranav Shyam
Boris Power
Tyna Eloundou Nekoul
Girish Sastry
Gretchen Krueger
David Schnurr
F. Such
K. Hsu
Madeleine Thompson
Tabarak Khan
Toki Sherbakov
Joanne Jang
Peter Welinder
Lilian Weng
    SSL
    AI4TS

Papers citing "Text and Code Embeddings by Contrastive Pre-Training"

50 / 245 papers shown
Automatic Detection of LLM-generated Code: A Case Study of Claude 3 Haiku
Musfiqur Rahman
SayedHassan Khatoonabadi
Ahmad Abdellatif
Emad Shihab
33
3
0
02 Sep 2024
SCOPE: Sign Language Contextual Processing with Embedding from LLMs
Yuqi Liu
Wenqian Zhang
Sihan Ren
Chengyu Huang
Jingyi Yu
Lan Xu
SLR
52
0
0
02 Sep 2024
Statically Contextualizing Large Language Models with Typed Holes
Andrew Blinn
Xiang Li
June Hyung Kim
Cyrus Omar
42
1
0
02 Sep 2024
vitaLITy 2: Reviewing Academic Literature Using Large Language Models
Hongye An
Arpit Narechania
Emily Wall
Kai Xu
37
2
0
24 Aug 2024
Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment
Kun Luo
Minghao Qin
Zheng Liu
Shitao Xiao
Jun Zhao
Kang Liu
34
10
0
22 Aug 2024
LARR: Large Language Model Aided Real-time Scene Recommendation with Semantic Understanding
Zhizhong Wan
Bin Yin
Junjie Xie
Fei Jiang
Xiang Li
Wei Lin
3DV
43
5
0
21 Aug 2024
Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores
Jun Lu
David Li
Bill Ding
Yu Kang
61
3
0
19 Aug 2024
HELP: Hierarchical Embeddings-based Log Parsing
Andy Xu
Arno Gau
52
2
0
15 Aug 2024
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Dongyu Ru
Lin Qiu
Xiangkun Hu
Tianhang Zhang
Peng Shi
...
Tong He
Zhiguo Wang
Pengfei Liu
Yue Zhang
Zheng Zhang
51
12
0
15 Aug 2024
Exploring Retrieval Augmented Generation in Arabic
S. El-Beltagy
Mohamed A. Abdallah
RALM
53
3
0
14 Aug 2024
ViC: Virtual Compiler Is All You Need For Assembly Code Search
Zeyu Gao
Hao Wang
Yuanda Wang
Chao Zhang
35
1
0
10 Aug 2024
LLM Agents Improve Semantic Code Search
Sarthak Jain
Aditya Dora
Ka Seng Sam
Prabhat Singh
AIFin
26
5
0
05 Aug 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Xin Zhang
Yanzhao Zhang
Dingkun Long
Wen Xie
Ziqi Dai
...
Pengjun Xie
Fei Huang
Meishan Zhang
Wenjie Li
Min Zhang
42
73
0
29 Jul 2024
Towards More Accurate Prediction of Human Empathy and Emotion in Text and Multi-turn Conversations by Combining Advanced NLP, Transformers-based Networks, and Linguistic Methodologies
Manisha Singh
Divy Sharma
Alonso Ma
Nora Goldfine
35
0
0
26 Jul 2024
Exploring Description-Augmented Dataless Intent Classification
Ruoyu Hu
Foaad Khosmood
Abbas Edalat
AI4TS
39
0
0
25 Jul 2024
BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic Chunking and Hard Example Learning
Partha Chakraborty
Mahmoud Alfadel
Mei Nagappan
25
2
0
24 Jul 2024
Was it Slander? Towards Exact Inversion of Generative Language Models
Adrians Skapars
Edoardo Manino
Youcheng Sun
Lucas C. Cordeiro
33
3
0
10 Jul 2024
Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective
Yu-An Liu
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Yixing Fan
Xueqi Cheng
35
6
0
09 Jul 2024
Exploring the Capability of ChatGPT to Reproduce Human Labels for Social Computing Tasks (Extended Version)
Yiming Zhu
Peixian Zhang
Ehsan-ul Haq
Pan Hui
Gareth Tyson
ALM
AI4MH
47
0
0
08 Jul 2024
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Shihan Dou
Haoxiang Jia
Shenxi Wu
Huiyuan Zheng
Weikang Zhou
...
Xunliang Cai
Tao Gui
Xipeng Qiu
Qi Zhang
Xuanjing Huang
31
32
0
08 Jul 2024
MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation
Zijie J. Wang
Duen Horng Chau
43
4
0
02 Jul 2024
D2LLM: Decomposed and Distilled Large Language Models for Semantic Search
Zihan Liao
Hang Yu
Jianguo Li
Jun Wang
Wei Zhang
34
3
0
25 Jun 2024
Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss
Wei He
M. Idiart
Carolina Scarton
Aline Villavicencio
34
2
0
21 Jun 2024
R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation
Fuda Ye
Shuangyin Li
Yongqi Zhang
L. Chen
35
0
0
19 Jun 2024
Can Machines Resonate with Humans? Evaluating the Emotional and Empathic Comprehension of LMs
Muhammad Arslan Manzoor
Yuxia Wang
Minghan Wang
Preslav Nakov
35
0
0
17 Jun 2024
LLM Questionnaire Completion for Automatic Psychiatric Assessment
Gony Rosenman
Lior Wolf
Talma Hendler
33
3
0
09 Jun 2024
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Junjie Zhou
Zheng Liu
Shitao Xiao
Bo Zhao
Yongping Xiong
51
21
0
06 Jun 2024
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Alicja Ziarko
Albert Q. Jiang
Bartosz Piotrowski
Wenda Li
M. Jamnik
Piotr Miłoś
32
0
0
06 Jun 2024
Exploring Human-AI Perception Alignment in Sensory Experiences: Do LLMs Understand Textile Hand?
Shu Zhong
Elia Gatti
Youngjun Cho
Marianna Obrist
49
3
0
05 Jun 2024
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs
Fatemeh Shiri
Van Nguyen
Farhad Moghimifar
John Yoo
Gholamreza Haffari
Yuan-Fang Li
ReLM
74
3
0
03 Jun 2024
Presence or Absence: Are Unknown Word Usages in Dictionaries?
Xianghe Ma
Dominik Schlechtweg
Wei-Ye Zhao
36
3
0
02 Jun 2024
MTEB-French: Resources for French Sentence Embedding Evaluation and Analysis
Mathieu Ciancone
Imene Kerboua
Marion Schaeffer
W. Siblini
42
2
0
30 May 2024
Unleashing the Potential of Text-attributed Graphs: Automatic Relation Decomposition via Large Language Models
Hyunjin Seo
Taewon Kim
J. Yang
Eunho Yang
47
0
0
28 May 2024
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee
Rajarshi Roy
Mengyao Xu
Jonathan Raiman
M. Shoeybi
Bryan Catanzaro
Ming-Yu Liu
RALM
54
139
0
27 May 2024
M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions
Zheng Wang
Shu Xian Teo
Jieer Ouyang
Yongjun Xu
Wei Shi
RALM
VLM
27
13
0
26 May 2024
ChatGPT Code Detection: Techniques for Uncovering the Source of Code
Marc Oedingen
Raphael C. Engelhardt
Robin Denz
Maximilian Hammer
Wolfgang Konen
DeLMO
42
8
0
24 May 2024
Semantic Density: Uncertainty Quantification in Semantic Space for Large Language Models
Xin Qiu
Risto Miikkulainen
46
3
0
22 May 2024
DrHouse: An LLM-empowered Diagnostic Reasoning System through Harnessing Outcomes from Sensor Data and Expert Knowledge
Bufang Yang
Siyang Jiang
Lilin Xu
Kaiwei Liu
Hai Li
Guoliang Xing
Hongkai Chen
Xiaofan Jiang
Zhenyu Yan
LM&MA
42
15
0
21 May 2024
ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation
Chen Huang
Yiping Jin
Ilija Ilievski
Wenqiang Lei
Jiancheng Lv
35
3
0
20 May 2024
DocReLM: Mastering Document Retrieval with Language Model
Gengchen Wei
Xinle Pang
Tianning Zhang
Yu Sun
Xun Qian
Chen Lin
Han-Sen Zhong
Wanli Ouyang
RALM
36
0
0
19 May 2024
PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking
Yuzhang Xie
Jiaying Lu
Joyce C. Ho
Fadi B Nahab
Xiao Hu
Carl Yang
38
4
0
13 May 2024
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning
Karim Galliamov
Leila Khaertdinova
Karina Denisova
38
1
0
07 May 2024
Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming
Saikat Chakraborty
Gabriel Ebner
Siddharth Bhat
Sarah Fakhoury
Sakina Fatima
Shuvendu K. Lahiri
Nikhil Swamy
40
15
0
03 May 2024
Aptly: Making Mobile Apps from Natural Language
Evan W. Patton
David Y.J. Kim
Ashley Granquist
Robin Liu
Arianna Scott
Jennet Zamanova
Harold Abelson
28
1
0
30 Apr 2024
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
Ran Xu
Wenqi Shi
Yue Yu
Yuchen Zhuang
Yanqiao Zhu
M. D. Wang
Joyce C. Ho
Chao Zhang
Carl Yang
LM&MA
40
19
0
29 Apr 2024
Understanding Privacy Risks of Embeddings Induced by Large Language Models
Zhihao Zhu
Ninglu Shao
Defu Lian
Chenwang Wu
Zheng Liu
Yi Yang
Enhong Chen
35
0
0
25 Apr 2024
KGValidator: A Framework for Automatic Validation of Knowledge Graph Construction
Jack Boylan
Shashank Mangla
Dominic Thorn
D. Ghalandari
Parsa Ghaffari
Chris Hokamp
SLR
40
0
0
24 Apr 2024
Character is Destiny: Can Large Language Models Simulate Persona-Driven Decisions in Role-Playing?
Rui Xu
Xintao Wang
Jiangjie Chen
Siyu Yuan
Xinfeng Yuan
Jiaqing Liang
Zulong Chen
Xiaoqing Dong
Yanghua Xiao
63
4
0
18 Apr 2024
LongEmbed: Extending Embedding Models for Long Context Retrieval
Dawei Zhu
Liang Wang
Nan Yang
Yifan Song
Wenhao Wu
Furu Wei
Sujian Li
RALM
43
21
0
18 Apr 2024
Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection
Yuxi Li
Yi Liu
Gelei Deng
Ying Zhang
Wenjia Song
Ling Shi
Kailong Wang
Yuekang Li
Yang Liu
Haoyu Wang
47
20
0
15 Apr 2024