Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.10005
Cited By
Text and Code Embeddings by Contrastive Pre-Training
24 January 2022
Arvind Neelakantan
Tao Xu
Raul Puri
Alec Radford
Jesse Michael Han
Jerry Tworek
Qiming Yuan
Nikolas Tezak
Jong Wook Kim
Chris Hallacy
Johannes Heidecke
Pranav Shyam
Boris Power
Tyna Eloundou Nekoul
Girish Sastry
Gretchen Krueger
David Schnurr
F. Such
K. Hsu
Madeleine Thompson
Tabarak Khan
Toki Sherbakov
Joanne Jang
Peter Welinder
Lilian Weng
SSL
AI4TS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Text and Code Embeddings by Contrastive Pre-Training"
50 / 245 papers shown
Title
Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning
Yijiang Liu
Rongyu Zhang
Huanrui Yang
Kurt Keutzer
Yuan Du
Li Du
Shanghang Zhang
MoE
41
6
0
13 Apr 2024
Reducing hallucination in structured outputs via Retrieval-Augmented Generation
Patrice Béchard
Orlando Marquez Ayala
LLMAG
37
50
0
12 Apr 2024
RAR-b: Reasoning as Retrieval Benchmark
Chenghao Xiao
G. Thomas
Al Moubayed
LRM
RALM
36
8
0
09 Apr 2024
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Parishad BehnamGhader
Vaibhav Adlakha
Marius Mosbach
Dzmitry Bahdanau
Nicolas Chapados
Siva Reddy
53
182
0
09 Apr 2024
Linguistic Changes in Spontaneous Speech for Detecting Parkinsons Disease Using Large Language Models
Jonathan Crawford
36
0
0
08 Apr 2024
Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors
Chen Huang
Peixin Qin
Yang Deng
Wenqiang Lei
Jiancheng Lv
Tat-Seng Chua
39
6
0
04 Apr 2024
Empowering Biomedical Discovery with AI Agents
Shanghua Gao
Ada Fang
Yepeng Huang
Valentina Giunchiglia
Ayush Noori
Jonathan Richard Schwarz
Yasha Ektefaie
Jovana Kondic
Marinka Zitnik
LLMAG
AI4CE
44
66
0
03 Apr 2024
PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning
Weihua Hu
Yiwen Yuan
Zecheng Zhang
Akihiro Nitta
Kaidi Cao
Vid Kocijan
J. Leskovec
Matthias Fey
LMTD
39
11
0
31 Mar 2024
Beyond One-Size-Fits-All: Multi-Domain, Multi-Task Framework for Embedding Model Selection
Vivek Khetan
26
0
0
30 Mar 2024
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Jinhyuk Lee
Zhuyun Dai
Xiaoqi Ren
Blair Chen
Daniel Cer
...
Aditya Kusupati
Prateek Jain
Siddhartha Reddy Jonnalagadda
Ming-Wei Chang
Iftekhar Naim
RALM
VLM
SyDa
48
41
0
29 Mar 2024
A Survey on Large Language Models from Concept to Implementation
Chen Wang
Jin Zhao
Jiaqi Gong
LLMAG
LM&MA
37
3
0
27 Mar 2024
SQL-Encoder: Improving NL2SQL In-Context Learning Through a Context-Aware Encoder
Mohammadreza Pourreza
Davood Rafiei
Yuxi Feng
Raymond Li
Zhenan Fan
Weiwei Zhang
37
5
0
24 Mar 2024
Semantically Aligned Question and Code Generation for Automated Insight Generation
Ananya Singha
Bhavya Chopra
Anirudh Khatry
Sumit Gulwani
Austin Z. Henley
Vu Le
Chris Parnin
Mukul Singh
Microsoft Belgium
39
3
0
21 Mar 2024
A Semantic Search Engine for Mathlib4
Guoxiong Gao
Haocheng Ju
Jiedong Jiang
Zihan Qin
Bin Dong
40
3
0
20 Mar 2024
RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning
J. Asl
Prajwal Panzade
Eduardo Blanco
Daniel Takabi
Zhipeng Cai
SSL
29
2
0
17 Mar 2024
MedInsight: A Multi-Source Context Augmentation Framework for Generating Patient-Centric Medical Responses using Large Language Models
Subash Neupane
Shaswata Mitra
Sudip Mittal
Noorbakhsh Amiri Golilarz
Shahram Rahimi
Amin Amirlatifi
62
3
0
13 Mar 2024
JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
Arefa
Mohammed Abbas Ansari
Chandni Saxena
Tanvir Ahmad
MLLM
34
2
0
05 Mar 2024
LLM-Oriented Retrieval Tuner
Si Sun
Hanqing Zhang
Zhiyuan Liu
Jie Bao
Dawei Song
RALM
41
0
0
04 Mar 2024
ReMatch: Retrieval Enhanced Schema Matching with LLMs
Eitam Sheetrit
Menachem Brief
Moshik Mishaeli
Oren Elisha
26
11
0
03 Mar 2024
Self-Retrieval: Building an Information Retrieval System with One Large Language Model
Qiaoyu Tang
Jiawei Chen
Bowen Yu
Yaojie Lu
Cheng Fu
...
Fei Huang
Xianpei Han
Xianpei Han
Le Sun
Yongbin Li
KELM
46
0
0
23 Feb 2024
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models
Andrew Zhu
Alyssa Hwang
Liam Dugan
Chris Callison-Burch
ELM
50
0
0
21 Feb 2024
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Demin Song
Honglin Guo
Yunhua Zhou
Shuhao Xing
Yudong Wang
...
Wenwei Zhang
Qipeng Guo
Hang Yan
Xipeng Qiu
Dahua Lin
SyDa
65
8
0
20 Feb 2024
Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism
Hippolyte Gisserot-Boukhlef
Manuel Faysse
Emmanuel Malherbe
C´eline Hudelot
Pierre Colombo
34
2
0
20 Feb 2024
Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs
Jiejun Tan
Zhicheng Dou
Yutao Zhu
Peidong Guo
Kun Fang
Ji-Rong Wen
42
24
0
19 Feb 2024
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?
Shubhashis Roy Dipta
Sadat Shahriar
DeLMO
51
1
0
19 Feb 2024
Uncovering Latent Human Wellbeing in Language Model Embeddings
Pedro Freire
ChengCheng Tan
Adam Gleave
Dan Hendrycks
Scott Emmons
36
1
0
19 Feb 2024
BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models
Kun Luo
Zheng Liu
Shitao Xiao
Kang Liu
39
11
0
18 Feb 2024
Exploring ChatGPT for Next-generation Information Retrieval: Opportunities and Challenges
Yizheng Huang
Jimmy X. Huang
35
10
0
17 Feb 2024
Vehicle Behavior Prediction by Episodic-Memory Implanted NDT
Peining Shen
Jianwu Fang
Hongkai Yu
Jianru Xue
37
0
0
13 Feb 2024
Previously on the Stories: Recap Snippet Identification for Story Reading
JiangNan Li
Qiujing Wang
Liyan Xu
Wenjie Pang
Mo Yu
Zheng Lin
Weiping Wang
Jie Zhou
39
3
0
11 Feb 2024
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Jianlv Chen
Shitao Xiao
Peitian Zhang
Kun Luo
Defu Lian
Zheng Liu
115
328
0
05 Feb 2024
Code Representation Learning At Scale
Dejiao Zhang
W. Ahmad
Ming Tan
Hantian Ding
Ramesh Nallapati
Dan Roth
Xiaofei Ma
Bing Xiang
OffRL
21
9
0
02 Feb 2024
Homogenization Effects of Large Language Models on Human Creative Ideation
Barrett R Anderson
Jash Hemant Shah
Max Kreminski
40
74
0
02 Feb 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
49
23
0
27 Jan 2024
APT-Pipe: A Prompt-Tuning Tool for Social Data Annotation using ChatGPT
Yiming Zhu
Zhizhuo Yin
Gareth Tyson
Ehsan-ul Haq
Lik-Hang Lee
Pan Hui
ALM
43
6
0
24 Jan 2024
DREditor: An Time-efficient Approach for Building a Domain-specific Dense Retrieval Model
Chen Huang
Duanyu Feng
Wenqiang Lei
Jiancheng Lv
59
1
0
23 Jan 2024
Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text
Mazal Bethany
Brandon Wherry
Emet Bethany
Nishant Vishwamitra
Anthony Rios
Peyman Najafirad
DeLMO
36
3
0
17 Jan 2024
An Exploratory Study on Automatic Identification of Assumptions in the Development of Deep Learning Frameworks
Chen Yang
Peng Liang
Zinan Ma
24
0
0
08 Jan 2024
Data-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning
Yingqian Min
Kun Zhou
Dawei Gao
Wayne Xin Zhao
He Hu
Yaliang Li
26
1
0
07 Jan 2024
German Text Embedding Clustering Benchmark
Silvan Wehrli
Bert Arnrich
Christopher Irrgang
30
5
0
05 Jan 2024
Improving Text Embeddings with Large Language Models
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
SyDa
35
158
0
31 Dec 2023
Is Knowledge All Large Language Models Needed for Causal Reasoning?
Hengrui Cai
Shengjie Liu
Rui Song
LRM
ELM
28
10
0
30 Dec 2023
Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages
Mofetoluwa Adeyemi
Akintunde Oladipo
Ronak Pradeep
Jimmy J. Lin
27
1
0
26 Dec 2023
Making Large Language Models A Better Foundation For Dense Retrieval
Chaofan Li
Zheng Liu
Shitao Xiao
Yingxia Shao
RALM
34
34
0
24 Dec 2023
A Strong Baseline for Temporal Video-Text Alignment
Zeqian Li
Qirui Chen
Tengda Han
Ya-Qin Zhang
Yanfeng Wang
Weidi Xie
AI4TS
VGen
43
5
0
21 Dec 2023
Vectorizing string entries for data processing on tables: when are larger language models better?
Léo Grinsztajn
Edouard Oyallon
Myung Jun Kim
Gaël Varoquaux
40
2
0
15 Dec 2023
Exploring Large Language Models to Facilitate Variable Autonomy for Human-Robot Teaming
Younes Lakhnati
Max Pascher
Jens Gerken
LLMAG
LM&Ro
32
3
0
12 Dec 2023
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
O. Ovadia
Menachem Brief
Moshik Mishaeli
Oren Elisha
RALM
28
132
0
10 Dec 2023
PaperQA: Retrieval-Augmented Generative Agent for Scientific Research
Jakub Lála
Odhran O'Donoghue
Aleksandar Shtedritski
Sam Cox
Samuel G. Rodriques
Andrew D. White
RALM
77
73
0
08 Dec 2023
Latent Skill Discovery for Chain-of-Thought Reasoning
Zifan Xu
Haozhu Wang
Dmitriy Bespalov
Peter Stone
Yanjun Qi
ReLM
LRM
56
2
0
07 Dec 2023
Previous
1
2
3
4
5
Next