ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.11004
  4. Cited By
The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators

The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators

25 June 2024
Tzu-Heng Huang
Catherine Cao
Vaishnavi Bhargava
Frederic Sala
ArXivPDFHTML

Papers citing "The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators"

39 / 39 papers shown
Title
ScriptoriumWS: A Code Generation Assistant for Weak Supervision
ScriptoriumWS: A Code Generation Assistant for Weak Supervision
Tzu-Heng Huang
Catherine Cao
Spencer Schoenberg
Harit Vishwakarma
Nicholas Roberts
Frederic Sala
NoLa
190
6
0
17 Feb 2025
Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations
  from Large Language Models
Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models
Chirag Agarwal
Sree Harsha Tanneru
Himabindu Lakkaraju
LRM
71
41
0
07 Feb 2024
Retrieval-Augmented Generation for Large Language Models: A Survey
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DV
RALM
140
1,757
1
18 Dec 2023
Can Large Language Models Design Accurate Label Functions?
Can Large Language Models Design Accurate Label Functions?
Naiqing Guan
Kaiwen Chen
Nick Koudas
ALM
46
7
0
01 Nov 2023
LLMaAA: Making Large Language Models as Active Annotators
LLMaAA: Making Large Language Models as Active Annotators
Ruoyu Zhang
Yanzeng Li
Yongliang Ma
Ming Zhou
Lei Zou
75
73
0
30 Oct 2023
Can Large Language Models Explain Themselves? A Study of LLM-Generated
  Self-Explanations
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Shiyuan Huang
Siddarth Mamidanna
Shreedhar Jangam
Yilun Zhou
Leilani H. Gilpin
LRM
MILM
ELM
78
74
0
17 Oct 2023
Zero-Shot Robustification of Zero-Shot Models
Zero-Shot Robustification of Zero-Shot Models
Dyah Adila
Changho Shin
Lin Cai
Frederic Sala
65
20
0
08 Sep 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
280
11,828
0
18 Jul 2023
GPT Self-Supervision for a Better Data Annotator
GPT Self-Supervision for a Better Data Annotator
Xiaohuan Pei
Yanxi Li
Chang Xu
50
7
0
07 Jun 2023
Distilling Step-by-Step! Outperforming Larger Language Models with Less
  Training Data and Smaller Model Sizes
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Lokesh Nagalapatti
Chun-Liang Li
Chih-Kuan Yeh
Hootan Nakhost
Yasuhisa Fujii
Alexander Ratner
Ranjay Krishna
Chen-Yu Lee
Tomas Pfister
ALM
290
544
0
03 May 2023
AnnoLLM: Making Large Language Models to Be Better Crowdsourced
  Annotators
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators
Xingwei He
Zheng-Wen Lin
Yeyun Gong
Alex Jin
Hang Zhang
Chen Lin
Jian Jiao
Siu-Ming Yiu
Nan Duan
Weizhu Chen
63
201
0
29 Mar 2023
ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks
ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks
Fabrizio Gilardi
Meysam Alizadeh
M. Kubli
AI4MH
104
914
0
27 Mar 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.4K
14,313
0
15 Mar 2023
In-context Examples Selection for Machine Translation
In-context Examples Selection for Machine Translation
Sweta Agrawal
Chunting Zhou
M. Lewis
Luke Zettlemoyer
Marjan Ghazvininejad
LRM
82
195
0
05 Dec 2022
Evaluating Unsupervised Text Classification: Zero-shot and
  Similarity-based Approaches
Evaluating Unsupervised Text Classification: Zero-shot and Similarity-based Approaches
Tim Schopf
Daniel Braun
Florian Matthes
60
57
0
29 Nov 2022
Tuning Language Models as Training Data Generators for
  Augmentation-Enhanced Few-Shot Learning
Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
Yu Meng
Martin Michalski
Jiaxin Huang
Yu Zhang
Tarek Abdelzaher
Jiawei Han
VLM
96
49
0
06 Nov 2022
ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback
ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback
Jiacheng Ye
Jiahui Gao
Jiangtao Feng
Zhiyong Wu
Tao Yu
Lingpeng Kong
SyDa
VLM
136
77
0
22 Oct 2022
AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100
  Labels
AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels
Nicholas Roberts
Xintong Li
Tzu-Heng Huang
Dyah Adila
Spencer Schoenberg
Chengao Liu
Lauren Pick
Haotian Ma
Aws Albarghouthi
Frederic Sala
UQCV
75
8
0
30 Aug 2022
Self-Generated In-Context Learning: Leveraging Auto-regressive Language
  Models as a Demonstration Generator
Self-Generated In-Context Learning: Leveraging Auto-regressive Language Models as a Demonstration Generator
Sungmin Cho
Hyunsoo Cho
Junyeob Kim
Taeuk Kim
Kang Min Yoo
Sang-goo Lee
80
64
0
16 Jun 2022
Teaching Models to Express Their Uncertainty in Words
Teaching Models to Express Their Uncertainty in Words
Stephanie C. Lin
Jacob Hilton
Owain Evans
OOD
64
413
0
28 May 2022
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning
Jiahui Gao
Renjie Pi
Yong Lin
Hang Xu
Jiacheng Ye
Zhiyong Wu
Weizhong Zhang
Xiaodan Liang
Zhenguo Li
Lingpeng Kong
SyDa
VLM
95
49
0
25 May 2022
ZeroGen: Efficient Zero-shot Learning via Dataset Generation
ZeroGen: Efficient Zero-shot Learning via Dataset Generation
Jiacheng Ye
Jiahui Gao
Qintong Li
Hang Xu
Jiangtao Feng
Zhiyong Wu
Tao Yu
Lingpeng Kong
SyDa
90
220
0
16 Feb 2022
Learning To Retrieve Prompts for In-Context Learning
Learning To Retrieve Prompts for In-Context Learning
Ohad Rubin
Jonathan Herzig
Jonathan Berant
VPVLM
RALM
79
702
0
16 Dec 2021
WRENCH: A Comprehensive Benchmark for Weak Supervision
WRENCH: A Comprehensive Benchmark for Weak Supervision
Jieyu Zhang
Yue Yu
Yinghao Li
Yujing Wang
Yaming Yang
Mao Yang
Alexander Ratner
57
113
0
23 Sep 2021
Want To Reduce Labeling Cost? GPT-3 Can Help
Want To Reduce Labeling Cost? GPT-3 Can Help
Shuohang Wang
Yang Liu
Yichong Xu
Chenguang Zhu
Michael Zeng
64
255
0
30 Aug 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
195
3,964
0
28 Jul 2021
Evaluating Large Language Models Trained on Code
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
222
5,513
0
07 Jul 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
871
29,372
0
26 Feb 2021
What Makes Good In-Context Examples for GPT-$3$?
What Makes Good In-Context Examples for GPT-333?
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAML
RALM
378
1,374
0
17 Jan 2021
Interactive Weak Supervision: Learning Useful Heuristics for Data
  Labeling
Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling
Benedikt Boecking
Willie Neiswanger
Eric Xing
A. Dubrawski
NoLa
OffRL
77
70
0
11 Dec 2020
Denoising Multi-Source Weak Supervision for Neural Text Classification
Denoising Multi-Source Weak Supervision for Neural Text Classification
Wendi Ren
Yinghao Li
Hanting Su
David Kartchner
Cassie S. Mitchell
Chao Zhang
NoLa
77
70
0
09 Oct 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
741
41,894
0
28 May 2020
Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods
Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods
Daniel Y. Fu
Mayee F. Chen
Frederic Sala
Sarah Hooper
Kayvon Fatahalian
Christopher Ré
OffRL
74
115
0
27 Feb 2020
Distributionally Robust Neural Networks for Group Shifts: On the
  Importance of Regularization for Worst-Case Generalization
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization
Shiori Sagawa
Pang Wei Koh
Tatsunori B. Hashimoto
Percy Liang
OOD
93
1,237
0
20 Nov 2019
GOGGLES: Automatic Image Labeling with Affinity Coding
GOGGLES: Automatic Image Labeling with Affinity Coding
Nilaksh Das
Sanya Chaba
Renzhi Wu
Sakshi Gandhi
Duen Horng Chau
Xu Chu
VLM
50
33
0
11 Mar 2019
Training Complex Models with Multi-Task Weak Supervision
Training Complex Models with Multi-Task Weak Supervision
Alexander Ratner
Braden Hancock
Jared A. Dunnmon
Frederic Sala
Shreyash Pandey
Christopher Ré
46
212
0
05 Oct 2018
Snorkel: Rapid Training Data Creation with Weak Supervision
Snorkel: Rapid Training Data Creation with Weak Supervision
Alexander Ratner
Stephen H. Bach
Henry R. Ehrenberg
Jason Alan Fries
Sen Wu
Christopher Ré
73
1,027
0
28 Nov 2017
Data Programming: Creating Large Training Sets, Quickly
Data Programming: Creating Large Training Sets, Quickly
Alexander Ratner
Christopher De Sa
Sen Wu
Daniel Selsam
Christopher Ré
183
716
0
25 May 2016
Good Debt or Bad Debt: Detecting Semantic Orientations in Economic Texts
Good Debt or Bad Debt: Detecting Semantic Orientations in Economic Texts
P. Malo
Ankur Sinha
P. Takala
P. Korhonen
J. Wallenius
69
511
0
19 Jul 2013
1