Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.11702
Cited By
Lingua Manga: A Generic Large Language Model Centric System for Data Curation
20 June 2023
Zui Chen
Lei Cao
Samuel Madden
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Lingua Manga: A Generic Large Language Model Centric System for Data Curation"
10 / 10 papers shown
Title
Data Proportion Detection for Optimized Data Management for Large Language Models
Hao Liang
Keshi Zhao
Yajie Yang
Bin Cui
Guosheng Dong
Zenan Zhou
Wentao Zhang
38
0
0
26 Sep 2024
Synth-Empathy: Towards High-Quality Synthetic Empathy Data
Hao Liang
Linzhuang Sun
Jingxuan Wei
Xijie Huang
Linkun Sun
Bihui Yu
Conghui He
Wentao Zhang
SyDa
48
4
0
31 Jul 2024
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models
Zheng Liu
Hao Liang
Xijie Huang
Wentao Xiong
Qinhan Yu
Linzhuang Sun
Chong Chen
Conghui He
Bin Cui
Wentao Zhang
SyDa
55
0
0
30 Jul 2024
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Miao Zheng
H. Liang
Fan Yang
Haoze Sun
Tianpeng Li
...
Kun Fang
Weipeng Chen
Bin Cui
Wentao Zhang
Zenan Zhou
RALM
44
3
0
08 Jul 2024
KeyVideoLLM: Towards Large-scale Video Keyframe Selection
Hao Liang
Jiapeng Li
Tianyi Bai
Xijie Huang
Linzhuang Sun
Zhengren Wang
Conghui He
Bin Cui
Chong Chen
Wentao Zhang
VGen
29
7
0
03 Jul 2024
Efficient-Empathy: Towards Efficient and Effective Selection of Empathy Data
Linzhuang Sun
Hao Liang
Jingxuan Wei
Linkun Sun
Bihui Yu
Bin Cui
Wentao Zhang
37
1
0
02 Jul 2024
The Role of Data Curation in Image Captioning
Wenyan Li
Jonas F. Lotz
Chen Qiu
Desmond Elliott
DiffM
40
6
0
05 May 2023
Can Foundation Models Wrangle Your Data?
A. Narayan
Ines Chami
Laurel J. Orr
Simran Arora
Christopher Ré
LMTD
AI4CE
181
214
0
20 May 2022
Distilling Knowledge from Reader to Retriever for Question Answering
Gautier Izacard
Edouard Grave
RALM
185
251
0
08 Dec 2020
Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference
Timo Schick
Hinrich Schütze
258
1,591
0
21 Jan 2020
1