Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.05658
Cited By
UER: An Open-Source Toolkit for Pre-training Models
12 September 2019
Zhe Zhao
Hui Chen
Jinbin Zhang
Xin Zhao
Tao Liu
Wei Lu
Xi Chen
Haotang Deng
Qi Ju
Xiaoyong Du
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"UER: An Open-Source Toolkit for Pre-training Models"
50 / 54 papers shown
Title
Entropy-Driven Pre-Tokenization for Byte-Pair Encoding
Yifan Hu
Frank Liang
Dachuan Zhao
Jonathan Geuter
Varshini Reddy
Craig W. Schmidt
Chris Tanner
15
0
0
18 Jun 2025
Language-Agnostic Suicidal Risk Detection Using Large Language Models
June-Woo Kim
Wonkyo Oh
Haram Yoon
Sung-Hoon Yoon
Dae-Jin Kim
Dong-Ho Lee
Sang-Yeol Lee
Chan-Mo Yang
39
0
0
26 May 2025
Language Models at the Syntax-Semantics Interface: A Case Study of the Long-Distance Binding of Chinese Reflexive ziji
Xiulin Yang
112
1
0
02 Apr 2025
GADFA: Generator-Assisted Decision-Focused Approach for Opinion Expressing Timing Identification
Chung-Chi Chen
Hiroya Takamura
Ichiro Kobayashi
Yusuke Miyao
66
0
0
02 Oct 2024
Real or Robotic? Assessing Whether LLMs Accurately Simulate Qualities of Human Responses in Dialogue
Jonathan Ivey
Shivani Kumar
Jiayu Liu
Hua Shen
Sushrita Rakshit
...
Dustin Wright
Abraham Israeli
Anders Giovanni Møller
Lechen Zhang
David Jurgens
106
3
0
12 Sep 2024
FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure
Ziyue Xu
Peilin Zhou
Xinyu Shi
Jiageng Wu
Yikang Jiang
Bin Ke
Jie Yang
Jie Yang
108
5
0
17 Jun 2024
FinGen: A Dataset for Argument Generation in Finance
Chung-Chi Chen
Hiroya Takamura
Ichiro Kobayashi
Yusuke Miyao
58
0
0
31 May 2024
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model
Zhengpeng Shi
Haoran Luo
LRM
ALM
93
2
0
28 Apr 2024
Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training
Tongkun Su
Jun Li
Xi Zhang
Haibo Jin
Hao Chen
Qiong Wang
Faqin Lv
Baoliang Zhao
Yin Hu
71
0
0
30 Mar 2024
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?
Christophe Servan
Sahar Ghannay
Sophie Rosset
46
1
0
27 Mar 2024
LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction
Yixuan Wang
Baoxin Wang
Yijun Liu
Dayong Wu
Wanxiang Che
KELM
93
2
0
26 Mar 2024
Evolving Knowledge Distillation with Large Language Models and Active Learning
Chengyuan Liu
Yangyang Kang
Fubang Zhao
Kun Kuang
Zhuoren Jiang
Changlong Sun
Leilei Gan
49
6
0
11 Mar 2024
LiFi: Lightweight Controlled Text Generation with Fine-Grained Control Codes
Chufan Shi
Deng Cai
Yujiu Yang
160
3
0
10 Feb 2024
Inconsistent dialogue responses and how to recover from them
Mian Zhang
Lifeng Jin
Linfeng Song
Haitao Mi
Dong Yu
60
1
0
18 Jan 2024
Prompt Pool based Class-Incremental Continual Learning for Dialog State Tracking
Hong Liu
Yucheng Cai
Yuan Zhou
Zhijian Ou
Yi Huang
Junlan Feng
CLL
95
2
0
17 Nov 2023
Create and Find Flatness: Building Flat Training Spaces in Advance for Continual Learning
Wenhang Shi
Yiren Chen
Zhe Zhao
Wei Lu
Kimmo Yan
Xiaoyong Du
CLL
81
5
0
20 Sep 2023
CPPF: A contextual and post-processing-free model for automatic speech recognition
Lei Zhang
Zhengkun Tian
Xiang Chen
Jiaming Sun
Hongyu Xiang
Ke Ding
Guanglu Wan
67
0
0
14 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
204
40
0
30 Aug 2023
NewsDialogues: Towards Proactive News Grounded Conversation
Siheng Li
Yichun Yin
Cheng Yang
Wangjie Jiang
Yiwei Li
Ze-Long Cheng
Lifeng Shang
Xin Jiang
Qun Liu
Yujiu Yang
58
6
0
12 Aug 2023
Methods for Acquiring and Incorporating Knowledge into Stock Price Prediction: A Survey
Liping Wang
Jiawei Li
Lifan Zhao
Zhizhuo Kou
Xiaohan Wang
Xinyi Zhu
Hao Wang
Yanyan Shen
Lei Chen
AIFin
111
9
0
09 Aug 2023
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction
Zefa Hu
Ziyi Ni
Jing Shi
Shuang Xu
Bo Xu
MedIm
82
2
0
30 Jul 2023
Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling
Longyue Wang
Zefeng Du
Donghua Liu
Cai Deng
Dian Yu
Haiyun Jiang
Yan Wang
Leyang Cui
Shuming Shi
Zhaopeng Tu
CoGe
99
6
0
16 Jul 2023
Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
Hong Liu
Z. Lv
Zhijian Ou
Wenbo Zhao
Qing Xiao
61
1
0
22 May 2023
A Cognitive Stimulation Dialogue System with Multi-source Knowledge Fusion for Elders with Cognitive Impairment
Jiyue Jiang
Sheng Wang
Qintong Li
Lingpeng Kong
Chuan Wu
99
8
0
14 May 2023
SikuGPT: A Generative Pre-trained Model for Intelligent Information Processing of Ancient Texts from the Perspective of Digital Humanities
Chang Liu
Dongbo Wang
Zhixiao Zhao
Die Hu
Mengcheng Wu
...
Si Shen
Bin Li
Jiangfeng Liu
Hai Zhang
Lianzheng Zhao
46
10
0
16 Apr 2023
MetaAID 2.0: An Extensible Framework for Developing Metaverse Applications via Human-controllable Pre-trained Models
Hongyin Zhu
56
6
0
25 Feb 2023
BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark
Dakuan Lu
Hengkui Wu
Jiaqing Liang
Yipei Xu
Qi He
Yipeng Geng
Mengkun Han
Ying Xin
Yanghua Xiao
89
62
0
18 Feb 2023
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Zhe Zhao
Yudong Li
Cheng-An Hou
Jing-xin Zhao
Rong Tian
...
Xingwu Sun
Zhanhui Kang
Xiaoyong Du
Linlin Shen
Kimmo Yan
VLM
106
24
0
13 Dec 2022
Investigating Glyph Phonetic Information for Chinese Spell Checking: What Works and What's Next
Xiaotian Zhang
Yanjun Zheng
Hang Yan
Xipeng Qiu
69
5
0
08 Dec 2022
CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation
Yinpei Dai
Wanwei He
Bowen Li
Yuchuan Wu
Zhen Cao
Zhongqi An
Jian Sun
Yongbin Li
ELM
ALM
92
13
0
21 Nov 2022
TSMind: Alibaba and Soochow University's Submission to the WMT22 Translation Suggestion Task
Xin Ge
Ke Min Wang
Jiayi Wang
Nini Xiao
Xiangyu Duan
Yu Zhao
Yuqi Zhang
41
2
0
16 Nov 2022
A Method to Judge the Style of Classical Poetry Based on Pre-trained Model
Zi-Chuan Wang
Jiandong Zhang
Jun Ma
33
1
0
09 Nov 2022
Generation of Chinese classical poetry based on pre-trained model
Ziyao Wang
Lujin Guan
Guanyu Liu
29
0
0
04 Nov 2022
SLING: Sino Linguistic Evaluation of Large Language Models
Yixiao Song
Kalpesh Krishna
R. Bhatt
Mohit Iyyer
83
10
0
21 Oct 2022
Counterfactual Recipe Generation: Exploring Compositional Generalization in a Realistic Scenario
Xiao Liu
Yansong Feng
Jizhi Tang
ChenGang Hu
Dongyan Zhao
38
9
0
20 Oct 2022
Improving Chinese Story Generation via Awareness of Syntactic Dependencies and Semantics
Hen-Hsen Huang
Chen Tang
Tyler Loakman
Frank Guerin
Chenghua Lin
66
12
0
19 Oct 2022
Mars: Modeling Context & State Representations with Contrastive Learning for End-to-End Task-Oriented Dialog
Haipeng Sun
Junwei Bao
Youzheng Wu
Xiaodong He
75
13
0
17 Oct 2022
A Benchmark for Understanding and Generating Dialogue between Characters in Stories
Jianzhu Yao
Ziqi Liu
Jian Guan
Minlie Huang
67
1
0
18 Sep 2022
Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching
Kunbo Ding
Weijie Liu
Yuejian Fang
Zhe Zhao
Qi Ju
Xuefeng Yang
41
1
0
13 Sep 2022
CSL: A Large-scale Chinese Scientific Literature Dataset
Yudong Li
Yuqing Zhang
Zhe Zhao
Lin-cheng Shen
Weijie Liu
Weiquan Mao
Hui Zhang
AILaw
202
52
0
12 Sep 2022
A Corpus for Understanding and Generating Moral Stories
Jian Guan
Ziqi Liu
Minlie Huang
75
10
0
20 Apr 2022
MetaAID: A Flexible Framework for Developing Metaverse Applications via AI Technology and Human Editing
Hongyin Zhu
50
31
0
04 Apr 2022
PathSAGE: Spatial Graph Attention Neural Networks With Random Path Sampling
Junhua Ma
Jiajun Li
Xueming Li
Xu Li
3DPC
GNN
31
1
0
11 Mar 2022
Semantic Matching from Different Perspectives
Weijie Liu
Tao Zhu
Weiquan Mao
Zhe Zhao
Weigang Guo
Xuefeng Yang
Qi Ju
36
0
0
14 Feb 2022
ET-BERT: A Contextualized Datagram Representation with Pre-training Transformers for Encrypted Traffic Classification
Xinjie Lin
G. Xiong
Gaopeng Gou
Zhen Li
Junzheng Shi
Jiahao Yu
56
258
0
13 Feb 2022
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
54
31
0
14 Dec 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh D. Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
...
Tianbao Xie
Usama Yaseen
Michael A. Yee
Jing Zhang
Yue Zhang
235
88
0
06 Dec 2021
NSP-BERT: A Prompt-based Few-Shot Learner Through an Original Pre-training Task--Next Sentence Prediction
Yi Sun
Yu Zheng
Chao Hao
Hangping Qiu
VLM
107
37
0
08 Sep 2021
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation
Jian Guan
Zhuoer Feng
Yamei Chen
Ru He
Xiaoxi Mao
Changjie Fan
Minlie Huang
120
33
0
30 Aug 2021
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
Liang Xu
Xiaojing Lu
Chenyang Yuan
Xuanwei Zhang
Huilin Xu
...
Guoao Wei
X. Pan
Xin Tian
Libo Qin
Hai Hu
ELM
99
57
0
15 Jul 2021
1
2
Next