2410.12788
Cited By
v1
v2
v3 (latest)
Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception
16 October 2024
Jihao Zhao
Zhiyuan Ji
Pengnian Qi
Simin Niu
Feiyu Xiong
Zhiyu Li
Papers citing
"Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception"
19 / 19 papers shown
RAGBench: Explainable Benchmark for Retrieval-Augmented Generation Systems
Robert Friel
Masha Belyi
Atindriyo Sanyal
133
28
0
17 Jan 2025
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
Ziyuan Zhuang
Zhiyang Zhang
Sitao Cheng
Fangkai Yang
Jia Liu
Shujian Huang
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
Qi Zhang
RALM
68
9
0
08 Aug 2024
Adaptive Contrastive Decoding in Retrieval-Augmented Generation for Handling Noisy Contexts
Youna Kim
Sungmin Cho
Cheonbok Park
Choonghyun Park
Hyunsoo Cho
Junyeob Kim
Kang Min Yoo
Sang-goo Lee
Taeuk Kim
73
7
0
02 Aug 2024
Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities
Zhonghao Li
Xuming Hu
Aiwei Liu
Kening Zheng
Shijie Huang
Hui Xiong
RALM
167
8
0
17 Jun 2024
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs
Maciej Besta
Aleš Kubíček
Robert Gerstenberger
Marcin Chrapek
Roman Niggli
...
Joanna Gajda
Piotr Nyczyk
Jürgen Müller
H. Niewiadomski
Torsten Hoefler
87
20
0
07 Jun 2024
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Yuanjie Lyu
Zhiyu Li
Pengnian Qi
Feiyu Xiong
Simin Niu
Wenjin Wang
Hao Wu
Huan Liu
Tong Xu
Enhong Chen
RALM
77
40
0
30 Jan 2024
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Guosheng Dong
Zhiying Wu
ELM
LRM
257
751
0
19 Sep 2023
ChatGPT Hallucinates when Attributing Answers
Guido Zuccon
Bevan Koopman
Razia Shaik
RALM
LRM
HILM
86
28
0
17 Sep 2023
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Shayne Longpre
Gregory Yauney
Emily Reif
Katherine Lee
Adam Roberts
...
Denny Zhou
Jason W. Wei
Kevin Robinson
David M. Mimno
Daphne Ippolito
108
167
0
22 May 2023
Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks
Xianzhi Li
Samuel Chan
Xiaodan Zhu
Yulong Pei
Zhiqiang Ma
Xiaomo Liu
Sameena Shah
AI4MH
57
81
0
10 May 2023
Rethinking with Retrieval: Faithful Large Language Model Inference
Hangfeng He
Hongming Zhang
Dan Roth
KELM
LRM
227
168
0
31 Dec 2022
TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge
Chao-Hong Tan
Jia-Chen Gu
Chongyang Tao
Zhen-Hua Ling
Can Xu
Huang Hu
Xiubo Geng
Daxin Jiang
RALM
73
11
0
16 Mar 2022
BERTopic: Neural topic modeling with a class-based TF-IDF procedure
M. Grootendorst
163
1,498
0
11 Mar 2022
Internet-augmented language models through few-shot prompting for open-domain question answering
Angeliki Lazaridou
E. Gribovskaya
Wojciech Stokowiec
N. Grigorev
KELM
LRM
57
138
0
10 Mar 2022
End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering
Devendra Singh Sachan
Siva Reddy
William L. Hamilton
Chris Dyer
Dani Yogatama
OOD
RALM
92
170
0
09 Jun 2021
Top2Vec: Distributed Representations of Topics
D. Angelov
75
349
0
19 Aug 2020
REALM: Retrieval-Augmented Language Model Pre-Training
Kelvin Guu
Kenton Lee
Zora Tung
Panupong Pasupat
Ming-Wei Chang
RALM
147
2,121
0
10 Feb 2020
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction
Leland McInnes
John Healy
James Melville
202
9,479
0
09 Feb 2018
Probabilistic Latent Semantic Analysis
Thomas Hofmann
449
2,808
0
23 Jan 2013