2402.14482
SpanSeq: Similarity-based sequence data splitting method for improved development and assessment of deep learning projects
22 February 2024
A. F. Florensa, J. J. A. Armenteros, Henrik Nielsen, F. Aarestrup, P. Clausen
Papers citing
"SpanSeq: Similarity-based sequence data splitting method for improved development and assessment of deep learning projects"
2 / 2 papers shown
Title
Deduplicating Training Data Makes Language Models Better
Katherine Lee, Daphne Ippolito, A. Nystrom, Chiyuan Zhang, Douglas Eck, Chris Callison-Burch, Nicholas Carlini
SyDa
242, 595, 0
14 Jul 2021
Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance Evaluation
Aparna Elangovan, Jiayuan He, Karin Verspoor
TDI, FedML
167, 89, 0
03 Feb 2021