ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.14482
  4. Cited By
SpanSeq: Similarity-based sequence data splitting method for improved
  development and assessment of deep learning projects

SpanSeq: Similarity-based sequence data splitting method for improved development and assessment of deep learning projects

22 February 2024
A. F. Florensa
J. J. A. Armenteros
Henrik Nielsen
F. Aarestrup
P. Clausen
ArXivPDFHTML

Papers citing "SpanSeq: Similarity-based sequence data splitting method for improved development and assessment of deep learning projects"

2 / 2 papers shown
Title
Deduplicating Training Data Makes Language Models Better
Deduplicating Training Data Makes Language Models Better
Katherine Lee
Daphne Ippolito
A. Nystrom
Chiyuan Zhang
Douglas Eck
Chris Callison-Burch
Nicholas Carlini
SyDa
242
595
0
14 Jul 2021
Memorization vs. Generalization: Quantifying Data Leakage in NLP
  Performance Evaluation
Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance Evaluation
Aparna Elangovan
Jiayuan He
Karin Verspoor
TDI
FedML
167
89
0
03 Feb 2021
1