Paper
6 April 2000 Theoretical sampling for data mining
Author Affiliations +
Abstract
Given a finite sequence of vectors (numerical tuples), there is a complexity associated to it, called data complexity. The 'simplest' pattern that is supported by this data set has a complexity, called pattern complexity. Then the 'smallest' sub-sequence, whose pattern complexity and data complexity are both equal to the pattern complexity of the original sequence, is the smallest sample, called theoretical sample. This paper investigates such samples.
© (2000) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Tsau Young Lin "Theoretical sampling for data mining", Proc. SPIE 4057, Data Mining and Knowledge Discovery: Theory, Tools, and Technology II, (6 April 2000); https://doi.org/10.1117/12.381733
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data mining

Databases

Knowledge discovery

Lithium

MATLAB

Electrical engineering

Information operations

RELATED CONTENT

DICOM data storage and retrieval with MongoDB
Proceedings of SPIE (April 04 2022)
A topological-based spatial data clustering
Proceedings of SPIE (April 20 2016)
Efficient mining of strongly correlated item pairs
Proceedings of SPIE (April 18 2006)
PNP: mining of profile navigational patterns
Proceedings of SPIE (March 12 2002)

Back to Top