Results 1 -
4 of
4
A middleware for developing parallel data mining implementations
- In Proceedings of the first SIAM conference on Data Mining
, 2001
"... Data mining is an interdisciplinary field, having applications in diverse areas like bioinformatics, medical informatics, scientific data analysis, financial analysis, consumer profiling, etc. In each of these application domains, the amount of data available for analysis has exploded in recent year ..."
Abstract
-
Cited by 17 (10 self)
- Add to MetaCart
Data mining is an interdisciplinary field, having applications in diverse areas like bioinformatics, medical informatics, scientific data analysis, financial analysis, consumer profiling, etc. In each of these application domains, the amount of data available for analysis has exploded in recent years, making the scalability of data
High-Performance Data Mining with Skeleton-based Structured Parallel Programming
- PARALLEL COMPUTING, SPECIAL ISSUE ON PARALLEL DATA INTENSIVE COMPUTING
, 2001
"... We show how to apply a Structured Parallel Programming methodology based on skeletons to Data Mining problems, reporting several results about three commonly used mining techniques, namely association rules, decision tree induction and spatial clustering. We analyze the structural patterns common to ..."
Abstract
-
Cited by 13 (4 self)
- Add to MetaCart
We show how to apply a Structured Parallel Programming methodology based on skeletons to Data Mining problems, reporting several results about three commonly used mining techniques, namely association rules, decision tree induction and spatial clustering. We analyze the structural patterns common to these applications, looking at application performance and software engineering efficiency. Our aim is to clearly state what features a Structured Parallel Programming Environment should have to be useful for parallel Data Mining. Within the skeleton-based PPE SkIE that we have developed, we study the different patterns of data access of parallel implementations of Apriori, C4.5 and DBSCAN. We need to address large partitions reads, frequent and sparse access to small blocks, as well as an irregular mix of small and large transfers, to allow efficient development of applications on huge databases. We examine the addition of an object/component interface to the skeleton structured model, to simplify the development of environment-integrated, parallel Data Mining applications.
The Design of Discovery Net: Towards Open Grid Services for Knowledge Discovery
- High-Performance Computing Applications
, 2003
"... Citations (this article cites 18 articles hosted on the ..."
Abstract
-
Cited by 12 (7 self)
- Add to MetaCart
Citations (this article cites 18 articles hosted on the

