Digital Library

Title:      PROJECTION BASED SAMPLING FOR MORE EFFICIENT HIGH UTILITY ITEMSET MINING
Author(s):      Alva Erwin, Raj P. Gopalan, N.R. Achuthan
ISBN:      978-972-8939-23-6
Editors:      António Palma dos Reis and Ajith P. Abraham
Year:      2010
Edition:      Single
Keywords:      High Utility Itemset Mining, Sampling, Frequent Itemset Mining
Type:      Full Paper
First Page:      77
Last Page:      84
Language:      English
Paper Abstract:      High Utility Itemset Mining is a generalization of Frequent Itemset Mining in which not only the presence or absence of items matters, but also their utility, expressed as quantity and profit. Mining high utility patterns is difficult, especially in a large database, because of the combinatorial explosion of patterns to be considered and because the downward closure property cannot be applied for pruning. Sampling can reduce the size of the dataset to be mined, but its usefulness depends on the accuracy of the result and on the level of accuracy required for a given purpose. In this paper we propose a projection based sampling algorithm for mining High Utility Itemsets that improves the accuracy of mining compared to simple random sampling. Experiments on real and synthetic datasets show the effectiveness of the proposed algorithm.
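
To make the notions of utility and downward closure in the abstract concrete, the following minimal Python sketch computes itemset utilities by brute force on a toy database. The transactions, profit values, threshold, and function names are hypothetical illustrations and are not taken from the paper; the sketch does not implement the paper's projection based sampling algorithm.

```python
from itertools import combinations

# Hypothetical toy data: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"b": 1, "c": 3},
    {"a": 1, "b": 1, "c": 1},
]
# Hypothetical per-unit profit of each item.
profit = {"a": 1, "b": 5, "c": 2}


def utility(itemset, transactions, profit):
    """Sum quantity * profit over every transaction containing the whole itemset."""
    items = set(itemset)
    total = 0
    for t in transactions:
        if items.issubset(t):  # the transaction contains every item of the itemset
            total += sum(t[i] * profit[i] for i in items)
    return total


def high_utility_itemsets(transactions, profit, min_util):
    """Brute-force enumeration of all itemsets; only feasible for tiny examples."""
    all_items = sorted({i for t in transactions for i in t})
    result = {}
    for k in range(1, len(all_items) + 1):
        for combo in combinations(all_items, k):
            u = utility(combo, transactions, profit)
            if u >= min_util:
                result[combo] = u
    return result


huis = high_utility_itemsets(transactions, profit, min_util=15)
print(huis)
print(utility(("a",), transactions, profit))
```

With a threshold of 15, the itemset {a, b} qualifies although its subset {a} does not. This is why the downward closure property used in frequent itemset mining cannot prune candidates directly here.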
   
