High Utility Itemsets Mining for Transactional Databases

G Kurubindu, P Lalitha kumari

Abstract


Mainstream issue in data mining, which is called "high-utility itemset mining" or all the more for the most part utility mining. High Utility Itemsets which are itemsets having an utility gathering a client determined least utility edge value i.e min_util. The principle target of utility mining is to discover thing sets with highest utilities, by thinking about benefit, amount, cost or some other client inclinations. Research has been done in region of mining HUI's. Different procedures have been connected. The fundamental issue with setting edge value which is for the most part client particular, is it should be proper. In Order to set most fitting or right Threshold value for mining HUI's,user needs to do trial and mistake which thus is tedious and repetitive process, in light of the fact that if min_util is set too low, framework will bring about getting substantial data of HUI, which thus makes framework incapable with the end goal of HUI. In the event that we set min_util too high, this will bring about getting little sum or no HUI's. Consequently setting least edge value is troublesome. The proposed framework is following Top-k framework for mining top-k HUI's, which is utilizing two algorithms TKU (mining top-k utility itemsets) and TKO (mining top-k in one phase),without setting min_util edge.


Keywords


Frequent Itemset Mining, High Utility Itemset, Closed High Utility Itemsets, Top-k Mining, and Transaction Utility.

References


R. Agrawal and R. Srikant, “Fast algorithms for mining association rules,” in Proc. Int. Conf. Very Large Data Bases, 1994, pp. 487–499.

C. Ahmed, S. Tanbeer, B. Jeong, and Y. Lee, “Efficient tree structures for high-utility pattern mining in incremental databases,” IEEE Trans. Knowl. Data Eng., vol. 21, no. 12, pp. 1708–1721, Dec. 2009.

K. Chuang, J. Huang, and M. Chen, “Mining top-k frequent patterns in the presence of the memory constraint,” VLDB J., vol. 17, pp. 1321–1344, 2008.

R. Chan, Q. Yang, and Y. Shen, “Mining high-utility itemsets,” in Proc. IEEE Int. Conf. Data Mining, 2003, pp. 19–26.

P. Fournier-Viger and V. S. Tseng, “Mining top-k sequential rules,” in Proc. Int. Conf. Adv. Data Mining Appl., 2011, pp. 180–194.

P. Fournier-Viger, C.Wu, and V. S. Tseng, “Mining top-k association rules,” in Proc. Int. Conf. Can. Conf. Adv. Artif. Intell., 2012, pp. 61–73.

P. Fournier-Viger, C. Wu, and V. S. Tseng, “Novel concise representations of high utility itemsets using generator patterns,” in Proc. Int. Conf. Adv. Data Mining Appl. Lecture Notes Comput. Sci., 2014, vol. 8933, pp. 30–43.

J. Han, J. Pei, and Y. Yin, “Mining frequent patterns without candidate generation,” in Proc. ACM SIGMOD Int. Conf. Manag. Data, 2000, pp. 1–12.

J. Han, J. Wang, Y. Lu, and P. Tzvetkov, “Mining top-k frequent closed patterns without minimum support,” in Proc. IEEE Int. Conf. Data Mining, 2002, pp. 211–218.

S. Krishnamoorthy, “Pruning strategies for mining high utility itemsets,” Expert Syst. Appl., vol. 42, no. 5, pp. 2371–2381, 2015.


Full Text: PDF[Full Text]

Refbacks

  • There are currently no refbacks.


Copyright © 2013, All rights reserved.| ijseat.com

Creative Commons License
International Journal of Science Engineering and Advance Technology is licensed under a Creative Commons Attribution 3.0 Unported License.Based on a work at IJSEat , Permissions beyond the scope of this license may be available at http://creativecommons.org/licenses/by/3.0/deed.en_GB.