Zoran Gacovski, Gjorgji Ilievski, Sime Arsenovski
FON University, Bul. Vojvodina, bb, Skopje, Macedonia zoran.gacovski@fon.edu.mk, gjorgji.ilievski@yahoo.com, sime.arsenovski@fon.edu.mk
Abstract. Prediction of the customer behavior is a subject that is considered to be “the holy grail” in the business. Data mining techniques are not a new subject, but the amount of data that can be processed by the modern computers and the global market that the world has become has opened a lot of opportunities. This paper considers a method for proposal of video materials to the customers in a video on demand (VOD) system, but its broader usage covers any closed system in which the user is identified before the purchase and history of previous user actions is available. By usingthe data from previous purchases in the systemand applying the well-known Apriori algorithm, a set of association rules is generated. An algorithm that uses the history of the client for which the recommendation should be made, compares it with the association rules found previously and produces the prediction for the best fit videos that will be recommended to the customer. The method is simulated using WEKA for the association rules and using T-SQL procedures and functions for the prediction algorithm. Real data from an existing and publicly available VOD (T-home’s MAX TV) system is used for the simulation. The data is put in a relational MS SQL database.
Keywords: Data mining, prediction, Apriori algorithm, association rules, video-on-demand, WEKA.
Introduction
Making high-quality predictions about the customer’s behavior is very important for any business, for planning, marketing, pricing, product management, human resources, training and almost any subject related to a management of business processes. Data mining techniques have been around for a long time, but the development of the IT infrastructure today allows
References: RakeshAgrawal, RamakrishnanSrikant, ,,Fast Algorithms for Mining Association Rules”, IBM Almaden Research Center, 650 Harry Road, San Jose, CA 95120, 1999 Mehmet AydinUlas, ,,Market Basket Analysis for Data Mining”, (PhD thesis) Bogazici University, 1999. Sotiris Kotsiantis, Dimitris Kanellopoulos, ,,Association Rules Mining: A Recent Overview”, GESTS Intern. Trans. on Computer Science and Engineering, Vol.32 (1), 2006, pp. 71-82, 2006. R. Agrawal, H. Mannila, R. Srikant, H. Toivonen& A.I. Verkamo, ,,Fast discovery of association rules", Advan. in Knowledge Discovery and Data Mining, pp. 307 - 328, 1996. YaseminBoztuğ, Lutz Hildebrandt, ,,A Market Basket Analysis Conducted with a Multivariate Logit Model", Schmalenbach Business Review (sbr), Vol. 60(4), pp. 400-422, 2005. Sally Jo Cunningham, Eibe Frank, ,,Market Basket Analysis of Library Circulation Data", Proc. of 6th International Conference on Neural Information Processing, vol. II, Perth, Australia, pp. 825-830, 1999. E.García, C.Romero, S.Ventura, C.de Castro, ,,An architecture for making recommendations to courseware authors using association rule mining and collaborative filtering", Jour. of User Modeling and User-Adapted Interaction, Vol. 19 Issue 1-2, 2009. Troy Raeder, Nitesh V. Chawla, ,,Market Basket Analysis with Networks", Social Networks Analysis and Modeling Journal, vol. 1, No. 2, pp. 97-113, 2010. Ian H. Witten &Eibe Frank, ,,Data Mining Practical Machine Learning Tool and Techniques", second edition, Morgan Kaufmann Publishers, 2005. Xiaoyuan Su and Taghi M. Khoshgoftaar, ,,A Survey of Collaborative Filtering Techniques", Advances in Artificial Intelligence, Vol. 2009 (2009), Article 421425, 2009.