Lavalee Singh1 Arun Singh2 1 M.Tech (C.S.) Student IIMT Engineering College Meerut (U.P.) India lovely_198631@rediffmail.com 2Associate Professor IIMT Engineering College Meerut (U.P.) India
Abstract
The World-Wide-Web contains a large amount of information. Everyone can store and retrieve the information from web. It is difficult to find the relevant piece of information from web. Extracting the important information from web is called Web Mining. Web mining technologies are best suited for web information extraction and information retrieval. Web mining is one of the mining technologies, which applies data mining techniques in large amount of web data to improve the web services. We are going to give a brief description of web mining and its categorization namely: web content mining, web structure mining and web usage mining. This paper also reports the web data mining with applications. Keywords: Web Mining, Information Extraction, Information Retrieval, Web content mining, Web structure mining, Web usage mining and Web crawling
1.0 INTRODUCTION
The World Wide Web is a popular and interactive medium to disseminate information today. With the explosive growth of information sources available on the World Wide Web, it has become increasingly necessary for users to utilize automated tools in order to find, extract, filter, and evaluate the desired information and resources. The World Wide Web provides a vast source of information of almost all types, ranging from DNA databases to resumes to lists of popular
References: [2] O.etzioni. The world wield web: Quagmire or Gold Mining. Communicate of the ACM, (39)11:65-68, 1996. [3] Rekha Jain, Dr. G. N. Purohit, Page Ranking Algorithms for Web Mining, International Journal of Computer Applications (0975 – 8887) Volume 13– No.5, January 2011. [4] Masashi Toyoda, Masaru Kitsuregawa What’s Really New on the Web? Identifying New Pages from a Series of Unstable Web Snapshots, WWW 2006, May 23–26, 2006, Edinburgh, Scotland. ACM 1-59593-323-9/06/0005. [6] Johannes Fürnkranz, WEB MINING, TU Darmstadt, Knowledge Engineering Group [7] Hong T, Chiang M, Wang S H, "Mining weighted browsing patterns with linguistic minimum supports", 2002 IEEE International Conference on Systems, Man and Cybernetics, 2002,Yasmine Hammamet, Tunisia, pp. 635-639.