Preview

Testt

Good Essays
Open Document
Open Document
4243 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Testt
International Conference on Information and Communication Technology for the Muslim World (ICT4M 2006), 21-23 November 2006, Kuala Lumpur, Malaysia

Classified Ads Harvesting Agent and Notification System
Razvi Doomun*, Lollmahamod N., Auleear Nadeem, Mozafar Aukin
Faculty of Engineering University of Mauritius, Reduit, E-mail : r.doomun@uom.ac.mu

ABSTRACT The shift from an information society to a knowledge society require rapid information harvesting, reliable search and instantaneous on demand delivery. Information extraction agents are used to explore and collect data available from Web, in order to effectively exploit such data for business purposes, such as automatic news filtering, advertisement or product searching and price comparing. In this paper, we develop a real-time automatic harvesting agent for adverts posted on Servihoo web portal and an SMS-based notification system. It uses the URL of the web portal and the object model, i.e., the fields of interests and a set of rules written using the HTML parsing functions to extract latest adverts information. The extraction engine executes the extraction rules and stores the information in a database to be processed for automatic notification. This intelligent system

aggregation for information portals, scientific research and business activity monitoring. A lot of work has been carried out into the idea of using agents to aid e-commerce, the majority of the attention being focused on B2B agents, with B2C agents receiving a little attention. Sen and Hernandez (2000) discuss the fact that many e-businesses have “seller 's agents” whose function it is to push merchandise or services to customers, and there are also “buyer 's agents" whose goal is to best serve the user 's interests. Maes (1994) discusses how agents used as “personal assistants” that collaborate with the user can be used to reduce work carried out by the user. They can also be used to help with information overload by learning a



References: C.-H. Chang, C.-N. Hsu, and S.-C. Lui. (2003) Automatic Information Extraction from Semi-Structured Web Pages by Pattern Discovery. Decision Support Systems Journal, 35(1). Crescenzi V., Mecca G., and Merialdo P. (2001) RoadRunner: Towards Automatic Data Extraction from Large Web Sites. In The VLDB Journal, pages 109– 118. Gao X. and Sterling L (1999) Semi-Structured Data Extraction from Heterogeneous Sources. In Second International Workshop on Innovative Internet Information Systems (IIIS’99), Copenhagen. Habegger B. and Quafafou M. (2002) Multi-pattern wrappers for relation extraction. In Proceedings of the 15th European Conference on Artificial Intelligence, Amsterdam, IOS Press. Hannes Marais and Tom Rodeheffer (1999). Automating the Web with WebL. In Dr. Dobb 's Journal, January 1999. http://www.w3.org/DOM/DOMTR 6.0 DISCUSSION The system developed is an Intelligent Information Harvester and SMS Agent that is the system once started, automatically launches connection to the Servihoo Web Portal Site, extracts the latest ads information from the “Petites Annonces” section and downloads it to a database. The downloaded information is then dispatched as SMS to registered clients. With such a system, no need for viewers of “Petites Annonces” to each time visit the Servihoo Portal Site and lose time and effort in navigating the classified ads section to obtain latest ads details, what they need to do is just register on the system through the client interface and specify what type of information they want the system to harvest for them and receive the latest ad details on their mobile phone. International Conference on Information and Communication Technology for the Muslim World (ICT4M 2006), 21-23 November 2006, Kuala Lumpur, Malaysia Kistler T., Marais H, (1998) WebL - A Programming Language for the Web,” in Proceedings of the 7th International World Wide Web Conference. Brisbane, Australia. Kushmerick N. (2000). “Wrapper induction: Efficiency and expressiveness” Artificial Intelligence. Laender, A., Ribeiro-Neto, B., Silva, A. and Teixeira, J. (2002) A Brief Survey of Web Data Ex-traction Tools, in: SIGMOD Record, Volume 31, Number 2, June 2002 Maes P. (1994). Agents that reduce work and information overload, Communications of the ACM, Volume 37, Number 7 (July 1994) Muslea I., Minton S. and Knoblock, C. A. (2001). Hierarchical wrapper induction for semi-structured information sources. Journal of Autonomous Agents and Multi-Agent Systems 4:93–114. Sahuguet, A., Azavant F, (2000) WysiWyg Web Wrapper Factory (W4F), in Proceedings of the 8th International World Wide Web Conference, A. Mendelzon Editor, Elzevier Science, Toronto. Sen S. and Hernandez K (2000). A buyer 's agent, In Proc ' Fourth International Conference on Autonomous Agents, 2000. W3C DOM Technical Committee. 2003 Document object model technical reports.

You May Also Find These Documents Helpful

  • Satisfactory Essays

    This file comprises BSHS 352 Week 1 Paper on Analyzing a Web Page Individual Paper…

    • 442 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    DB could benefit from using its Web site to learn more about its customer’s tastes and interests. The Web site could collect information such as visitors’ age, location, marital status, occupation, income level, hobbies, and specific motorcycle i t t t ti i l l h bbi d ifi t l interests (i l di why t (including h they might purchase a motorcycle and how they would use it—races, transportation, joining a motorcycle group). Such information will help DB to know the types of p p interested p yp people in its products and how it can communicate to them, create more targeted and personalized advertising and even affect DB’s plans for the future, including designing new products to better fit their interests. If DB…

    • 5618 Words
    • 23 Pages
    Powerful Essays
  • Good Essays

    XML can express, or model, many types of data structures, including structures that are similar to relational data, hierarchical data and loosely structured data. The use of XML as a support for the databases on the mentioned company's web site to track shipment and orders is based on many factors, and the logic behind this markup language is substantial in the success of this implementation. XML is described mostly in terms of a set of rules that define how sequences of characters are to be used so that an XML processor can process an XML document without throwing errors and that also define the physical structure, expressed as entities. But XML documents also have a logical structure that is expressed by the nesting of elements and the presence of attributes on selected elements. The highly flexible document structure means that the programmer can model many types of data, resulting for modeling both highly flexible structured data that can be stored in a relational database.…

    • 703 Words
    • 3 Pages
    Good Essays
  • Best Essays

    In order for a company to be effective in personalized marketing, the company must be able to gather information on the target individual. Today, with the power of technology this is a widespread practice on the Internet. The Internet provides a medium to make one-on-one personalization practical for a variety of firms (Schibsted, 2001). For example, a web page may establish cookies and track the buying habits of the customer. Based on the customer buying habits, advertisements are geared towards that individual.…

    • 4478 Words
    • 18 Pages
    Best Essays
  • Powerful Essays

    Wsdwd

    • 4341 Words
    • 19 Pages

    General CertiÞcate of Secondary Education June 2009 INFORMATION AND COMMUNICATION TECHNOLOGY 3522/H (SPECIFICATION B) (FULL COURSE) Written Paper Higher Tier Tuesday 19 May 2009 1.30 pm to 3.30 pm…

    • 4341 Words
    • 19 Pages
    Powerful Essays
  • Powerful Essays

    The development and growth of computer technologies has transformed the way in which companies have traditionally approached advertising. The huge presence of the Internet has dramatically transformed the face of advertising and its effectiveness. The seemingly endless amount of information that is available to users and the amount of time that is now spent on the Internet has made it a prime way to advertise and reach consumers. Because of the flexibility and control over advertising materials that is possible through the use of the Internet, it has become a widely used marketing communications tool.…

    • 2249 Words
    • 9 Pages
    Powerful Essays
  • Powerful Essays

    Good Internet Censorship

    • 2008 Words
    • 9 Pages

    Turban, B. e. a., 2012. ISYS100 Information Technology and Society CB 3e. 3rd ed. s.l.:Pearson Education Custom.…

    • 2008 Words
    • 9 Pages
    Powerful Essays
  • Powerful Essays

    XML can express, or model, many types of data structures, including structures similar to relational data, hierarchical data and loosely structured data. We recommend using XML as a support for the databases on the Baderman’s website to track website visitors, reservation, hotel promotions. The logic behind this markup language is substantial in the…

    • 2150 Words
    • 9 Pages
    Powerful Essays
  • Powerful Essays

    Website Project Plan

    • 3909 Words
    • 16 Pages

    McBride Financial is a startup lending company that plans on serving the regional area of Idaho, Montana, Wyoming, North Dakota, and South Dakota. The organization offers credit reports, mortgages, inspections, and appraisal services for a fixed rate of $1,500. McBride’s business plan involves utilizing limited personnel with the focus on technology to expand the organization’s reach through online loan applications and financial education, whether from the client’s home or in-office kiosk (Apollo Group, Inc., 2003).…

    • 3909 Words
    • 16 Pages
    Powerful Essays
  • Best Essays

    Bibliography: World Economic Forum, 2013. 'The Global Information Technology Report 2013 ' [PDF] Available at: http://www3.weforum.org/docs/WEF_GITR_Report_2013.pdf [Last Accessed: 20 May 2013]…

    • 1987 Words
    • 8 Pages
    Best Essays
  • Satisfactory Essays

    miss

    • 494 Words
    • 2 Pages

    Applications of Digital Information and Web Technologies, 2008. ICADIWT 2008. First International Conference on the…

    • 494 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    MALAYSIAN CODE OF ADVERTISING PRACTICE Advertising Standards Authority Malaysia Unit 706, Block B, Pusat Dagangan Phileo Damansara 1, 9 Jalan 16/11, Off Jalan Damansara, 46350 Petaling Jaya, Selangor, Malaysia. Tel: 03-7660 8535 Fax: 03-7660 8532…

    • 19439 Words
    • 78 Pages
    Powerful Essays
  • Better Essays

    Journal article critique

    • 1060 Words
    • 5 Pages

    The authors used one well-developed, coherent, unified and concise paragraph understandable to a wide audience. The objectives and focus of the article were clearly stated and agreed with the title. The authors introduced the methods of accomplishing the task in general, without any specifications. The abstract was written in accordance with “Descriptive abstract qualities” (Driscoll, 2013), but the information provided in it didn’t follow the organization of the report itself. Also, the authors didn’t use the keywords to ease the web search of the article on electronic information systems.…

    • 1060 Words
    • 5 Pages
    Better Essays
  • Powerful Essays

    Web Portal

    • 3194 Words
    • 13 Pages

    This paper presents our work on the detection of temporal information in web pages. The pages examined within the scope of this study were taken from the tourism sector and the temporal information in question is thus particular to this area. The differences that exist between extraction from plain textual data and extraction from the web are brought to light. These differences mainly concern the spatial arrangement of the text, the use of punctuation and the respect of traditional syntactic rules. The temporal expressions to be extracted are classified into two kinds: temporal information that concerns one particular event and repetitive temporal information. We adopt a symbolic approach relying on patterns and rules for the detection, extraction and annotation of temporal expressions; our method is based on the use of transducers. First evaluations have shown promising results. Since the visual structure of a web page is very important and often informs the user before he has even read the text, a semiotic study is also presented in this paper.…

    • 3194 Words
    • 13 Pages
    Powerful Essays
  • Satisfactory Essays

    Commerence

    • 596 Words
    • 3 Pages

    As companies began to conduct electronic commerce on the Web, the need to present large amounts of data on Web pages also became important. Companies created Web, the need to present large amounts of data on Web pages also became important. Companies created Web sites that contained lists of inventory items, sales invoices, purchase orders, and other business data. The need to keep these list updated was also important and posed a new challenge for many Web designers.…

    • 596 Words
    • 3 Pages
    Satisfactory Essays

Related Topics