Preview

Web Portal

Powerful Essays
Open Document
Open Document
3194 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Web Portal
Automatic Identification of Temporal Information in Tourism Web Pages
Stéphanie Weiser*, Philippe Laublet**, Jean-Luc Minel*
* MoDyCo, UMR 7114, CNRS 200 avenue de la République, 92001 Nanterre ** LaLIC, Université Paris-Sorbonne Maison de la recherche, 28 rue Serpente 75006 Paris E-mail: steph.weiser@gmail.com, Philippe.Laublet@paris-sorbonne.fr, jminel@u-paris10.fr

Abstract
This paper presents our work on the detection of temporal information in web pages. The pages examined within the scope of this study were taken from the tourism sector and the temporal information in question is thus particular to this area. The differences that exist between extraction from plain textual data and extraction from the web are brought to light. These differences mainly concern the spatial arrangement of the text, the use of punctuation and the respect of traditional syntactic rules. The temporal expressions to be extracted are classified into two kinds: temporal information that concerns one particular event and repetitive temporal information. We adopt a symbolic approach relying on patterns and rules for the detection, extraction and annotation of temporal expressions; our method is based on the use of transducers. First evaluations have shown promising results. Since the visual structure of a web page is very important and often informs the user before he has even read the text, a semiotic study is also presented in this paper.

1. Introduction
With the methods of the Semantic Web, portal applications can be created, relying on ontologies. For these applications and many service applications, temporal information is often essential. For example, a tourism web portal would need information about the type of tourism object and its location in time and space. In addition, the extracted information must be stored in the knowledge base according to the ontology used by the application. In this paper we will focus on temporal information in tourism web pages. The temporal



References: Battistelli, D., Minel, J.-L., Schwer, S. (2006). Représentation des expressions calendaires dans les textes : une application à la lecture assistée de biographies, Traitement Automatique des Langues, 47, 3, pp.1--26. Bry, F. Lorenz, B. Ohlbach, H. J. Spranger, S. (2003). On Reasoning on Time and Location on the Web, Lecture Notes in Computer Science, Springer-Verlag, Germany, pp. 69--83. Noël, L., Carloni, O., Moreau, N., Weiser, S. (2008). Designing a Knowledge-Based Tourism Information System, Int. J. of Digital Culture and Electronic Tourism, Special Issue on National Tourism Organisations and Exploitation of Information Technologies, to be published. Stern, R.-D. (2007). Expression linguistique du temps et représentation ontologique : OWL-Time et étude des adverbiaux temporels, Mémoire de Master IILGI, Université de Paris-Sorbonne. Tenier, S., Toussaint, Y., Napoli, A. et Polanco, X. (2006). Instantiation of relations for semantic annotation, In the 2006 IEEE/WIC/ACM International Conference on Web Intelligence - WI 2006, pp. 463-472 131

You May Also Find These Documents Helpful

  • Best Essays

    INFS1602 Assignment A

    • 3808 Words
    • 16 Pages

    16. X Ning, H. J. (2008). RSS: A Framwork Enabling Ranked Research on the Semantic Web. Information Processing and Management .…

    • 3808 Words
    • 16 Pages
    Best Essays
  • Satisfactory Essays

    This file comprises BSHS 352 Week 1 Paper on Analyzing a Web Page Individual Paper…

    • 442 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    EAGLES. Evaluation of natural language processing systems. (1995). Retrieved October 29, 2006 from the Université de Genève web site: http://www.issco.unige.ch/ewg95/…

    • 5023 Words
    • 21 Pages
    Powerful Essays
  • Satisfactory Essays

    Web Master

    • 612 Words
    • 3 Pages

    General Dynamics is moving forward with offsite 2-day training session if the cost and resources are evaluated to see what our financial projection scope to ensure our budget limitations are feasible, and approval. General Dynamics will be consulting with the including the number or type of resources, critical task sequencing, and how duration estimates. For our 2-day training session we will need to acquire finances for labor, material, periderm. The task duration and critical task sequencing will be added to the process of this implementation.…

    • 612 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    Isds Ch 5

    • 3328 Words
    • 14 Pages

    11) By applying a learning algorithm to parsed text, researchers from Stanford University's NLP lab have…

    • 3328 Words
    • 14 Pages
    Good Essays
  • Good Essays

    Part of Speech Recognizer

    • 3200 Words
    • 13 Pages

    References: [1] S. L. Abebe and P. Tonella. Natural language parsing of program element names for concept extraction. In 18th IEEE International Conference on Program Comprehension. IEEE, 2010. [2] K. Atkinson. Spell checking oriented word lists (scowl). [3] E. Boschee, R. Weischedel, and A. Zamanian. Automatic information extraction. In Proceedings of the International Conference on Intelligence Analysis, 2005. [4] B. Caprile and P. Tonella. Restructuring program identifier names. In ICSM, 2000. [5] ML Collard, HH Kagdi, and JI Maletic. An XML-based lightweight C++ fact extractor. Program Comprehension, 2003. 11th IEEE International Workshop on, pages 134–143, 2003. [6] E. Høst and B. Østvold. The programmer’s lexicon, volume i: The verbs. In International Working Conference on Source Code Analysis and Manipulation, Beijing, China, September 2008. [7] E. W. Høst and B. M. Østvold. Debugging method names. In ECOOP 09. Springer Berlin / Heidelberg, 2009. [8] J. Jiang and C. Zhai. Instance weighting for domain adaptation in nlp. In ACL 2007, 2007. [9] D. Lawrie, D. Binkley, and C. Morrell. Normalizing source code vocabulary. In Proceedings of the 17th Working Conference on Reverse Engineering, 2010. [10] L. Shen, G. Satta, and A. K. Joshi. Guided learning for bidirectional sequence classification. In ACL 07. ACL, June 2007. [11] D. Shepherd, Z. P. Fry, E. Hill, L. Pollock, and K. Vijay-Shanker. Using natural language program analysis to locate and understand action-oriented conerns. In AOSD 07. ACM, March 2007. [12] K. Toutanova, D. Klein, C. Manning, and Y. Singer. Feature-rich part-of-speech tagging with a cyclic dependency network. In HLTNAACL 2003, 2003.…

    • 3200 Words
    • 13 Pages
    Good Essays
  • Powerful Essays

    References: 1. Fromkin, V., Rodman, R., Hyams, N. An Introduction to Language. Thomson-Heinle Corporation Inc. Harcourt Brace and Jovanovich Co. 7th Edition, 2003.…

    • 2339 Words
    • 7 Pages
    Powerful Essays
  • Powerful Essays

    11) By applying a learning algorithm to parsed text, researchers from Stanford University's NLP lab have…

    • 2954 Words
    • 12 Pages
    Powerful Essays
  • Satisfactory Essays

    Case study

    • 617 Words
    • 3 Pages

    Evaluation of the overall quality of each case study will be made on the following criteria:…

    • 617 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    Website Project Plan

    • 3909 Words
    • 16 Pages

    McBride Financial is a startup lending company that plans on serving the regional area of Idaho, Montana, Wyoming, North Dakota, and South Dakota. The organization offers credit reports, mortgages, inspections, and appraisal services for a fixed rate of $1,500. McBride’s business plan involves utilizing limited personnel with the focus on technology to expand the organization’s reach through online loan applications and financial education, whether from the client’s home or in-office kiosk (Apollo Group, Inc., 2003).…

    • 3909 Words
    • 16 Pages
    Powerful Essays
  • Powerful Essays

    The advent of the Internet has been one of the most exciting major events in the second…

    • 2567 Words
    • 11 Pages
    Powerful Essays
  • Powerful Essays

    Mintzberg est un auteur phare de la théorie des organisations. Cet ouvrage présente deux intérêts :…

    • 2510 Words
    • 11 Pages
    Powerful Essays
  • Powerful Essays

    Bullet Screen Case Study

    • 1149 Words
    • 5 Pages

    The main purpose of nature language processing is to process, understand, and apply all kinds of human beings’ languages in written or oral forms. In recent years, “Bullet Screen”, which allows vieauthorsrs to post bullet-like, real-time comments on screen during their watching films, is an emerging craze in online video sites, especially in China and Japan, mainly popular among young people for their social interactivities. Meanwhile, since “Bullet Screen” can be regarded as a novel type of natural language, processing, restoring, and organizing them play an important role in providing retrieval platforms for users so as to help them find out highlights from oceans of videos. Taking Bilibili, a Chinese video authorsbsite, as example, this…

    • 1149 Words
    • 5 Pages
    Powerful Essays
  • Best Essays

    Managers Problems

    • 1843 Words
    • 8 Pages

    Session#8 - July 20, 2009 - Synchronous Meeting, 4:30 PM - 6:30 PM Central Time Indexing Another Format Readings -Rhind-Tutt, Stephen. "Different Direction for Electronic Publishers: How Indexing Can Increase Functionality." Technicalities 21(3):1,13-15, May/June, 2001. Available Online Research Resources, Library Literature & Information Science. -Wellisch, Hans. "Alphanumeric Arrangement." In: Indexing From A to Z. 2d ed. New York, H W Wilson, 1995. pp.6-22. [Electronic Reserves] -Leise, Fred. "Using Faceted Classification to Assist Indexing." http://www.contextualanalysis.com/pub_usingfacets.php -Anderson, James D. "Section 5: Design of Indexes." In: Guidelines for Indexes and Related Information Retrieval Devices. Baltimore, MD, NISO Press, 1997. pp.10-13. Download the report in PDF for free at: http://www.niso.org/standards/ ANSI/NISO Z39.14 [PDF linked there]…

    • 1843 Words
    • 8 Pages
    Best Essays
  • Satisfactory Essays

    The importance of the Internet grows rapidly in all fields of human life, including not only research and education but also marketing and trade as well as entertainment and hobbies. This implies that it becomes more and more important to know how to use Internet services and, as a part of this, to read and write English.…

    • 478 Words
    • 2 Pages
    Satisfactory Essays

Related Topics