Preview

Review

Good Essays
Open Document
Open Document
600 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Review
CHAPTER II
REVIEW OF RELATED LITERATURE

A survey of related studies was conducted by the researchers in order to provide more insight into the research in the field of an experimentation and to get support of the Boyer-Moore string searching algorithm as a relevant string matching algorithm that can be integrated with Natural Language Processing method and why it creates a better string searching process. The available literature related to the research work has been reviewed and presented under two distinct heads viz. (i) String Searching Algorithm (ii) Natural Language Processing

2.1. String Searching Algorithm
There are many existing string matching algorithms, and each is efficient and effective in one way or another. It is worth noting that string is used interchangeably with text. It is a sequence of characters that may be a set of alphabet. The researchers have selected the Boyer-Moore string matching algorithm because it is used in most software applications. Usually, it is used for searching and replacingtext in a text editor or word processing application.It is known for its efficiency. [WWW_003]. One of the many text editors that use this algorithm is the Subpad. It is a notepad with cloned features of Windows notepad. It is made faster by this algorithm that is written in assembler.Boyer-Moore string matching algorithm is developed by Robert Boyer and J Strother Moorein their paper in 1997 [REF_001]. It is practically used in the search command of all text editors [REF_002].
String matching algorithms work by matching two strings, the main string and the pattern. The main string is larger than or equal to the pattern that is the text being searched. Boyer-Moore String matching algorithm works by comparing from right to left. It is fast because it skips some of the characters. It is efficient because with each failed attempt to match between the search string and the pattern, it uses the gathered information from that attempt to

You May Also Find These Documents Helpful

  • Satisfactory Essays

    You use the grep command to search for a specified pattern in a file, such as a particular word or phrase.…

    • 604 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    Spelling correction: In natural language, misspelled errors are encountered. Companies like Google and Microsoft have achieved a decent accuracy level in automated spell correction. One can use algorithms like the Levenshtein Distances, Dictionary Lookup etc. or other modules and packages to fix these…

    • 522 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Compter Science

    • 819 Words
    • 4 Pages

    The process of searching for a special pattern of symbols within a larger collection of information…

    • 819 Words
    • 4 Pages
    Good Essays
  • Good Essays

    review

    • 711 Words
    • 3 Pages

    Sunscreen contains organic and and inorganic chemicals. The sun shines down UV Rays which are harmful to our skin. The organic chemicals in sunscreen absorb the UV Rays through their chemical bonds. As the bonds absorb UV radiation, the parts of the sunscreen slowly break down and release heat. The higher SPF a sunscreen is, the more it protects against the UV Rays the sun shines down.…

    • 711 Words
    • 3 Pages
    Good Essays
  • Good Essays

    review 3

    • 757 Words
    • 4 Pages

    family that is formed on the remarriage of a divorced or widowed person and that includes one or more children , basic family group consisting of married female and male parents and dependent children, living away from other…

    • 757 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    The method of loci, peg-word system, substitute word system, and method of word association are all examples of _____.…

    • 1087 Words
    • 5 Pages
    Satisfactory Essays
  • Satisfactory Essays

    MPI Assignment

    • 274 Words
    • 2 Pages

    Algorithms are mathematical formulas that combine weighted data elements to determine the probability of a duplicate in order to identify potential duplicate MPI entries. The three algorithms are deterministic, rules based, and probabilistic.…

    • 274 Words
    • 2 Pages
    Satisfactory Essays
  • Better Essays

    Critique

    • 1329 Words
    • 6 Pages

    Why is it that staying home to read a book is not at the top of a child’s priority list? Obviously it is because children would much rather be outside playing a game of baseball with friends than staying in to do school work. Children usually do not find school subjects interesting. In school, students learn the necessities that will generally help them get through life. Children typically are more interested in things they learn outside the classroom such as sports, video games or the Internet. What if these interests were brought into the classroom and taught? Gerald Graff, the author of “Hidden Intellectualism” argues that there is more than one way to measure intelligence. This essay significantly considers how Graff’s beliefs on “Hidden Intellectualism” can be related to today’s education system. While I somewhat agree with Graff I find it difficult to completely believe that children should be taught only of topics that interest them for the reason that a student should be well rounded and have a general understanding of the other basic academic skills.…

    • 1329 Words
    • 6 Pages
    Better Essays
  • Good Essays

    Cognitive Psy

    • 311 Words
    • 2 Pages

    A common observation within visual-search tasks, such as the one in this demonstration, is that the conjunctive absent condition takes about twice as long as the conjunctive present condition. Why would this be the case?…

    • 311 Words
    • 2 Pages
    Good Essays
  • Best Essays

    Text Mining Techniques

    • 3661 Words
    • 15 Pages

    “70% of an organization’s data is contained in free-text form” (Koslowsky, 2010); this means that there is a lot of benefits in digging data from written customer communications for a firm that’s deals directly with customers. Text mining allows businesses to notice the attitude of the customer in his/her written communications with the firm; whether the latter is angry, happy, on the verge of leaving the firm for competitors, etc. Text mining may be applied as a data extraction tool that would target those “pertinent words and phrases that identify the nature of the text” (Koslowsky, 2010). By analysing the actual words in a customer’s communication, a business can better cross-sell offerings; such usage of text mining allows businesses to be more sensitive to those customers that are prone to leave the firm for competitors and apply adequate solutions. As such, text mining is…

    • 3661 Words
    • 15 Pages
    Best Essays
  • Powerful Essays

    More people today are using the internet as their preferred source of information. Searching in databases and on the internet can be enhanced by the use of Boolean operators. Even if most people are capable of using the internet without knowledge of Boolean operators, this essay brings out their usefulness. This essay therefore attempts to support the assertion that Boolean operators are key in searching for information in databases as well as the internet. Working definitions given in this paper are Boolean operators, information search, database and the internet. Following is the history of Boolean operators; how they are used and their importance in searching for information with a conclusion based on their relevance to today’s dynamic search systems.…

    • 3084 Words
    • 13 Pages
    Powerful Essays
  • Better Essays

    In all languages, using compact or short form of words in text messages, emails, and blogs is rapidly increasing. It is particularly popularly amongst young urbanities as it allows for voiceless communication, useful in noisy environment that would defeat a voice conversation and also buffered communication since the message the sender wants to convey can be accessed by the receiver at any time. Compacting text is thus necessary because of limited message length in blog sites and tiny user interface of mobile phone. Getting the shortest word has no rule and it is mainly aimed at understanding. That is, those words should be understood by everyone. We can obtain the compact words by…

    • 1258 Words
    • 6 Pages
    Better Essays
  • Best Essays

    Image Retrieval Using Ann

    • 3358 Words
    • 14 Pages

    Previously the information was primarily text based. But with the rapid growth in the field of computer network and low cost permanent storage media, the shapes of information become more interactive. The people are accessing more multimedia files than the past. In past, images, videos and audio files were only used for the entertainment purpose but nowadays these are the major source of information. Because of intense dependency on multimedia files for information searching, to obtain a desired result is a major problem as the search engine searches within the text associated with the multimedia files, instead…

    • 3358 Words
    • 14 Pages
    Best Essays
  • Powerful Essays

    Bullet Screen Case Study

    • 1149 Words
    • 5 Pages

    While the use of natural language processing on “Bullet Screen” is beneficial to store and organize natural language, as authors provide users with search function to help users find the wonderful part of the video. According to Alexa, in January 2017, in all integrated authors site all over the world, Bilibili (http://search.bilibili.com/) ranked the 224th, while the number of its visitors ranked the 286th. It is the most active Chinese large-scale “Bullet Screen” video site. In this paper, research object is presented and related works about videos with “Bullet Screen” are detailed in 3. Research problems of the “Bullet Screen” retrieval system are discussed. Basic algorithms and the proposed ISB methods are demonstrated. Finally, the conclusions are provided. In fact, different types of “Bullet Screen” are applied in situations and have various characters. Subtitle is the result of the subtitle group (a small number of users) editing the video data. Most users use subtitle information rather than participate in the creation itself, which is similar to authorsb1.0. “Bullet Screen” is similar to Authorsb2.0, in which the users can be more interactive. In this process, the users can participate in the creation of the “Bullet Screen”. To some degree, texts “Bullet Screen” in live site have an impact on the broadcast itself. The anchor can communicate with the users through “Bullet Screen”. In this way, different “Bullet Screen” applications actually conform to the development of the Internet. Their development does not have a reciprocal relationship to each other. Whether it is subtitles, barrage or live barrage, it is not gone but applied in different situations. During the video viewing, the density of the “Bullet Screen” commentary is significantly correlated with the importance of the video…

    • 1149 Words
    • 5 Pages
    Powerful Essays
  • Best Essays

    Content-Based Image Retrieval (CBIR) is an active research discipline focused on computational strategies to search for relevant images based on visual content analysis. In this proposal, multimodal analysis is considered to develop CBIR systems, specially for image collections in which there is some text associated to images. Multimodality in Information Retrieval is sometimes referred to the interaction mechanisms and devices used to query the system. However, since the Multimedia Information Retrieval perspective, multimodality is referred to…

    • 2993 Words
    • 12 Pages
    Best Essays