Data Mining The Mushroom Database

Title: “Data Mining: The Mushroom Database”
Author: Hemendra Pal Singh*

The paper under review, “Data Mining: The Mushroom Database,” focuses on the study of a mushroom database and related data sets. The purpose of the research is to extend earlier research by running new data sets of stylometry, keystroke capture, and mouse movement data through Weka. Weka stands for Waikato Environment for Knowledge Analysis; it is a popular suite of machine learning software written in Java and developed at the University of Waikato. Weka is free to download and is available under the GNU General Public License. To analyze the mushroom database, the researchers apply data mining through Weka using various data mining algorithms. The study will also extend earlier research at Pace University into the use of a human-machine interface to increase the accuracy of machine learning.
To explain the use of the various algorithms in this study, the paper discusses each of them. Naïve Bayes and Apriori will be used against the Stylometry data set. IBk will be used against the Keystroke Capture and Mouse Movement data sets. J48 will be used with the Mushroom Database. The choice of these techniques and their implementation will be discussed in detail in the methodologies section. According to Witten and Frank in Data Mining, the Naïve Bayes method is “based on Bayes’s rule and ‘Naïvely’ assumed independence,” and they note that “it is only valid to multiply probabilities when the events are independent. The assumption that attributes are independent in real life certainly is a simplistic one.”
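The independence assumption in the quotation above can be made concrete with a small sketch. The code below is an illustration only, not the paper's Weka workflow and not Weka's own NaiveBayes implementation. The six toy records, the attribute names (odor, cap color), and the use of add-one (Laplace) smoothing are all assumptions introduced here for the example. It shows the step Witten and Frank describe: the class prior is multiplied by one conditional probability per attribute, as if the attributes were independent given the class.

```python
from collections import Counter, defaultdict

# Hypothetical toy records, invented for illustration (not from the real
# Mushroom Database): ((odor, cap_color), class_label).
TRAIN = [
    (("foul", "brown"), "poisonous"),
    (("foul", "white"), "poisonous"),
    (("none", "white"), "poisonous"),
    (("none", "brown"), "edible"),
    (("none", "white"), "edible"),
    (("almond", "brown"), "edible"),
]


def train(records):
    """Count class frequencies and per-class attribute-value frequencies."""
    class_counts = Counter(label for _, label in records)
    # value_counts[label][attribute_index][value] -> count
    value_counts = defaultdict(lambda: defaultdict(Counter))
    values_seen = defaultdict(set)  # distinct values per attribute index
    for features, label in records:
        for i, value in enumerate(features):
            value_counts[label][i][value] += 1
            values_seen[i].add(value)
    return class_counts, value_counts, values_seen


def posterior(model, features):
    """Return normalized class probabilities for one record."""
    class_counts, value_counts, values_seen = model
    total = sum(class_counts.values())
    scores = {}
    for label, n in class_counts.items():
        score = n / total  # class prior
        for i, value in enumerate(features):
            # The "naive" step: multiply per-attribute probabilities as if
            # attributes were independent given the class. Add-one smoothing
            # (an assumption of this sketch) avoids zero probabilities.
            count = value_counts[label][i][value]
            score *= (count + 1) / (n + len(values_seen[i]))
        scores[label] = score
    norm = sum(scores.values())
    return {label: s / norm for label, s in scores.items()}


def classify(model, features):
    """Pick the class with the highest posterior probability."""
    post = posterior(model, features)
    return max(post, key=post.get)
```

For example, after `model = train(TRAIN)`, calling `classify(model, ("foul", "brown"))` returns `"poisonous"` on this toy data. If two attributes were strongly correlated in real data, the product would count the same evidence twice, which is the simplification the quotation warns about.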
The methodologies that they use are several different
