Data Mining Project on IMDB Website

ABSTRACT
The Internet Movie Database (IMDb) is an online database of information about movies, television shows, stars, and related topics. Our project covers the movie listings from 2008 to 2011. We extracted the Movie, Director, Star, Image Url, and Studio fields from the IMDb website using a tool named Mozenda, and then analyzed the extracted data. For a given star, the analysis shows the movies the star appeared in and the directors and studios the star has worked with. We developed a Graphical User Interface (GUI) for this: when the user selects a star, that star's movies, directors, and studios are displayed. We also drew a graph of the extracted data with a tool named NodeXL. In this graph the nodes are stars and movies, and an edge between a star and a movie means that the star worked in that movie.
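The star lookup and the star-movie graph described in the abstract can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the project's actual code: the record layout, the placeholder sample rows, and the helper names `build_star_index` and `star_movie_edges` are hypothetical, and only the field names (Movie, Director, Star, Studio) come from the abstract.

```python
from collections import defaultdict

# Hypothetical placeholder rows; real rows would come from the extracted data.
RECORDS = [
    {"Movie": "Movie A", "Director": "Director X", "Star": "Star 1", "Studio": "Studio P"},
    {"Movie": "Movie B", "Director": "Director Y", "Star": "Star 1", "Studio": "Studio Q"},
    {"Movie": "Movie A", "Director": "Director X", "Star": "Star 2", "Studio": "Studio P"},
]


def build_star_index(records):
    """Map each star to the movies, directors and studios they worked with."""
    index = defaultdict(lambda: {"movies": set(), "directors": set(), "studios": set()})
    for row in records:
        entry = index[row["Star"]]
        entry["movies"].add(row["Movie"])
        entry["directors"].add(row["Director"])
        entry["studios"].add(row["Studio"])
    return dict(index)


def star_movie_edges(records):
    """Return the sorted, de-duplicated (star, movie) edges of the graph."""
    return sorted({(row["Star"], row["Movie"]) for row in records})


# What a GUI would display after the user selects a star.
star_index = build_star_index(RECORDS)
# One edge per star-movie pair; stars and movies are the two kinds of node.
edges = star_movie_edges(RECORDS)
```

Selecting a star then amounts to a dictionary lookup such as `star_index["Star 1"]`, and `edges` is a plain list of star-movie pairs that could be handed to a graph-drawing tool.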

DATA EXTRACTION TOOL: MOZENDA

We used Mozenda to extract the web data. In the Mozenda agent builder, we entered the URL www.imdb.com, and the site loaded inside the agent builder, where one can navigate to the pages that hold the data to extract. We chose to extract data from January 2008 to April 2011, so we entered the URL of the January 2008 page (http://www.imdb.com/nowplaying/2008/1/). Once that page had loaded, we clicked "Start new Agent from this page" in the agent builder. Because the same set of fields (movie name, director, image, studio) had to be extracted for every movie, we clicked "Create list of items" and selected the names of the first two movies on the page. A dialog box then appeared, in which we gave the field a name such as Movie. We repeated the same procedure for Director, Studio, and Image Url. Because the same type of data had to be extracted from multiple pages, we clicked "Add list pager" and then clicked the link to the next month. Now the software
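The range of monthly pages covered by the extraction, January 2008 to April 2011, can be enumerated with a short Python sketch. It assumes that every month follows the same URL pattern as the January 2008 address quoted above, which the text does not confirm; the function name `monthly_urls` is hypothetical, and the sketch only builds the addresses without fetching anything.

```python
def monthly_urls(start_year, start_month, end_year, end_month):
    """List one page URL per month, inclusive of both end points."""
    # Assumption: all months share the pattern of the January 2008 URL.
    base = "http://www.imdb.com/nowplaying/{year}/{month}/"
    urls = []
    year, month = start_year, start_month
    while (year, month) <= (end_year, end_month):
        urls.append(base.format(year=year, month=month))
        month += 1
        if month > 12:  # roll over into the next year
            year, month = year + 1, 1
    return urls


# January 2008 through April 2011: three full years plus four months.
urls = monthly_urls(2008, 1, 2011, 4)
```

The resulting list has 40 entries, one for each month that the list pager would step through.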
