The Architecture and Datasets of Docear’s Research Paper Recommender System

Pre-print of
Joeran Beel, Stefan Langer, Bela Gipp, and Andreas Nürnberger. 2014. The Architecture and Datasets of Docear’s Research Paper Recommender System. In Proceedings of the 3rd International Workshop on Mining Scientific Publications (WOSP 2014) at the ACM/IEEE Joint Conference on Digital Libraries (JCDL 2014). Downloaded from http://www.docear.org.

Joeran Beel
Otto-von-Guericke University
Dept. of Computer Science
Magdeburg, Germany
beel@ovgu.org

Stefan Langer
Docear
Magdeburg, Germany
langer@docear.org

ABSTRACT
In the past few years, we have developed a research paper recommender system for our reference management software Docear. In this paper, we introduce the architecture of the recommender system and four datasets. The architecture comprises multiple components, e.g., for crawling PDFs, generating user models, and calculating content-based recommendations. It supports researchers and developers in building their own research paper recommender systems and is, to the best of our knowledge, the most comprehensive architecture that has been released in this field.
The four datasets contain metadata of 9.4 million academic articles, including 1.8 million articles publicly available on the Web; the articles’ citation network; anonymized information on 8,059 Docear users; information about the users’ 52,202 mind-maps and personal libraries; and details on the 308,146 recommendations that the recommender system delivered. The datasets are a unique source of information to enable, for instance, research on collaborative filtering, content-based filtering, and the use of reference management and mind-mapping software.

Categories and Subject Descriptors
H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval – information filtering.

General Terms
Algorithms, Design, Experimentation

Keywords
Dataset,



Citations: dcr_doc_id_54421) (Figure 4). This allows weighting schemes such as TF-IDF to be applied to citations, i.e
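
The idea of treating citation identifiers like terms and weighting them with TF-IDF can be illustrated with a minimal sketch. It is not Docear's implementation: the function name `citation_tfidf`, the input layout, and the smoothed IDF formula are assumptions chosen for illustration, and every identifier other than dcr_doc_id_54421 is hypothetical.

```python
import math
from collections import Counter


def citation_tfidf(user_citations, corpus_citations):
    """Weight each cited-document id in one user model with TF-IDF.

    user_citations:   list of cited ids (repeats allowed) found in the
                      user's documents (hypothetical input layout).
    corpus_citations: list of lists; each inner list holds the cited ids
                      of one document in the corpus.
    """
    # Term frequency: how often the user cites each document.
    tf = Counter(user_citations)

    # Document frequency: in how many corpus documents each id is cited.
    df = Counter()
    for cited_ids in corpus_citations:
        df.update(set(cited_ids))

    n_docs = len(corpus_citations)
    weights = {}
    for doc_id, count in tf.items():
        # Smoothed IDF (an assumed variant): rarely cited papers score higher.
        idf = math.log((1 + n_docs) / (1 + df[doc_id])) + 1.0
        weights[doc_id] = count * idf
    return weights


# Toy data: dcr_doc_id_7 is hypothetical and cited by every corpus document.
user = ["dcr_doc_id_54421", "dcr_doc_id_54421", "dcr_doc_id_7"]
corpus = [
    ["dcr_doc_id_7"],
    ["dcr_doc_id_7", "dcr_doc_id_54421"],
    ["dcr_doc_id_7"],
]
weights = citation_tfidf(user, corpus)
```

In this toy data, the citation that the user makes twice and that is rare in the corpus receives a higher weight than the citation that appears in every corpus document, which is the effect a TF-IDF scheme is meant to produce.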
