Academic Journals Database
Disseminating quality controlled scientific knowledge

Improving a system to centralize the results returned by web crawlers for scientific documents

ADD TO MY LIST
 
Author(s): Liviu Mihai Simion | Ana Maria Lepar

Journal: Computer Science Master Research
ISSN 2247-5575

Volume: 1;
Issue: 2;
Start page: 34;
Date: 2011;
Original page

Keywords: Google Scholar | DBLP | research article | citations | papers | Data mining

ABSTRACT
There are a couple of web applications that index the scientific activity. We can mention GoogleScholar, Portal ACM, Xplore etc. Each site uses a certain database, but for a given scientist we may findsome articles indexed on Google Scholar for example and other articles on Xplore. In our research wefocus to integrate all the results returned by a couple of web crawlers for a certain researcher. We start byinterrogating Google Scholar to find the articles created by a certain person. In order to do this we havecreated an API (Google Scholar has no API) that sends requests to the site and interprets the results,keeping the name of the article, the number of the citations, the co-authors. We query other sources and tryto make a comparison between the results we have received. In our research, we have used the DBLPComputer Science Bibliography of University Trier, http://www.informatik.uni-trier.de/~ley/db/, as asecond source. Other sources can be integrated as well in the application and there can be generatedstatistics.
Why do you need a reservation system?      Save time & money - Smart Internet Solutions