marijakatic / The-Pagerank-Index

This project is reimplementation of the analysis undertaken in paper "The Pagerank-Index: Going beyond Citation Counts in Quantifying Scientific Impact of Researches", Upul Senanayake, Mahendra Piraveenan, Albert Zomaya

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The Pagerank-Index

Student project on course Scientific Computing, Master Degree Studies, University of Belgrade, Faculty of Mathematics

About The Project

This project is reimplementation of the analysis undertaken in paper:

"The Pagerank-Index: Going beyond Citation Counts in Quantifying Scientific Impact of Researchers", Upul Senanayake, Mahendra Piraveenan, Albert Zomaya

In this paper authors introduce the pagerank-index, which is designed to address the drawbacks of existing indices (e.g. h-index) by utilizing the underlying citation network dynamics. The index is calculated by running the well known pagerank algorithm on the evolving citation network to give scores to papers. They develop and employ a realistic simulation system which synthesizes the evolution process of citation and collaboration networks in an emerging field. They also demonstrate the utility of the new index by applying it to two real-world data sets.

In this project, a system that stores a citation network and calculates mentioned indices is implemented, mentioned simulation system is developed mostly following instructions from the paper and the indices are also applied to one of two real-world datasets the authors uploaded to the GitHub (see References ). Some additional observations were made with the obtained results.

Languages and technologies used

Notes

  • There is a lot of room for experiment with implemented simulation system which synthesizes the evolution process of citation networks. Parameters of the system (e.g. the probability of new authors being spawned, average length of paper's reference list and author list, etc.) can be varied to monitor changes in results.
  • Since the results in some scenarios on the synthesized data in this project are different from those in the paper, the cause should be investigated.
    • We can delve deeper into the nature of real-world citation networks, to understand how relevant the results on our current artificial network are.
    • We can develop a system for recognizing groups of authors who 'massage' their h-index (those who cite themselves, who work in large groups, who focus on quantity rather than quality) and apply it to real-world data to compare indices that way.
  • We can test efficiency of the system on very large real-world database.

References

Contact

About

This project is reimplementation of the analysis undertaken in paper "The Pagerank-Index: Going beyond Citation Counts in Quantifying Scientific Impact of Researches", Upul Senanayake, Mahendra Piraveenan, Albert Zomaya


Languages

Language:Jupyter Notebook 100.0%