Skip to main content
Log in

h-Index-based link prediction methods in citation network

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

Link prediction implies the mining of the missing links in networks or prediction of the next node pair to be connected by a link. Link prediction is useful for mining information in citation networks, and most of the existing related studies commonly use degree rather than more advanced methods to measure the importance of nodes. However, such a method cannot easily measure the importance of a paper in reality; some papers have high degree in citation networks but are not very influential. This issue restricts the performance of the link prediction methods applied to citation networks. The current study analyzed h-type indices, which are more suitable than degree for measuring the importance of citation network nodes. We propose two h-index-based link prediction methods. Experiments conducted on real citation networks demonstrate that the use of h-type index to measure the importance of nodes in citation networks can significantly improve the prediction accuracy of link prediction methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Adamic, L. A., & Adar, E. (2003). Friends and neighbors on the Web. Social Networks, 25(3), 211–230.

    Article  Google Scholar 

  • Ayaz, S., Masood, N., & Islam, M. A. (2017). Predicting scientific impact based on h-index. Scientometrics, 114(3), 993–1010.

    Article  Google Scholar 

  • Batagelj, V., & Mrvar, A. (2006). Pajek datasets website. http://vlado.fmf.uni-lj.si/pub/networks/data/. Accessed 14 Jan 2013.

  • Campiteli, M. G., Holanda, A. J., Soares, L. D. H., Soles, P. R. C., & Kinouchi, O. (2013). Lobby index as a network centrality measure. Physica A: Statistical Mechanics and Its Applications, 392(21), 5511–5515.

    Article  Google Scholar 

  • Clauset, A., Moore, C., & Newman, M. E. (2008). Hierarchical structure and the prediction of missing links in networks. Nature, 453(7191), 98–101.

    Article  Google Scholar 

  • Ermann, L., Frahm, K. M., & Shepelyansky, D. L. (2015). Google matrix analysis of directed networks. Reviews of Modern Physics, 87(4), 1261.

    Article  Google Scholar 

  • Getoor, L., & Diehl, C. P. (2005). Link mining: A survey. ACM SIGKDD Explorations Newsletter, 7(2), 3–12.

    Article  Google Scholar 

  • Glänzel, W. (2012). The role of core documents in bibliometric network analysis and their relation with h-type indices. Scientometrics, 93(1), 113–123.

    Article  Google Scholar 

  • Gleich, D. F. (2015). PageRank beyond the Web. SIAM Review, 57(3), 321–363.

    Article  MathSciNet  MATH  Google Scholar 

  • Guimerà, R., & Sales-Pardo, M. (2009). Missing and spurious interactions and the reconstruction of complex networks. Proceedings of the National Academy of Sciences, 106(52), 22073–22078.

    Article  Google Scholar 

  • Hirsch, J. E. (2005). An index to quantify an individual’s scientific research output. Proceedings of the National Academy of Sciences, 102(46), 16569.

    Article  MATH  Google Scholar 

  • Kitsak, M., Gallos, L. K., Havlin, S., Liljeros, F., Muchnik, L., Stanley, H. E., et al. (2010). Identification of influential spreaders in complex networks. Nature Physics, 6(11), 888–893.

    Article  Google Scholar 

  • Korn, A., Schubert, A., & Telcs, A. (2009). Lobby index in networks. Physica A: Statistical Mechanics and Its Applications, 388(11), 2221–2226.

    Article  Google Scholar 

  • Lü, L., & Zhou, T. (2011). Link prediction in complex networks: A survey. Physica A: Statistical Mechanics and Its Applications, 390(6), 1150–1170.

    Article  Google Scholar 

  • Lü, L., Zhou, T., Zhang, Q.-M., & Stanley, H. E. (2016). The H-index of a network node and its relation to degree and coreness. Nature Communications, 7, 10168.

    Article  Google Scholar 

  • Price, D. J. (1965). Network of scientific papers. Science, 149(3683), 510–515.

    Article  Google Scholar 

  • Salton, G., & McGill, M. J. (1986). Introduction to modern information retrieval. New York City: McGraw-Hill Inc.

    MATH  Google Scholar 

  • Schubert, A. (2009). Using the h-index for assessing single publications. Scientometrics, 78(3), 559–565.

    Article  MathSciNet  Google Scholar 

  • Schubert, A., Korn, A., & Telcs, A. (2009). Hirsch-type indices for characterizing networks. Scientometrics, 78(2), 375–382.

    Article  Google Scholar 

  • Uzzi, B., & Jones, B. (2013). Atypical combinations and scientific impact. Science, 342(6157), 468–472.

    Article  Google Scholar 

  • Newman, M. (2010). Networks: An introduction. Oxford: Oxford University Press.

    Book  MATH  Google Scholar 

  • Yang, Y., Nishikawa, T., & Motter, A. E. (2017). Small vulnerable sets determine large network cascades in power grids. Science, 358(6365), eaan3184.

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by National Natural Science Foundation of China (NSFC No. 71203135).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Wen Zhou or Yifan Jia.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, W., Gu, J. & Jia, Y. h-Index-based link prediction methods in citation network. Scientometrics 117, 381–390 (2018). https://doi.org/10.1007/s11192-018-2867-7

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-018-2867-7

Keywords

Navigation