Prediction and application of article potential citations based on nonlinear citation-forecasting combined model

Wang, Kehan; Shi, Wenxuan; Bai, Junsong; Zhao, Xiaoping; Zhang, Liying

doi:10.1007/s11192-021-04026-6

Prediction and application of article potential citations based on nonlinear citation-forecasting combined model

Published: 23 May 2021

Volume 126, pages 6533–6550, (2021)
Cite this article

Scientometrics Aims and scope Submit manuscript

Kehan Wang¹,
Wenxuan Shi¹,
Junsong Bai²,
Xiaoping Zhao³ &
…
Liying Zhang ORCID: orcid.org/0000-0003-3392-6670¹

661 Accesses
7 Citations
Explore all metrics

Abstract

As the number of academic articles rapidly increases, a reasonable evaluation method for the articles is highly required in the current academic research. Meanwhile, a faster access to the high-quality academic articles for the researchers is also of critical significance. This paper first improves the AVG model and presents a new Nonlinear Citation-Forecasting Combined Model (NCFCM) based on a neural network to predict the potential increase of citation counts. Then, the NCFCM is used to analyze and rank the academic articles in online databases. The results of NCFCM model are compared to the results from other existing methods. Empirical analysis and comparisons demonstrate that the NCFCM model is of high accuracy and robustness in forecasting potential citation counts and ranking academic articles. Ranking academic articles according to the potentional citation counts can help researchers retrieve the desired articles efficiently in a short time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Features, techniques and evaluation in predicting articles’ citations: a review from years 2010–2023

Article 07 December 2023

Wan Siti Nur Aiza, Liyana Shuib, … Nur Baiti Afini Normadhi

Trend-Based Citation Count Prediction for Research Articles

Predicting the impact of no. of authors on no. of citations of research publications based on neural networks

Article Open access 31 May 2022

Fatma M. Talaat & Samah A. Gamel

References

Abrishami, A., & Aliakbary, S. (2019). Predicting citation counts based on deep neural network learning techniques. Journal of Informetrics, 13(2), 485–499. https://doi.org/10.1016/j.joi.2019.02.011.
Article Google Scholar
Aladag, C. H., Egrioglu, E., & Yolcu, U. (2010). Forecast combination by using artificial neural networks. Neural Processing Letters, 32(3), 269–276. https://doi.org/10.1007/s11063-010-9156-7.
Article MATH Google Scholar
Avramescu, A. (2010). Actuality and obsolescence of scientific literature. Journal of the American Society for Information Science and Technology, 30(5), 296–303.
Article Google Scholar
Bai, X., Liu, H., Zhang, F., Ning, Z., Kong, X., Lee, I., & Xia, F. (2017). An overview on evaluating and predicting scholarly article impact. Information, 8(3), 73. https://doi.org/10.3390/info8030073.
Article Google Scholar
Bai, X., Zhang, F., & Lee, I. (2019). Predicting the citations of scholarly paper. Journal of Informetrics, 13, 407–418. https://doi.org/10.1016/j.joi.2019.01.010.
Article Google Scholar
Bates, J. M., & Granger, C. W. J. (2001). The combination of forecasts. Cambridge: Cambridge University Press.
Book Google Scholar
Bello, I., Pham, H., Le, Q.V., Norouzi, M., & Bengio, S.: Neural combinatorial optimization with reinforcement learning. arXiv preprint arXiv:1611.09940 (2016). https://arxiv.org/abs/1611.09940
Bose, B. K. (2007). Neural network applications in power electronics and motor drives-an introduction and perspective. IEEE Transactions on Industrial Electronics, 54(1), 14–33. https://doi.org/10.1109/TIE.2006.888683.
Article Google Scholar
Cao, X., Chen, Y., & Liu, K. R. (2016). A data analytic approach to quantifying scientific impact. Journal of Informetrics, 10(2), 471–484.
Article Google Scholar
Chakraborty, T., Kumar, S., Goyal, P., Ganguly, N., & Mukherjee, A.: Towards a stratified learning approach to predict future citation counts. In: IEEE/ACM Joint Conference on Digital Libraries, pp. 351–360 (2014). https://doi.org/10.1109/JCDL.2014.6970190
Chen, J., Zhang, C.: Predicting citation counts of papers. In: 2015 IEEE 14th International Conference on Cognitive Informatics Cognitive Computing (ICCI*CC), pp. 434–440 (2015). https://doi.org/10.1109/ICCI-CC.2015.7259421
Chen, K. Y. (2011). Combining linear and nonlinear model in forecasting tourism demand. Expert Systems with Applications, 38(8), 10368–10376. https://doi.org/10.1016/j.eswa.2011.02.049.
Article Google Scholar
Clauset, A., Larremore, D. B., & Sinatra, R. (2017). Data-driven predictions in the science of science. Science, 355(6324), 477–480. https://doi.org/10.1126/science.aal4217.
Article Google Scholar
Cressey, Daniel. (2015). sleeping beauty papers slumber for decades. Nature.
Egghe, L. (2006). Theory and practise of the g-index. Scientometrics, 69(1), 131–152. https://doi.org/10.1007/s11192-006-0144-7.
Article MathSciNet Google Scholar
Garfield, E. (1972). Citation analysis as a tool in journal evaluation. Science, 178(4060), 471–479.
Article Google Scholar
Garfield, E. (2006). The history and meaning of the journal impact factor. Journal of the American Medical Association, 295(1), 90–93.
Article Google Scholar
Hirsch, J. E. (2005). An index to quantify an individual’s scientific research output. Proceedings of the National Academy of Sciences, 102(46), 16569–16572. https://doi.org/10.1073/pnas.0507655102.
Article MATH Google Scholar
Jin, B., Liang, L., Rousseau, R., & Egghe, L. (2007). The r-and ar-indices: Complementing the h-index. Chinese science bulletin, 52(6), 855–863. https://doi.org/10.1007/s11434-007-0145-9.
Article Google Scholar
Khashei, M., & Bijari, M. (2010). An artificial neural network (p, d, q) model for timeseries forecasting. Expert Systems with Applications, 37(1), 479–489. https://doi.org/10.1016/j.eswa.2009.05.044.
Article MATH Google Scholar
Kolidakis, S., Botzoris, G., Profillidis, V., & Lemonakis, P. (2019). Road traffic forecasting- a hybrid approach combining artificial neural network with singular spectrum analysis. Economic Analysis and Policy, 64, 159–171. https://doi.org/10.1016/j.eap.2019.08.002.
Article Google Scholar
Lariviere, V., Archambault, E., & Gingras, Y. (2010). Long-term variations in the aging of scientific literature: From exponential growth to steady-state science (1900–2004). Journal of the American Society for Information Science and Technology, 59(2), 288–296.
Article Google Scholar
Lawrence, S., Giles, C. L., Tsoi, Ah Chung, & Back, A. D. (1997). Face recognition: A convolutional neural-network approach. IEEE Transactions on Neural Networks, 8(1), 98–113. https://doi.org/10.1109/72.554195.
Article Google Scholar
Liu, D. C., & Nocedal, J. (1989). On the limited memory bfgs method for large scale optimization. Mathematical Programming, 45(1), 503–528.
Article MathSciNet Google Scholar
Liu, X.: A grey neural network and input-output combined forecasting model and its application in primary energy related co2 emissions estimation by sector in china. Energy Procedia 36, 815 – 824 (2013). https://doi.org/10.1016/j.egypro.2013.07.094. TerraGreen 13 International Conference 2013 - Advancements in Renewable Energy and Clean Environment
Liu, X., Moreno, B., & Garcí, A. S. (2016). A grey neural network and input-output combined forecasting model. primary energy consumption forecasts in spanish economic sectors. Energy, 115, 1042–1054. https://doi.org/10.1016/j.energy.2016.09.017.
Article Google Scholar
Melin, P., & Sínchez, D. (2018). Multi-objective optimization for modular granular neural networks applied to pattern recognition. Information Sciences, 460–461, 594–610. https://doi.org/10.1016/j.ins.2017.09.031.
Article MathSciNet Google Scholar
Moed, H. F., Colledge, L., Reedijk, J., Moya-Anegon, F., Guerrero-Bote, V., Plume, A., & Amin, M. (2012). Citation-based metrics are appropriate tools in journal assessment provided that they are accurate and used in an informed way. Scientometrics, 92(2), 367–376. https://doi.org/10.1007/s11192-012-0679-8.
Article Google Scholar
Molinari, J. F., & Molinari, A. (2008). A new methodology for ranking scientific institutions. Scientometrics, 75(1), 163–174. https://doi.org/10.1007/s11192-007-1853-2.
Article MathSciNet Google Scholar
Newman, M. E. J. (2009). The first-mover advantage in scientific publication. EPL (Europhysics Letters), 86(6), 68001. https://doi.org/10.1209/0295-5075/86/68001.
Article Google Scholar
Pobiedina, N., & Ichise, R. (2016). Citation count prediction as a link prediction problem. Applied Intelligence, 44(2), 252–268. https://doi.org/10.1007/s10489-015-0657-y.
Article Google Scholar
Qing KE, Emilio F, Filippo, R. (2015). Alessandro, & Flammini: Defining and identifying sleeping beauties in science. Proceedings of the National Academy of Sciences of the United States of America.
Redner, S. (1998). How popular is your paper? an empirical study of the citation distribution. European Physical Journal B, 4(2), 131–134. https://doi.org/10.1007/s100510050359.
Article Google Scholar
Tahamtan, I., Afshar, A. S., & Ahamdzadeh, K. (2016). Factors affecting number of citations: A comprehensive review of the literature. Scientometrics, 107(3), 1195–1225. https://doi.org/10.1007/s11192-016-1889-2.
Article Google Scholar
Teshnehlab, M., & Kyoko, W. (2013). Intelligent control based on flexible neural networks,. Springer Science & Business Media.
Van Raan, A. F. (2004). Sleeping beauties in science. Scientometrics, 59(3), 467–472. https://doi.org/10.1023/B:SCIE.0000018543.82441.f1.
Article Google Scholar
Waltman, L. (2016). A review of the literature on citation impact indicators. Journal of Informetrics, 10(2), 365–391. https://doi.org/10.1016/j.joi.2016.02.007.
Article Google Scholar
Wang, Mingyang, Chen, Guangsheng, Li, & Shi: Detecting latent referential articles based on their vitality performance in the latest 2 years. Scientometrics An International Journal for All Quantitative Aspects of the Science of Science Policy (2017)
Wang, D., Song, C., & Barabási, A. L. (2013). Quantifying long-term scientific impact. Science, 342(6154), 127–132. https://doi.org/10.1126/science.1237825.
Article Google Scholar
Wang, J., Mei, Y., & Hicks, D. (2014). Comment on quantifying long-term scientific impact. Science, 345(6193), 149–149. https://doi.org/10.1126/science.1248770.
Article Google Scholar
Wang, J., Zhu, S., Zhang, W., & Lu, H.: Combined modeling for electric load forecasting with adaptive particle swarm optimization. Energy 35(4), 1671–1678 (2010). https://doi.org/10.1016/j.energy.2009.12.015. Demand Response Resources: the US and International Experience
Wang, M., Wang, Z., & Chen, G. (2019). Which can better predict the future success of articles? bibliometric indices or alternative metrics. Scientometrics.
Webber, W., Moffat, A., & Zobel, J.: A similarity measure for indefinite rankings. ACM Trans. Inf. Syst. 28(4) (2010). https://doi.org/10.1145/1852102.1852106
Wildgaard, L., Schneider, J. W., & Larsen, B. (2014). A review of the characteristics of 108 author-level bibliometric indicators. Scientometrics, 101(1), 125–158. https://doi.org/10.1007/s11192-014-1423-3.
Article Google Scholar
Xu, J., Li, M., Jiang, J., Ge, B., & Cai, M. (2019). Early prediction of scientific impact based on multi-bibliographic features and convolutional neural network. IEEE Access, 7, 92248–92258. https://doi.org/10.1109/ACCESS.2019.2927011.
Article Google Scholar
Yu, T., Yu, G., Li, P. Y., & Wang, L. (2014). Citation impact prediction for scientific papers using stepwise regression analysis. Scientometrics, 101(2), 1233–1252. https://doi.org/10.1007/s11192-014-1279-6.
Article Google Scholar
Yuan, S., Tang, J., Zhang, Y., Wang, Y., & DXiao, T.: Modeling and predicting citation count via recurrent neural network with long short-term memory. CoRR abs/1811.02129 (2018)
Zeng, A., Shen, Z., Zhou, J., Wu, J., Fan, Y., Wang, Y., & Stanley, H. E. (2017). The science of science: From the perspective of complex systems. Physics Reports, 714—-715, 1–73.
Article MathSciNet Google Scholar
Zhang, G. (2003). Time series forecasting using a hybrid arima and neural network model. Neurocomputing, 50, 159–175.
Article Google Scholar
Zhou, Z., Si, G., Zheng, K., Xu, X., Qu, K., & Zhang, Y. (2019). Cmbcf: A cloud model based hybrid method for combining forecast. Applied Soft Computing, 85, 105766.
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Mathematics and Statistics, Zhengzhou University, Zhengzhou, China
Kehan Wang, Wenxuan Shi & Liying Zhang
School of Health Management, Southern Medical University, Guangzhou, China
Junsong Bai
School of Physical Science, University of California. Irvine, Irvine, USA
Xiaoping Zhao

Authors

Kehan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wenxuan Shi
View author publications
You can also search for this author in PubMed Google Scholar
Junsong Bai
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoping Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Liying Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liying Zhang.

Additional information

This work was supported by the National Social Science Foundation of China (NO.18BTJ021) and National Training Program of Innovation and Entrepreneurship for Undergraduates (NO.202010459036)

A Appendix

See Table 6.

Table 6 Literature Data

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, K., Shi, W., Bai, J. et al. Prediction and application of article potential citations based on nonlinear citation-forecasting combined model. Scientometrics 126, 6533–6550 (2021). https://doi.org/10.1007/s11192-021-04026-6

Download citation

Received: 29 July 2020
Accepted: 29 April 2021
Published: 23 May 2021
Issue Date: August 2021
DOI: https://doi.org/10.1007/s11192-021-04026-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Prediction and application of article potential citations based on nonlinear citation-forecasting combined model

Abstract

Access this article

Similar content being viewed by others

Features, techniques and evaluation in predicting articles’ citations: a review from years 2010–2023

Trend-Based Citation Count Prediction for Research Articles

Predicting the impact of no. of authors on no. of citations of research publications based on neural networks

References

Author information

Authors and Affiliations

Corresponding author

Additional information

A Appendix

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Prediction and application of article potential citations based on nonlinear citation-forecasting combined model

Abstract

Access this article

Similar content being viewed by others

Features, techniques and evaluation in predicting articles’ citations: a review from years 2010–2023

Trend-Based Citation Count Prediction for Research Articles

Predicting the impact of no. of authors on no. of citations of research publications based on neural networks

References

Author information

Authors and Affiliations

Corresponding author

Additional information

A Appendix

A Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation