Detecting Citation Types Using Finite-State Machines

Le, Minh-Hoang; Ho, Tu-Bao; Nakamori, Yoshiteru

doi:10.1007/11731139_32

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3918)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

This paper presents a method for extracting citation types from scientific articles, viewed as an intrinsic part of emerging trend detection (ETD) in scientific literature. The work makes two main contributions: (1) a definition of six categories (types) of citations that are extractable, human-understandable, and appropriate for building the interest and utility functions in ETD models; and (2) a method for classifying citation types using finite-state machines that requires neither user interaction nor explicit knowledge. Comparative experimental evaluations show the high performance of the method, and the proposed ETD model shows the crucial role of classified citation types in detecting emerging trends in scientific literature.
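
To make the idea of classifying a citation sentence with a finite-state machine concrete, the following is a minimal sketch only. It is not the authors' method: the abstract does not name the six categories or describe how the machines are built, so the cue lexicon (`CUES`), the state names, the transition table, and the three labels (`mention`, `use`, `contrast`) are all hypothetical and hand-written for illustration.

```python
import re

# Hypothetical cue lexicon (an assumption, not taken from the paper):
# maps a lowercase token to a coarse symbol class.
CUES = {
    "use": "USE", "uses": "USE", "using": "USE", "adopt": "USE", "extend": "USE",
    "however": "CONTRAST", "unlike": "CONTRAST", "but": "CONTRAST",
}
CITATION = re.compile(r"^\[\d+\]$")  # numeric markers such as [12]


def symbol(token):
    """Map a token to the input symbol consumed by the machine."""
    if CITATION.match(token):
        return "CITE"
    return CUES.get(token.lower(), "OTHER")


# Deterministic transition table: (state, symbol) -> next state.
# Pairs that are not listed leave the state unchanged.
TRANSITIONS = {
    ("start", "USE"): "saw_use",
    ("start", "CONTRAST"): "saw_contrast",
    ("start", "CITE"): "cited",
    ("saw_use", "CITE"): "cited_use",
    ("saw_use", "CONTRAST"): "saw_contrast",
    ("saw_contrast", "CITE"): "cited_contrast",
    ("cited", "USE"): "cited_use",
    ("cited", "CONTRAST"): "cited_contrast",
    ("cited_use", "CONTRAST"): "cited_contrast",
}

# Hypothetical labels attached to accepting states.
LABELS = {"cited": "mention", "cited_use": "use", "cited_contrast": "contrast"}


def classify(sentence):
    """Run the machine over the sentence and return the label of the final state."""
    state = "start"
    for token in re.findall(r"\[\d+\]|\w+", sentence):
        state = TRANSITIONS.get((state, symbol(token)), state)
    return LABELS.get(state)  # None when no citation marker was seen


print(classify("We use the parser of [3] for tokenization"))
print(classify("Unlike [7], our method needs no labels"))
```

A hand-written table like this still depends on explicit cue knowledge, which the paper states its method avoids; the sketch only shows the mechanics of running a sentence through states to reach a labelled accepting state.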

Author information

Authors and Affiliations

School of Knowledge Science, Japan Advanced Institute of Science and Technology, 1-1, Asahidai, Nomi, Ishikawa, 923-1292, Japan
Minh-Hoang Le, Tu-Bao Ho & Yoshiteru Nakamori

Editor information

Editors and Affiliations

Nanyang Technological University, Singapore
Wee-Keong Ng
Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, 153-8505, Tokyo, Japan
Masaru Kitsuregawa
School of Computer Science and Technology, Heilongjiang University, China
Jianzhong Li
School of Computer Engineering, Nanyang Technological University, 639798, Singapore, Singapore
Kuiyu Chang

About this paper

Cite this paper

Le, MH., Ho, TB., Nakamori, Y. (2006). Detecting Citation Types Using Finite-State Machines. In: Ng, WK., Kitsuregawa, M., Li, J., Chang, K. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2006. Lecture Notes in Computer Science, vol 3918. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11731139_32

DOI: https://doi.org/10.1007/11731139_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33206-0
Online ISBN: 978-3-540-33207-7
eBook Packages: Computer Science; Computer Science (R0)