Networked XML Compression by Encoding Pre-order Traversals

  • Conference paper
Web Information Systems and Technologies (WEBIST 2013)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 189))

Abstract

The advantages of the eXtensible Markup Language (XML) come at a cost, especially for huge datasets or when XML is used on small mobile devices. Several known XML-conscious compressors designed for real-time environments compress data while it is being streamed. This paper presents a study of new real-time algorithms that exploit local structural redundancies in pre-order traversals of an XML tree. These algorithms focus on reducing the overhead of streaming data while keeping the load balanced between the sender and the receiver. Our algorithms perform similarly to or better than existing algorithms, while emphasizing low memory and processing overheads.
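
As a rough illustration of what a local structural redundancy in a pre-order traversal can look like, the sketch below lists the element tags of a tiny XML document in pre-order and then collapses consecutive repeats into (tag, count) pairs. This is not the paper's algorithm: the run-length step is a simple stand-in chosen for illustration, and the function names `preorder_tags` and `run_length` as well as the sample document are assumptions made here.

```python
import xml.etree.ElementTree as ET
from itertools import groupby


def preorder_tags(element):
    """Yield element tags in pre-order: the node first, then its children."""
    yield element.tag
    for child in element:
        yield from preorder_tags(child)


def run_length(tags):
    """Collapse consecutive identical tags into (tag, count) pairs.

    Hypothetical stand-in for a structural encoder; the schemes studied
    in the paper are not reproduced here.
    """
    return [(tag, sum(1 for _ in group)) for tag, group in groupby(tags)]


# Hypothetical sample document with three repeated sibling elements.
doc = "<list><item/><item/><item/><note/></list>"
tags = list(preorder_tags(ET.fromstring(doc)))
encoded = run_length(tags)
print(tags)
print(encoded)
```

In this toy case the three adjacent `item` tags shrink to a single pair, which hints at why repeated sibling structure is cheap to encode when it appears consecutively in the traversal.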

Acknowledgements

The work of the first and second authors is partially supported by an NSERC CGS-M (Canada Graduate Scholarship-Master's) and an NSERC RG-PIN grant, respectively. We would like to thank the anonymous reviewers for their detailed and helpful comments, which were used to improve the final version of our paper.

Author information

Corresponding author

Correspondence to Tomasz Müldner.

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Corbin, T., Müldner, T., Miziołek, J.K. (2014). Networked XML Compression by Encoding Pre-order Traversals. In: Krempels, KH., Stocker, A. (eds) Web Information Systems and Technologies. WEBIST 2013. Lecture Notes in Business Information Processing, vol 189. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44300-2_1

  • DOI: https://doi.org/10.1007/978-3-662-44300-2_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-44299-9

  • Online ISBN: 978-3-662-44300-2

  • eBook Packages: Computer Science, Computer Science (R0)
