Schema based XML compression Conference

Rishe, N, Wolfson, O, Wongsaroj, B et al. (2007). Schema based XML compression . 1-6.

cited authors

  • Rishe, N; Wolfson, O; Wongsaroj, B; Small, D; Alarcon, M; Lorenzo, N; Koller, R; Kundu, S; Graham, S; Alexander, K; Adjouadi, M

abstract

  • XML has grown into a widely used and highly developed technology, due in part to the subcomponents built around the technology (advanced parsers, frameworks, libraries, etc). The use of XML reduces development time and increases the robustness of distributed applications. Due to these advantages, a large and growing range of distributed applications, such as Web-services, use XML as the basic unit of communication. This paper surveys the topic of XML compression and proposes a new method that uses schema information for the compression algorithm. The schema provides valuable information to the compressor by specifying the data type and format of each element in the XML document. For example, if the compressor knows that a portion of data is numeric, it can intelligently save it using a binary representation instead of trying to compress the string representation.

publication date

  • December 1, 2007

International Standard Book Number (ISBN) 13

start page

  • 1

end page

  • 6