Discovering critical edge sequences in E-commerce catalogs Conference

Dutta, K, VanderMeer, D, Datta, A et al. (2001). Discovering critical edge sequences in E-commerce catalogs . 65-74. 10.1145/501158.501166

cited authors

  • Dutta, K; VanderMeer, D; Datta, A; Ramamritham, K

abstract

  • Web sites allow the collection of vast amounts of navigational data - clickstreams of user traversals through the site. These massive data stores offer the tantalizing possibility bility of uncovering interesting patterns within the dataset. For e-businesses, always looking for an edge in the hyper-competitive online marketplace, this possibility is of particular interest. Of significant particular interest to e-businesses is the discovery of Critical Edge Sequences (CES), which denote frequently traversed subpaths in the catalog. CESs can be used to improve site performance and site management, increase the effectiveness of advertising on the site, and gather additional knowledge of customer interest patterns on the site. Using traditional graph-based and web mining strategies to find CESs could turn out to be expensive in both space and time. In this paper, we propose a method to compute the most popular paths between node pairs in a catalog, which are then used to discover CESs. Our method is both space-efficient and accurate, providing a vast reduction in the storage requirement with a minimum impact on accuracy. This algorithm, executed off-line in batch mode, is also practical with respect to running time. As a variant of single-source shortest-path, it runs in log linear time.

publication date

  • January 1, 2001

Digital Object Identifier (DOI)

start page

  • 65

end page

  • 74