Text Cube: Computing IR Measures for Multidimensional Text Database Analysis
Since Jim Gray introduced the concept of the "data cube"
in 1997, the data cube, associated with online analytical processing
(OLAP), has become a driving engine in the data warehouse
industry. Because the boom of the Internet has given rise
to an ever-increasing amount of text data associated with
other multidimensional information, it is natural to propose
a data cube model that integrates the power of traditional
OLAP and IR techniques for text. In this paper, we propose
a Text-Cube model on multidimensional text databases and
study effective OLAP over such data. Two kinds of hierarchies
are distinguished in the model: the dimensional hierarchy
and the term hierarchy. By incorporating these hierarchies, we
conduct systematic studies on efficient text-cube implementation,
OLAP execution and query processing. Our performance
study shows the high promise of our methods.
Date: December 02, 2008
Book Title: Int. Conf. on Data Mining (ICDM'08)
Type: InProceedings
Edition: Proc 2008
Address: Pisa, Italy
BibTeX:
@InProceedings{Text_Cube_Computing_IR_Measures_for_Mult,
author = "Cindy Xide Lin and Bolin Ding and Jiawei Han and Feida Zhu and Bo Zhao",
title = "{Text Cube: Computing IR Measures for Multidimensional Text Database Analysis}",
month = "December",
year = "2008",
edition = "Proc 2008",
address = "Pisa, Italy",
booktitle = "Int. Conf. on Data Mining (ICDM'08)",
}