The BlogVox Opinion Retrieval System
The BlogVox system retrieves opinionated blog posts specified by ad
hoc queries. BlogVox was developed for the 2006 TREC blog track by the University of
Maryland, Baltimore County and the Johns Hopkins University
Applied Physics Laboratory using a novel system to recognize legitimate
posts and discriminate against spam blogs. It also processes posts to eliminate extraneous
non-content, including blog-rolls, link-rolls, advertisements and
sidebars. After retrieving posts relevant to a topic query, the
system processes them to produce a set of independent features
estimating the likelihood that a post expresses an opinion about the
topic. These are combined using an SVM-based system and integrated
with the relevancy score to rank the results. We evaluate BlogVox's performance against human assessors. We also evaluate the individual splog filtering and non-content removal components of BlogVox.
Date: February 09, 2007
Book Title: Proceedings of the Fifteenth Text REtrieval Conference (TREC 2006)
Type: InProceedings
Downloads: 1251
Has 1 soft copy
size 477365 bytesBibtex
@InProceedings{The_BlogVox_Opinion_Retrieval_System,
author = "Akshay Java and Pranam Kolari and Tim Finin and Anupam Joshi and Justin Martineau and James Mayfield",
title = "{The BlogVox Opinion Retrieval System}",
month = "February",
year = "2007",
booktitle = "Proceedings of the Fifteenth Text REtrieval Conference (TREC 2006)",
}