In: Journal of the American society for information science and technology, 2012, vol. 63, no. 2, p. 354–365
The goal in blog search is to rank blogs according to their recurrent relevance to the topic of the query. State-of-the-art approaches view it as an expert search or resource selection problem. We investigate the effect of content-based similarity between posts on the performance of the retrieval system. We test two different approaches for smoothing (regularizing) relevance scores of posts...
|
In: Lecture notes in computer science, 2011, vol. 7022, p. 198-209
The importance of the Internet as a communication medium is reflected in the large amount of documents being generated every day by users of the different services that take place online. In this work we aim at analyzing the properties of these online user-generated documents for some of the established services over the Internet (Kongregate, Twitter, Myspace and Slashdot) and comparing them...
|
In: Lecture notes in computer science, 2010, vol. 5993, p. 649-652
User-generated short documents assume an important role in online communication due to the established utilization of social networks and real- time text messaging on the Internet. In this paper we compare the statistics of different online user-generated datasets and traditional TREC collections, investigating their similarities and dferences. Our results support the applicability of...
|