Università della Svizzera italiana

Linguistic aggregation methods in blog retrieval

Keikha, Mostafa ; Crestani, Fabio

In: Information processing and management: an international journal, 2012, vol. 48, no. 3, p. 467–475

This paper addresses the blog distillation problem: given a user query, find the blogs that are most related to the query topic. We model each post as evidence of the relevance of a blog to the query, and use aggregation methods such as Ordered Weighted Averaging (OWA) operators to combine the evidence. We show that using only highly relevant evidence (posts) for each blog can result in...
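
As a rough illustration of the aggregation idea in this abstract, the sketch below applies an OWA operator to per-post relevance scores. An OWA operator sorts the scores in descending order and takes a weighted sum in which the weights belong to rank positions rather than to particular posts. The function names, the example scores, and the "average of the top k posts" weight vector are assumptions made for illustration; the abstract does not state which weights the paper uses.

```python
def owa(scores, weights):
    """Ordered Weighted Averaging: weights apply to rank positions."""
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    ordered = sorted(scores, reverse=True)  # highest score first
    return sum(w * s for w, s in zip(weights, ordered))


def top_k_weights(n, k):
    """Hypothetical weight vector: uniform over the k best posts, zero elsewhere."""
    if n < 1 or k < 1:
        raise ValueError("n and k must be at least 1")
    k = min(k, n)
    return [1.0 / k] * k + [0.0] * (n - k)


def rank_blogs(post_scores, k=2):
    """Score each blog by OWA over its post scores, then sort best first.

    post_scores maps a blog id to a list of per-post relevance scores.
    Blogs without posts are skipped.
    """
    blog_scores = {
        blog: owa(scores, top_k_weights(len(scores), k))
        for blog, scores in post_scores.items()
        if scores
    }
    return sorted(blog_scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Made-up relevance scores for two blogs with three posts each.
    example = {"blog_a": [0.9, 0.1, 0.8], "blog_b": [0.5, 0.5, 0.5]}
    for blog, score in rank_blogs(example, k=2):
        print(blog, round(score, 3))
```

With a weight vector of `[1, 0, ..., 0]` the operator reduces to the maximum, and with uniform weights it reduces to the arithmetic mean, so the choice of weights controls how much of the less relevant evidence is taken into account.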

Methods for ranking user-generated text streams: a case study in blog feed retrieval

Keikha, Mostafa ; Crestani, Fabio (supervisor)

Doctoral thesis: Università della Svizzera italiana, 2012; 2012INFO003.

User-generated content is one of the main sources of information on the Web today. With the huge amount of this type of data being generated every day, an efficient and effective retrieval system is essential. The goal of such a retrieval system is to enable users to search through this data and retrieve documents relevant to their information needs. Among the different retrieval...

Employing document dependency in blog search

Keikha, Mostafa ; Carman, Mark James ; Crestani, Fabio

In: Journal of the American society for information science and technology, 2012, vol. 63, no. 2, p. 354–365

The goal in blog search is to rank blogs according to their recurrent relevance to the topic of the query. State-of-the-art approaches view it as an expert search or resource selection problem. We investigate the effect of content-based similarity between posts on the performance of the retrieval system. We test two different approaches for smoothing (regularizing) relevance scores of posts...

Building queries for prior-art search

Mahdabi, Parvaz ; Keikha, Mostafa ; Gerani, Shima ; Landoni, Monica ; Crestani, Fabio

In: Lecture notes in computer science, 2011, vol. 6653, p. 3–15

Prior-art search is a critical step in the examination procedure of a patent application. This study explores automatic query generation from patent documents to facilitate the time-consuming and labor-intensive search for relevant patents. It is essential for this task to identify discriminative terms in different fields of a query patent, which enables us to distinguish relevant patents from...