Università della Svizzera italiana

Statistical models for the analysis of short user-generated documents : author identification for conversational documents

Inches, Giacomo ; Crestani, Fabio (Dir.)

Doctoral thesis : Università della Svizzera italiana, 2014 ; 2014INFO019.

In recent years, short user-generated documents have been gaining popularity on the Internet and attention in the research communities. Documents of this kind are generated by users of various online services: platforms for instant messaging, real-time status posting, discussion, and review writing. Each of these services allows users to generate written texts...

Università della Svizzera italiana

Investigating the statistical properties of user-generated documents

Inches, Giacomo ; Carman, Mark J. ; Crestani, Fabio

In: Lecture notes in computer science, 2011, vol. 7022, p. 198-209

The importance of the Internet as a communication medium is reflected in the large number of documents generated every day by users of the various online services. In this work we aim to analyze the properties of these online user-generated documents for some of the established services on the Internet (Kongregate, Twitter, Myspace and Slashdot) and to compare them...

Università della Svizzera italiana

Statistics of online user-generated short documents

Inches, Giacomo ; Carman, Mark J. ; Crestani, Fabio

In: Lecture notes in computer science, 2010, vol. 5993, p. 649-652

User-generated short documents assume an important role in online communication due to the established utilization of social networks and real-time text messaging on the Internet. In this paper we compare the statistics of different online user-generated datasets and traditional TREC collections, investigating their similarities and differences. Our results support the applicability of...