In: The Web Conference 2021, Ljubljana, Slovenia, April 12-23, 2021, 2021, p. 1-12
Scientific peer review is pivotal to maintain quality standards for academic publication. The effectiveness of the reviewing process is currently being challenged by the rapid increase of paper submissions in various conferences. Those venues need to recruit a large number of reviewers of different levels of expertise and background. The submitted reviews often do not meet the conformity...
|
In: Proceedings of the VLDB Endowment, 2020, vol. 14, no. 3, p. 294-306
With the emergence of the Internet of Things (IoT), time series streams have become ubiquitous in our daily life. Recording such data is rarely a perfect process, as sensor failures frequently occur, yielding occasional blocks of data that go missing in multiple time series. These missing blocks do not only affect real-time monitoring but also compromise the quality of online data analyses....
|
In: Proceedings of the VLDB Endowment, 2020, vol. 13, no. 5, p. 768-782
Recording sensor data is seldom a perfect process. Failures in power, communication or storage can leave occasional blocks of data missing, affecting not only real-time monitoring but also compromising the quality of near- and off-line data analysis. Several recovery (imputation) algorithms have been proposed to replace missing blocks. Unfortunately, little is known about their relative...
|
In: Proceedings of the 13th International Conference on Semantic Systems, 2017, p. 65–72
Webpages are an abundant source of textual information with manually annotated entity links, and are often used as a source of training data for a wide variety of machine learning NLP tasks. However, manual annotations such as those found on Wikipedia are sparse, noisy, and biased towards popular entities. Existing entity linking systems deal with those issues by relying on simple statistics...
|
In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018, p. 437–446
The graph embedding paradigm projects nodes of a graph into a vector space, which can facilitate various downstream graph analysis tasks such as node classification and clustering. To efficiently learn node embeddings from a graph, graph embedding techniques usually preserve the proximity between node pairs sampled from the graph using random walks. In the context of a heterogeneous graph,...
|
In: IEEE Transactions on Knowledge and Data Engineering, 2019, vol. 31, no. 3, p. 507–520
Personalized recommendation is crucial to help users find pertinent information. It often relies on a large collection of user data, in particular users' online activity (e.g., tagging/rating/checking-in) on social media, to mine user preference. However, releasing such user activity data makes users vulnerable to inference attacks, as private data (e.g., gender) can often be inferred from...
|
In: Proceedings of the Web Conference (WWW 2019)
|
In: Proceedings of the Web Conference (WWW 2019)
|
In: Proceedings of the Web Conference (WWW 2019)
|
In: IEEE Transactions on Knowledge and Data Engineering, 2018, p. 1–1
Histogram-based similarity has been widely adopted in many machine learning tasks. However, measuring histogram similarity is a challenging task for streaming histograms, where the elements of a histogram are observed one after the other in an online manner. The ever-growing cardinality of histogram elements over the data streams makes any similarity computation inefficient in that case. To...
|