Latent Dirichlet Truth Discovery: Separating Trustworthy And Untrustworthy Components In Data Sources

Keywords

latent Dirichlet model; trustworthy component; Truth discovery

Abstract

The discovery of truth is a critical step toward effective information and knowledge utilization, especially in Web services, social media networks, and sensor networks. Typically, a set of sources with varying reliability claim observations about a set of objects and the goal is to jointly discover the true fact for each object and the trustworthy degree of each source. In this paper, we propose a latent Dirichlet truth (LDT) discovery model to approach this problem. It defines a random field over all the possible configurations of the trustworthy degrees of sources and facts, and the most probable configuration is inferred by a maximum a posteriori criterion over the observed claims. We note that a typical source is usually made of mixed trustworthy and untrustworthy components, since it can make true or false claims on different objects. While most of the existing algorithms do not attempt separate the untrustworthy component from the trustworthy one in each source, the proposed model explicitly identifies untrustworthy component in each source. This makes the LDT model more capable of separating the trustworthy and untrustworthy components, and in turn improves the accuracy of truth discovery. Experiments on real data sets show competitive results compared with existing algorithms.

Publication Date

12-4-2017

Publication Title

IEEE Access

Volume

6

Number of Pages

1741-1752

Document Type

Article

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/ACCESS.2017.2780182

Socpus ID

85038400242 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/85038400242

This document is currently not available here.

Share

COinS