Sampling Triples from Restricted Networks Using MCMC Strategy

dc.contributor.authorRahman, Mahmudur
dc.contributor.authorHasan, Mohammad Al
dc.contributor.departmentDepartment of Computer Science, IUPUIen_US
dc.date.accessioned2015-12-21T20:01:35Z
dc.date.available2015-12-21T20:01:35Z
dc.date.issued2014
dc.description.abstractIn large networks, the connected triples are useful for solving various tasks including link prediction, community detection, and spam filtering. Existing works in this direction concern mostly with the exact or approximate counting of connected triples that are closed (aka, triangles). Evidently, the task of triple sampling has not been explored in depth, although sampling is a more fundamental task than counting, and the former is useful for solving various other tasks, including counting. In recent years, some works on triple sampling have been proposed that are based on direct sampling, solely for the purpose of triangle count approximation. They sample only from a uniform distribution, and are not effective for sampling triples from an arbitrary user-defined distribution. In this work we present two indirect triple sampling methods that are based on Markov Chain Monte Carlo (MCMC) sampling strategy. Both of the above methods are highly efficient compared to a direct sampling-based method, specifically for the task of sampling from a non-uniform probability distribution. Another significant advantage of the proposed methods is that they can sample triples from networks that have restricted access, on which a direct sampling based method is simply not applicable.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationRahman, M., & Hasan, M. A. (2014). Sampling Triples from Restricted Networks Using MCMC Strategy. In Proceedings of the 23rd ACM International Conference on Information and Knowledge Management (pp. 1519–1528). New York, NY, USA: ACM. http://doi.org/10.1145/2661829.2662075en_US
dc.identifier.urihttps://hdl.handle.net/1805/7786
dc.language.isoen_USen_US
dc.publisherACMen_US
dc.relation.isversionof10.1145/2661829.2662075en_US
dc.relation.journalIn Proceedings of the 23rd ACM International Conference on Information and Knowledge Managementen_US
dc.rightsIUPUI Open Access Policyen_US
dc.sourceAuthoren_US
dc.subjectapproximate triangle countingen_US
dc.subjectmarkov chain monte carlo samplingen_US
dc.subjecttriple samplingen_US
dc.titleSampling Triples from Restricted Networks Using MCMC Strategyen_US
dc.typeConference proceedingsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Rahman-2014-Sampling-triples.pdf
Size:
528.08 KB
Format:
Adobe Portable Document Format
Description:
Author's manuscript
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: