E-CLoG: Counting edge-centric local graphlets
dc.contributor.author | Dave, Vachik S. | |
dc.contributor.author | Ahmed, Nesreen K. | |
dc.contributor.author | Al Hasan, Mohammad | |
dc.contributor.department | Computer and Information Science, School of Science | en_US |
dc.date.accessioned | 2018-11-09T16:46:06Z | |
dc.date.available | 2018-11-09T16:46:06Z | |
dc.date.issued | 2017-12 | |
dc.description.abstract | In recent years, graphlet counting has emerged as an important task in topological graph analysis. However, the existing works on graphlet counting obtain the graphlet counts for the entire network as a whole. These works capture the key graphical patterns that prevail in a given network but they fail to meet the demand of the majority of real-life graph related prediction tasks such as link prediction, edge/node classification, etc., which require to build features for an edge (or a vertex) of a network. To meet the demand for such applications, efficient algorithms are needed for counting local graphlets within the context of an edge (or a vertex). In this work, we propose an efficient method, titled E-CLOG, for counting all 3,4 and 5 size local graphlets with the context of a given edge for its all different edge orbits. We also provide a shared-memory, multi-core implementation of E-CLOG, which makes it even more scalable for very large real-world networks. In particular, We obtain strong scaling on a variety of graphs (14x-20x on 36 cores). We provide extensive experimental results to demonstrate the efficiency and effectiveness of the proposed method. For instance, we show that E-CLOG is faster than existing work by multiple order of magnitudes; for the Wordnet graph E-CLOG counts all 3,4 and 5-size local graphlets in 1.5 hours using a single thread and in only a few minutes using the parallel implementation, whereas the baseline method does not finish in more than 4 days. We also show that local graphlet counts around an edge are much better features for link prediction than well-known topological features; our experiments show that the former enjoys between 10% to 45% of improvement in the AUC value for predicting future links in three real-life social and collaboration networks. | en_US |
dc.eprint.version | Author's manuscript | en_US |
dc.identifier.citation | Dave, V. S., Ahmed, N. K., & Hasan, M. A. (2017). E-CLoG: Counting edge-centric local graphlets. In 2017 IEEE International Conference on Big Data (Big Data) (pp. 586–595). https://doi.org/10.1109/BigData.2017.8257974 | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/17739 | |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | 10.1109/BigData.2017.8257974 | en_US |
dc.relation.journal | 2017 IEEE International Conference on Big Data | en_US |
dc.rights | Publisher Policy | en_US |
dc.source | Author | en_US |
dc.subject | orbits | en_US |
dc.subject | approximation algorithms | en_US |
dc.subject | context | en_US |
dc.title | E-CLoG: Counting edge-centric local graphlets | en_US |
dc.type | Conference proceedings | en_US |