E-CLoG: Counting edge-centric local graphlets

dc.contributor.authorDave, Vachik S.
dc.contributor.authorAhmed, Nesreen K.
dc.contributor.authorAl Hasan, Mohammad
dc.contributor.departmentComputer and Information Science, School of Scienceen_US
dc.date.accessioned2018-11-09T16:46:06Z
dc.date.available2018-11-09T16:46:06Z
dc.date.issued2017-12
dc.description.abstractIn recent years, graphlet counting has emerged as an important task in topological graph analysis. However, the existing works on graphlet counting obtain the graphlet counts for the entire network as a whole. These works capture the key graphical patterns that prevail in a given network but they fail to meet the demand of the majority of real-life graph related prediction tasks such as link prediction, edge/node classification, etc., which require to build features for an edge (or a vertex) of a network. To meet the demand for such applications, efficient algorithms are needed for counting local graphlets within the context of an edge (or a vertex). In this work, we propose an efficient method, titled E-CLOG, for counting all 3,4 and 5 size local graphlets with the context of a given edge for its all different edge orbits. We also provide a shared-memory, multi-core implementation of E-CLOG, which makes it even more scalable for very large real-world networks. In particular, We obtain strong scaling on a variety of graphs (14x-20x on 36 cores). We provide extensive experimental results to demonstrate the efficiency and effectiveness of the proposed method. For instance, we show that E-CLOG is faster than existing work by multiple order of magnitudes; for the Wordnet graph E-CLOG counts all 3,4 and 5-size local graphlets in 1.5 hours using a single thread and in only a few minutes using the parallel implementation, whereas the baseline method does not finish in more than 4 days. We also show that local graphlet counts around an edge are much better features for link prediction than well-known topological features; our experiments show that the former enjoys between 10% to 45% of improvement in the AUC value for predicting future links in three real-life social and collaboration networks.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationDave, V. S., Ahmed, N. K., & Hasan, M. A. (2017). E-CLoG: Counting edge-centric local graphlets. In 2017 IEEE International Conference on Big Data (Big Data) (pp. 586–595). https://doi.org/10.1109/BigData.2017.8257974en_US
dc.identifier.urihttps://hdl.handle.net/1805/17739
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.isversionof10.1109/BigData.2017.8257974en_US
dc.relation.journal2017 IEEE International Conference on Big Dataen_US
dc.rightsPublisher Policyen_US
dc.sourceAuthoren_US
dc.subjectorbitsen_US
dc.subjectapproximation algorithmsen_US
dc.subjectcontexten_US
dc.titleE-CLoG: Counting edge-centric local graphletsen_US
dc.typeConference proceedingsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Dave_2018_E-CLoG.pdf
Size:
483.77 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: