Knowledge Reused Outlier Detection

dc.contributor.author: Yu, Weiren
dc.contributor.author: Ding, Zhengming
dc.contributor.author: Hu, Chunming
dc.contributor.author: Liu, Hongfu
dc.contributor.department: Computer and Information Science, School of Science (en_US)
dc.date.accessioned: 2020-08-06T20:17:24Z
dc.date.available: 2020-08-06T20:17:24Z
dc.date.issued: 2019-03
dc.description.abstract: Tremendous effort has been invested in unsupervised outlier detection research, which is conducted on unlabeled data sets under abnormality assumptions. With abundant related labeled data available as auxiliary information, we consider transferring knowledge from the labeled source data to facilitate unsupervised outlier detection on the target data set. To make full use of the source knowledge, the source data and target data are put together for joint clustering and outlier detection, using the source data cluster structure as a constraint. To achieve this, the categorical utility function is employed to regularize the partitions of the target data to be consistent with the source data labels. With an augmented matrix, the problem is solved by a K-means-based method with a rigorous mathematical formulation and a theoretical convergence guarantee. We used four real-world data sets and eight outlier detection methods of different kinds for extensive experiments and comparison. The results demonstrate the effectiveness and significant improvements of the proposed methods in terms of outlier detection and cluster validity metrics. Moreover, a parameter analysis is provided as a practical guide, and a noisy source label analysis shows that the proposed method can handle real applications where source labels can be noisy. (en_US)
dc.eprint.version: Final published version (en_US)
dc.identifier.citation: Yu, W., Ding, Z., Hu, C., & Liu, H. (2019). Knowledge Reused Outlier Detection. IEEE Access, 7, 43763–43772. https://doi.org/10.1109/ACCESS.2019.2906644 (en_US)
dc.identifier.uri: https://hdl.handle.net/1805/23546
dc.language.iso: en (en_US)
dc.publisher: IEEE (en_US)
dc.relation.isversionof: 10.1109/ACCESS.2019.2906644 (en_US)
dc.relation.journal: IEEE Access (en_US)
dc.rights: Publisher Policy (en_US)
dc.source: Publisher (en_US)
dc.subject: outlier detection (en_US)
dc.subject: transfer learning (en_US)
dc.subject: K-means (en_US)
dc.title: Knowledge Reused Outlier Detection (en_US)
dc.type: Article (en_US)
Files
Original bundle
Name: Yu_2019_knowledge.pdf
Size: 7.87 MB
Format: Adobe Portable Document Format