Knowledge Reused Outlier Detection
dc.contributor.author | Yu, Weiren | |
dc.contributor.author | Ding, Zhengming | |
dc.contributor.author | Hu, Chunming | |
dc.contributor.author | Liu, Hongfu | |
dc.contributor.department | Computer and Information Science, School of Science | en_US |
dc.date.accessioned | 2020-08-06T20:17:24Z | |
dc.date.available | 2020-08-06T20:17:24Z | |
dc.date.issued | 2019-03 | |
dc.description.abstract | Tremendous efforts have been invested in the unsupervised outlier detection research, which is conducted on unlabeled data set with abnormality assumptions. With abundant related labeled data available as auxiliary information, we consider transferring the knowledge from the labeled source data to facilitate the unsupervised outlier detection on target data set. To fully make use of the source knowledge, the source data and target data are put together for joint clustering and outlier detection using the source data cluster structure as a constraint. To achieve this, the categorical utility function is employed to regularize the partitions of target data to be consistent with source data labels. With an augmented matrix, the problem is completely solved by a K-means - a based method with the rigid mathematical formulation and theoretical convergence guarantee. We have used four real-world data sets and eight outlier detection methods of different kinds for extensive experiments and comparison. The results demonstrate the effectiveness and significant improvements of the proposed methods in terms of outlier detection and cluster validity metrics. Moreover, the parameter analysis is provided as a practical guide, and noisy source label analysis proves that the proposed method can handle real applications where source labels can be noisy. | en_US |
dc.eprint.version | Final published version | en_US |
dc.identifier.citation | Yu, W., Ding, Z., Hu, C., & Liu, H. (2019). Knowledge Reused Outlier Detection. IEEE Access, 7, 43763–43772. https://doi.org/10.1109/ACCESS.2019.2906644 | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/23546 | |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | 10.1109/ACCESS.2019.2906644 | en_US |
dc.relation.journal | IEEE Access | en_US |
dc.rights | Publisher Policy | en_US |
dc.source | Publisher | en_US |
dc.subject | outlier detection | en_US |
dc.subject | transfer learning | en_US |
dc.subject | K-means | en_US |
dc.title | Knowledge Reused Outlier Detection | en_US |
dc.type | Article | en_US |