OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System
dc.contributor.author | Guo, Xiaoyuan | |
dc.contributor.author | Duan, Jiali | |
dc.contributor.author | Purkayastha, Saptarshi | |
dc.contributor.author | Trivedi, Hari | |
dc.contributor.author | Gichoya, Judy Wawira | |
dc.contributor.author | Banerjee, Imon | |
dc.contributor.department | BioHealth Informatics, School of Informatics and Computing | en_US |
dc.date.accessioned | 2022-10-05T21:08:50Z | |
dc.date.available | 2022-10-05T21:08:50Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Improving the retrieval relevance on noisy datasets is an emerging need for the curation of a large-scale clean dataset in the medical domain. While existing methods can be applied for class-wise retrieval (aka. inter-class), they cannot distinguish the granularity of likeness within the same class (aka. intra-class). The problem is exacerbated on medical external datasets, where noisy samples of the same class are treated equally during training. Our goal is to identify both intra/inter-class similarities for fine-grained retrieval. To achieve this, we propose an Outlier-Sensitive Content-based rAdiologhy Retrieval System (OSCARS), consisting of two steps. First, we train an outlier detector on a clean internal dataset in an unsupervised manner. Then we use the trained detector to generate the anomaly scores on the external dataset, whose distribution will be used to bin intra-class variations. Second, we propose a quadruplet (a, p, nintra, ninter) sampling strategy, where intra-class negatives nintra are sampled from bins of the same class other than the bin anchor a belongs to, while niner are randomly sampled from inter-classes. We suggest a weighted metric learning objective to balance the intra and inter-class feature learning. We experimented on two representative public radiography datasets. Experiments show the effectiveness of our approach. | en_US |
dc.eprint.version | Author's manuscript | en_US |
dc.identifier.citation | Guo, X., Duan, J., Purkayastha, S., Trivedi, H., Gichoya, J. W., & Banerjee, I. (2022). OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System. arXiv preprint arXiv:2204.03074. https://doi.org/10.48550/arXiv.2204.03074 | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/30212 | |
dc.language.iso | en | en_US |
dc.publisher | arXiv | en_US |
dc.relation.isversionof | 10.48550/arXiv.2204.03074 | en_US |
dc.relation.journal | arXiv | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | * |
dc.source | Author | en_US |
dc.subject | medical image retrieval | en_US |
dc.subject | deep metric learning | en_US |
dc.subject | outlier detection | en_US |
dc.title | OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System | en_US |
dc.type | Article | en_US |