Comparison of Deep Learning based Concept Representations for Biomedical Document Clustering

dc.contributor.authorShah, Setu
dc.contributor.authorLuo, Xiao
dc.contributor.departmentComputer and Information Science, School of Scienceen_US
dc.date.accessioned2019-03-28T17:16:06Z
dc.date.available2019-03-28T17:16:06Z
dc.date.issued2018
dc.description.abstractIn this research, document representations based on distributed representations of the concepts along with new weighting schemes for the documents are explored. The baseline weighting scheme is the traditional Term Frequency-Inverse Document Frequency (TF-IDF) of the concepts, whereas, the other two newly proposed ones consider both local content using the TF-IDF and associations between concepts. The distributed representations of the concepts are measured using a deep learning algorithm. The evaluation of the proposed document representations is based on the k-means clustering results. The results show that document representation based on TF-IDF in combination with the term based distributed representations for concepts outperforms the other two based on the returned evaluation metrics - F1-measure (80.21%) and Purity (77.1%).en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationShah, S., & Luo, X. (2018). Comparison of deep learning based concept representations for biomedical document clustering. In 2018 IEEE EMBS International Conference on Biomedical Health Informatics (BHI) (pp. 349–352). https://doi.org/10.1109/BHI.2018.8333440en_US
dc.identifier.urihttps://hdl.handle.net/1805/18701
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.isversionof10.1109/BHI.2018.8333440en_US
dc.relation.journal2018 IEEE EMBS International Conference on Biomedical Health Informaticsen_US
dc.rightsPublisher Policyen_US
dc.sourceAuthoren_US
dc.subjectbiomedical measurementen_US
dc.subjectdiabetesen_US
dc.subjectcanceren_US
dc.titleComparison of Deep Learning based Concept Representations for Biomedical Document Clusteringen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Shah_2018_comparison.pdf
Size:
273.1 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: