Open Set Authorship Attribution Toward Demystifying Victorian Periodicals
dc.contributor.author | Badirli, Sarkhan | |
dc.contributor.author | Borgo Ton, Mary | |
dc.contributor.author | Gungor, Abdulmecit | |
dc.contributor.author | Dundar, Murat | |
dc.contributor.department | Computer and Information Science, School of Science | en_US |
dc.date.accessioned | 2023-04-26T17:16:57Z | |
dc.date.available | 2023-04-26T17:16:57Z | |
dc.date.issued | 2021-09 | |
dc.description.abstract | Existing research in computational authorship attribution (AA) has primarily focused on attribution tasks with a limited number of authors in a closed-set configuration. This restricted set-up is far from being realistic in dealing with highly entangled real-world AA tasks that involve a large number of candidate authors for attribution during test time. In this paper, we study AA in historical texts using a new data set compiled from the Victorian literature. We investigate the predictive capacity of most common English words in distinguishing writings of most prominent Victorian novelists. We challenged the closed-set classification assumption and discussed the limitations of standard machine learning techniques in dealing with the open set AA task. Our experiments suggest that a linear classifier can achieve near perfect attribution accuracy under closed set assumption yet, the need for more robust approaches becomes evident once a large candidate pool has to be considered in the open-set classification setting. | en_US |
dc.eprint.version | Author's manuscript | en_US |
dc.identifier.citation | Badirli, S., Borgo Ton, M., Gungor, A., & Dundar, M. (2021). Open Set Authorship Attribution Toward Demystifying Victorian Periodicals. In J. Lladós, D. Lopresti, & S. Uchida (Eds.), Document Analysis and Recognition – ICDAR 2021 (Vol. 12824, pp. 221–235). Springer International Publishing. https://doi.org/10.1007/978-3-030-86337-1_15 | en_US |
dc.identifier.issn | 978-3-030-86336-4 978-3-030-86337-1 | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/32619 | |
dc.language.iso | en_US | en_US |
dc.publisher | Springer | en_US |
dc.relation.isversionof | 10.1007/978-3-030-86337-1_15 | en_US |
dc.relation.journal | Document Analysis and Recognition – ICDAR 2021 | en_US |
dc.rights | Publisher Policy | en_US |
dc.source | ArXiv | en_US |
dc.subject | Author attribution | en_US |
dc.subject | Open-set classification | en_US |
dc.subject | Victorian literature | en_US |
dc.title | Open Set Authorship Attribution Toward Demystifying Victorian Periodicals | en_US |
dc.type | Conference proceedings | en_US |