Open Set Authorship Attribution Toward Demystifying Victorian Periodicals

dc.contributor.authorBadirli, Sarkhan
dc.contributor.authorBorgo Ton, Mary
dc.contributor.authorGungor, Abdulmecit
dc.contributor.authorDundar, Murat
dc.contributor.departmentComputer and Information Science, School of Scienceen_US
dc.date.accessioned2023-04-26T17:16:57Z
dc.date.available2023-04-26T17:16:57Z
dc.date.issued2021-09
dc.description.abstractExisting research in computational authorship attribution (AA) has primarily focused on attribution tasks with a limited number of authors in a closed-set configuration. This restricted set-up is far from being realistic in dealing with highly entangled real-world AA tasks that involve a large number of candidate authors for attribution during test time. In this paper, we study AA in historical texts using a new data set compiled from the Victorian literature. We investigate the predictive capacity of most common English words in distinguishing writings of most prominent Victorian novelists. We challenged the closed-set classification assumption and discussed the limitations of standard machine learning techniques in dealing with the open set AA task. Our experiments suggest that a linear classifier can achieve near perfect attribution accuracy under closed set assumption yet, the need for more robust approaches becomes evident once a large candidate pool has to be considered in the open-set classification setting.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationBadirli, S., Borgo Ton, M., Gungor, A., & Dundar, M. (2021). Open Set Authorship Attribution Toward Demystifying Victorian Periodicals. In J. Lladós, D. Lopresti, & S. Uchida (Eds.), Document Analysis and Recognition – ICDAR 2021 (Vol. 12824, pp. 221–235). Springer International Publishing. https://doi.org/10.1007/978-3-030-86337-1_15en_US
dc.identifier.issn978-3-030-86336-4 978-3-030-86337-1en_US
dc.identifier.urihttps://hdl.handle.net/1805/32619
dc.language.isoen_USen_US
dc.publisherSpringeren_US
dc.relation.isversionof10.1007/978-3-030-86337-1_15en_US
dc.relation.journalDocument Analysis and Recognition – ICDAR 2021en_US
dc.rightsPublisher Policyen_US
dc.sourceArXiven_US
dc.subjectAuthor attributionen_US
dc.subjectOpen-set classificationen_US
dc.subjectVictorian literatureen_US
dc.titleOpen Set Authorship Attribution Toward Demystifying Victorian Periodicalsen_US
dc.typeConference proceedingsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Badirli2021Open-AAM.pdf
Size:
543.92 KB
Format:
Adobe Portable Document Format
Description:
Article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: