Please use this identifier to cite or link to this record: https://hdl.handle.net/1822/40553

Title: Automatic distinction of Fernando Pessoas’ heteronyms
Author(s): Teixeira, João F.; Couto, Marco
Keywords: Authorship Classification; Machine Learning; SVM; Text Mining
Date: 2015
Publisher: Springer Verlag
Journal: Lecture Notes in Computer Science
Citation: Teixeira, J. F., & Couto, M. (2015) Automatic distinction of Fernando Pessoas’ heteronyms. Vol. 9273. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 783-788).
Abstract: Text Mining has opened a vast array of possibilities concerning automatic information retrieval from large amounts of text documents. A variety of themes and types of documents can be easily analyzed. More complex features such as those used in Forensic Linguistics can gather deeper understanding from the documents, making possible performing difficult tasks such as author identification. In this work we explore the capabilities of simpler Text Mining approaches to author identification of unstructured documents, in particular the ability to distinguish poetic works from two of Fernando Pessoas' heteronyms: Alvaro de Campos and Ricardo Reis. Several processing options were tested and accuracies of 97% were reached, which encourage further developments.
Type: Conference paper
URI: https://hdl.handle.net/1822/40553
ISBN: 978-3-319-23485-4; 978-3-319-23484-7
DOI: 10.1007/978-3-319-23485-4_78
ISSN: 0302-9743
Publisher version: http://link.springer.com/chapter/10.1007/978-3-319-23485-4_78
Access: Restricted access (UMinho)
Appears in collections: HASLab - Artigos em atas de conferências internacionais (texto completo)

Files in this record:
File: 3049.pdf (restricted access)
Size: 178.93 kB
Format: Adobe PDF
