Voyant Tools and R in distant reading literary analysis in Portuguese

Authors

Diego Giménez

DOI:

10.59616/conehd.v1i10.2103

Keywords:

Distant Reading, Voyant Tools, Quanteda, R, Literature

Abstract

This article compares two distant reading tools, using Portuguese literature corpora as an example. Distant reading, as defined by Franco Moretti, is a hermeneutic and methodological framework for the quantitative study of large volumes of text. Voyant Tools is an open-source web platform that enables online textual analysis without any programming. Quanteda (Quantitative Analysis of Textual Data), by contrast, is an open-source R package for data mining, developed for R users who need to apply natural language processing to texts; using it requires basic programming knowledge. By comparing the two tools, the article seeks to support informed choices among textual analysis tools and to provide a foundation for future research and applications in literary analysis. It also problematizes the processes of modeling, analysis, and representation, which are knowledge constructions and, as such, subject to interpretation.

Author Biography

Diego Giménez, Universidade de Coimbra

PhD in Philosophy and Literature from the University of Barcelona, with a thesis on the Book of Disquiet by Fernando Pessoa. He worked as a journalist at LaVanguardia.com and, in 2008, cofounded Revista de Letras. He was a researcher at the Calouste Gulbenkian Foundation and at the Center for Portuguese Literature at the University of Coimbra, where he worked on the Book of Disquiet Digital Archive. He was a post-doctoral fellow at the Universidade Estadual de Londrina (Brazil), where he continued to study Fernando Pessoa and taught "Theory of the Poem" and "Theory of Narrative". He is currently an FCT-POCH post-doctoral fellow at the Centro de Literatura Portuguesa of the Universidade de Coimbra (Portugal), where he teaches "Introduction to Digital Humanities".

References

ALVES, D. As Humanidades Digitais como uma comunidade de práticas dentro do formalismo acadêmico: dos exemplos internacionais ao caso português. Ler História, 69, 2016. Available at: https://doi.org/10.4000/lerhistoria.2496. Accessed: 21 Sep. 2024.

BENOIT, K. et al. Quanteda: An R package for the quantitative analysis of textual data. Journal of Open Source Software, 3(30), p. 774, 2018. Available at: https://doi.org/10.21105/joss.00774. Accessed: 21 Sep. 2024.

CABRAL, M. J. et al. Lire de près, de loin: close vs distant reading. Paris: Garnier, 2014.

GIMÉNEZ, D.; GOMIDE, A. Pesquisa Literária com R: Análise Quantitativa de Dados Textuais, Quanteda tomando como exemplo o Livro do Desassossego. Estudos do Século XX, 22, p. 135-153, 2022. Available at: https://doi.org/10.14195/1647-8622_22_7. Accessed: 27 Jul. 2025.

GIMÉNEZ, D. R na análise literária em português. Rpubs, 2024. Available at: https://hdl.handle.net/10316/116796. Accessed: 27 Jul. 2025.

GOODWIN, J. et al. Reading Graphs, Maps, and Trees: Responses to Franco Moretti. SC: Parlor Press, 2011.

MORETTI, F. Conjectures on World Literature. New Left Review, n. 1, p. 64-68, 2000.

MORETTI, F. Graphs, Maps, Trees: Abstract Models for Literary History. London: Verso Books, 2005.

SANTOS, D. et al. Leitura distante em português: resumo do Primeiro Encontro. MATLIT: Materialidades da Literatura, v. 8, n. 1, p. 279-298, 2020. Available at: https://doi.org/10.14195/2182-8830_8-1_16. Accessed: 21 Sep. 2024.

UNDERWOOD, T. A Genealogy of Distant Reading. DHQ: Digital Humanities Quarterly, v. 11, n. 2, 2017. Available at: http://www.digitalhumanities.org/dhq/vol/11/2/000317/000317.html. Accessed: 21 Sep. 2024.

UNDERWOOD, T. Distant Horizons: Digital Evidence and Literary Change. Chicago: The University of Chicago Press, 2019.

Published

2026-04-22

How to Cite

GIMÉNEZ, Diego. Voyant tools and R in distance reading literary analysis in Portuguese. Convergências: estudos em Humanidades Digitais, [S. l.], v. 1, n. 10, p. 122–145, 2026. DOI: 10.59616/conehd.v1i10.2103. Available at: https://periodicos.ifg.edu.br/cehd/article/view/2103. Accessed: 25 Apr. 2026.
