Analyzing 90 Slovenian Novels and the Oeuvre of Ivan Cankar Using Computational Stylometry


  • Ivana Zajc



Slovenian novel, Ivan Cankar, distant reading, computational stylometry, digital humanities


The article presents a distant reading of Slovenian literature from the periods of realism and early modernism (moderna in Slovenian). The first part presents the results of a computational stylometric analysis of a corpus of 90 Slovenian novels from the two periods, conducted in the R programming environment with the Stylometry with R package (stylo). The results show that the primary signal differentiating the novels is authorial, with Pavlina Pajk and Ivan Cankar standing out the most. Because Pajk is the most distinct author in the corpus, the article also includes a stylometric investigation of the most characteristic vocabulary of her works. On this basis, it concludes that her writing differs from that of the other authors because of the genre of the sentimental romance novel, which characterizes her work. In the second part, Cankar’s literary oeuvre is analyzed using the same methodology; here the corpus includes not only his novels but also other works from different periods of his writing career. The results reveal the development of Cankar’s literary style in terms of genre and periodization.
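
To illustrate the kind of comparison that underlies such an analysis, the following is a minimal sketch of a most-frequent-word distance between texts. It is not the study's code: the study used the Stylometry with R package, whereas this sketch is written in Python, and the choice of Burrows's Delta as the distance measure is an assumption, since the abstract does not name one. The function names, the whitespace tokenizer, and the toy texts are all hypothetical.

```python
from collections import Counter
from statistics import mean, pstdev


def relative_frequencies(text):
    """Map each word to its share of all tokens (naive whitespace tokenizer)."""
    tokens = text.lower().split()
    counts = Counter(tokens)
    total = len(tokens)
    return {word: count / total for word, count in counts.items()}


def burrows_delta(texts, n_mfw=100):
    """Return {(name_a, name_b): delta} for every pair of texts.

    Assumption: Delta is the mean absolute difference of z-scored relative
    frequencies over the n_mfw most frequent words of the whole corpus.
    The population standard deviation is used here as a simplification.
    """
    freqs = {name: relative_frequencies(text) for name, text in texts.items()}

    # Most frequent words across the whole corpus.
    corpus_counts = Counter()
    for text in texts.values():
        corpus_counts.update(text.lower().split())
    mfw = [word for word, _ in corpus_counts.most_common(n_mfw)]

    # z-score each word's frequency across the texts.
    z_scores = {name: [] for name in texts}
    for word in mfw:
        values = [freqs[name].get(word, 0.0) for name in texts]
        mu, sigma = mean(values), pstdev(values)
        for name in texts:
            value = freqs[name].get(word, 0.0)
            z_scores[name].append(0.0 if sigma == 0 else (value - mu) / sigma)

    # Pairwise Delta: a smaller value means a more similar word-usage profile.
    names = list(texts)
    deltas = {}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            diffs = [abs(x - y) for x, y in zip(z_scores[a], z_scores[b])]
            deltas[(a, b)] = mean(diffs)
    return deltas


if __name__ == "__main__":
    # Toy stand-ins for novels; real input would be full texts.
    toy = {
        "text_a1": "the cat and the dog and the bird",
        "text_a2": "the cat and the dog and the bird",
        "text_b1": "a cat or a dog or a bird",
    }
    for pair, delta in burrows_delta(toy, n_mfw=4).items():
        print(pair, round(delta, 3))
```

A table of such pairwise distances is the usual input to a cluster analysis or network visualization, in which texts with similar profiles group together. Whether those groups follow authorship, genre, or period is then a matter of interpretation.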







Thematic section