ISSN: 2534-5192





ISBN: 978-2-9570549-0-9
e-ISBN: 978-2-9570549-1-6


Investigating Keylogs as Time-Stamped Graphemics
Nicolas Ballier, Erin Pacquetet & Taylor Arnold

Abstract. This article investigates keystroke data in an attempt to connect the graphemic microlevel with the macrolevel of text structures. Analyzing the time stamps of keylogs, we suggest a hierarchy of constituents inspired by speech data and focus on how graphemic, phonological, and textual structure interact over time. We present the prototype of an R package designed to analyze keylog capture data, taking into account graphemic structures, syllable counts, and parsing. The package, still under development, offers functions for analyzing the various levels of graphemic constituents produced by typists, from syllable counts to n-gram analysis.

DOI: https://doi.org/10.36824/2018-graf-ball


Bellis, Kouroch (2017). La disposition Cœur 2.0 (ÉWOPY) comme disposition de clavier bureautique français: Réponse à l’enquête publique de l’AFNOR pour une norme PR NF Z71-300. https://hal.archives-ouvertes.fr/hal-01558613/document.

Bergadano, Francesco, Daniele Gunetti, and Claudia Picardi (2002). “User Authentication through Keystroke Dynamics”. In: ACM Transactions on Information and System Security (TISSEC) 5.4, pp. 367–397.

Charoenchaikorn, Vararin (2019). “L2 Revision and Post-task Anticipation during Text-Based Synchronous Computer-Mediated Communication (SCMC) Tasks”. PhD Thesis. Lancaster University.

Chukharev-Khudilaynen, Evgeny (2014). “Pauses in Spontaneous Written Communication: A Keystroke Logging Study”. In: Journal of Writing Research 6.1, pp. 61–84.

Cislaru, Georgeta and Thierry Olive (2016). “Les automatismes du scripteur: Jets textuels spontanés dans le processus de production écrite, le cas des constructions coordinatives”. In: SHS Web of Conferences. Vol. 27. EDP Sciences, p. 06003.

_____________ (2017). “Segments répétés, jets textuels et autres routines. Quel niveau de pré-construction?” In: Corpus 17, pp. 1–21.

_____________ (2018). Le processus de textualisation: analyse des unités linguistiques de performance écrite. Louvain-la-Neuve, Paris: De Boeck Supérieur.

Díaz-Negrillo, Ana, Marcus Callies, and Cristóbal Lozano (2018). “Designing and Compiling a Learner Corpus of Written and Spoken Narratives: The Corpus of English as a Foreign Language? (COREFL)”. In: ARISLA workshop (Anaphora Resolution in Second Language Acquisition). University of Granada.

Evertz, Martin (in this volume). “The History of the Graphematic Foot in English and German”.

Giot, Romain, Mohamad El-Abed, and Christophe Rosenberger (2009). “Greyc Keystroke: A Benchmark for Keystroke Dynamics Biometric Systems”. In: 2009 IEEE 3rd International Conference on Biometrics: Theory, Applications, and Systems (BTAS). Washington, DC, pp. 1–6.

Giot, Romain et al. (2012). “Analysis of the Acquisition Process for Keystroke Dynamics”. In: Proceedings of the International Conference of the Biometrics Special Interest Group (BIOSIG). Darmstadt: IEEE, pp. 1–6.

Kang, Pilsung and Sungzoon Cho (2015). “Keystroke Dynamics-Based User Authentication Using Long and Free Text Strings from Various Input Devices”. In: Information Sciences 308, pp. 72–93.

Killourhy, Kevin S. and Roy A. Maxion (2009). “Keystroke Dynamics—Benchmark Data Set”. Carnegie Mellon University, http://www.cs.cmu.edu/~keystroke.

Leijten, Mariëlle and Luuk Van Waes (2013). “Keystroke Logging in Writing Research: Using Inputlog to Analyze and Visualize Writing Processes”. In: Written Communication 30.3, pp. 358–392.

Lozano, C., A. Díaz-Negrillo, and M. Callies (to appear). “Designing and Compiling a Learner Corpus of Written and Spoken Narratives: COREFL”. In: What’s in a Narrative? Variation in Story-Telling at the Interface between Language and Literacy. Ed. by Christiane Bongartz and Jacopo Torregrossa.

Mahlow, Cerstin (2015). “Learning from Errors: Systematic Analysis of Complex Writing Errors for Improving Writing Technology”. In: Text, Speech and Language Technology. Vol. 48: Language Production, Cognition, and the Lexicon. Springer, pp. 419–438.

Malekian, Donia et al. (2019). “Characterising Students Writing Processes Using Temporal Keystroke Analysis”. In: The 12th International Conference on Educational Data Mining. Ed. by Michel Desmarais et al. Vol. 27. Montréal, pp. 354–359.

Monaco, John V. et al. (2013). “Behavioral Biometric Verification of Student Identity in Online Course Assessment and Authentication of Authors in Literary Works”. In: 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS). Washington, DC, pp. 1–8.

Nespor, Marina and Irene Vogel (2007). Prosodic Phonology. Vol. 28. de Gruyter.

Plank, Barbara (2016). “Keystroke Dynamics as Signal for Shallow Syntactic Parsing”. arXiv:1610.03321.

Stewart, John C. et al. (2011). “An Investigation of Keystroke and Stylometry Traits for Authenticating Online Test Takers”. In: 2011 International Joint Conference on Biometrics (IJCB). Washington, DC, pp. 1–7.

Tappert, Charles C. et al. (2012). “A Keystroke Biometric System for Long-Text Input”. In: Optimizing Information Security and Advancing Privacy Assurance: New Technologies. Hershey, PA: IGI Global, pp. 32–57.

Van Waes, Luuk, Mariëlle Leijten, and Christine Neuwirth (2006). Writing and Digital Media. Leuven: Brill.

Weingarten, Rüdiger, Guido Nottbusch, and Udo Will (2004). “Morphemes, Syllables, and Graphemes in Written Word Production”. In: Trends in linguistics studies and monographs 157, pp. 529–572.

Zhang, Mo et al. (2019). “Identifying and Comparing Writing Process Patterns Using Keystroke Logs”. In: Springer Proceedings in Mathematics & Statistics. Vol. 265: Quantitative Psychology. Springer, pp. 367–381.

@UNPUBLISHED{evertz,
   AUTHOR = {Evertz, Martin},
   TITLE = {{The History of the Graphematic Foot in English and German}},
   YEAR = {in this volume},
}

@THESIS{Charoenchaikorn,
   AUTHOR = {Charoenchaikorn, Vararin},
   TITLE = {{L2 Revision and Post-task Anticipation during Text-Based Synchronous 
   Computer-Mediated Communication (SCMC) Tasks}},
   SCHOOL = {Lancaster University},
   TYPE = {PhD Thesis},
   YEAR = {2019},
}

@INPROCEEDINGS{Malekian2019,
   AUTHOR = {Malekian, Donia and Bailey, James and Kennedy, Gregor and de Barba, Paula and Nawaz, Sadia},
   EDITOR = {Desmarais, Michel and Lynch, Collin and Merceron, Agathe and Nkambou, Roger},
   TITLE = {{Characterising Students Writing Processes Using Temporal Keystroke Analysis}},
   BOOKTITLE = {{The 12th International Conference on Educational Data Mining}},
   ADDRESS = {Montréal},
   YEAR = {2019},
   VOLUME = {27},
   PAGES = {354--359},
}

@INCOLLECTION{Zhang2019,
   AUTHOR = {Mo Zhang and Mengxiao Zhu and Paul Deane and Hongwen Guo},
   TITLE = {{Identifying and Comparing Writing Process Patterns Using Keystroke Logs}},
   BOOKTITLE = {{Quantitative Psychology}},
   MAINTITLE = {{Springer Proceedings in Mathematics \& Statistics}},
   PUBLISHER = {Springer},
   YEAR = {2019},
   VOLUME = {265},
   PAGES = {367--381},
}

@REPORT{bellis2017disposition,
   AUTHOR = {Bellis, Kouroch},
   TITLE = {{La disposition Cœur 2.0 (ÉWOPY) comme disposition de clavier bureautique 
   français: Réponse à l'enquête publique de l'AFNOR pour une norme PR NF Z71-300}},
   PUBLISHER = {AFNOR},
   YEAR = {2017},
   NOTE = {\url{https://hal.archives-ouvertes.fr/hal-01558613/document}},
}

@ARTICLE{bergadano_user_2002,
   AUTHOR = {Bergadano, Francesco and Gunetti, Daniele and Picardi, Claudia},
   TITLE = {{User Authentication through Keystroke Dynamics}},
   JOURNAL = {ACM Transactions on Information and System Security (TISSEC)},
   YEAR = {2002},
   VOLUME = {5},
   NUMBER = {4},
   PAGES = {367--397},
}

@ARTICLE{chukharev2014pauses,
   AUTHOR = {Chukharev-Khudilaynen, Evgeny},
   TITLE = {{Pauses in Spontaneous Written Communication: A Keystroke Logging Study}},
   JOURNAL = {Journal of Writing Research},
   YEAR = {2014},
   VOLUME = {6},
   NUMBER = {1},
   PAGES = {61--84},
}

@INPROCEEDINGS{cislaru2016automatismes,
   AUTHOR = {Cislaru, Georgeta and Olive, Thierry},
   TITLE = {{Les automatismes du scripteur: Jets textuels spontanés dans le processus 
   de production écrite, le cas des constructions coordinatives}},
   BOOKTITLE = {{SHS Web of Conferences}},
   PUBLISHER = {EDP Sciences},
   YEAR = {2016},
   VOLUME = {27},
   PAGES = {06003},
}

@ARTICLE{cislaru2017segments,
   AUTHOR = {Cislaru, Georgeta and Olive, Thierry},
   TITLE = {{Segments répétés, jets textuels et autres routines. Quel niveau de pré-construction?}},
   JOURNAL = {Corpus},
   PUBLISHER = {Bases, corpus et langage-UMR 6039},
   YEAR = {2017},
   VOLUME = {17},
   PAGES = {1--21},
}

@BOOK{cislaru2018processus,
   AUTHOR = {Cislaru, Georgeta and Olive, Thierry},
   TITLE = {{Le processus de textualisation: analyse des unités linguistiques de performance écrite}},
   PUBLISHER = {De Boeck Supérieur},
   ADDRESS = {Louvain-la-Neuve, Paris},
   YEAR = {2018},
}

@INPROCEEDINGS{diazcorefl,
   AUTHOR = {Díaz-Negrillo, Ana and Callies, Marcus and Lozano, Cristóbal},
   TITLE = {{Designing and Compiling a Learner Corpus of Written and Spoken Narratives: 
   The Corpus of English as a Foreign Language? (COREFL)}},
   BOOKTITLE = {{ARISLA workshop (Anaphora Resolution in Second Language Acquisition)}},
   ADDRESS = {University of Granada},
   YEAR = {2018},
}

@INPROCEEDINGS{giot2009greyc,
   AUTHOR = {Giot, Romain and El-Abed, Mohamad and Rosenberger, Christophe},
   TITLE = {{Greyc Keystroke: A Benchmark for Keystroke Dynamics Biometric Systems}},
   BOOKTITLE = {{2009 IEEE 3rd International Conference on Biometrics: Theory, Applications, 
   and Systems (BTAS)}},
   ADDRESS = {Washington, DC},
   YEAR = {2009},
   PAGES = {1--6},
}

@ARTICLE{kang2015keystroke,
   AUTHOR = {Kang, Pilsung and Cho, Sungzoon},
   TITLE = {{Keystroke Dynamics-Based User Authentication Using Long and Free Text 
   Strings from Various Input Devices}},
   JOURNAL = {Information Sciences},
   PUBLISHER = {Elsevier},
   YEAR = {2015},
   VOLUME = {308},
   PAGES = {72--93},
}

@UNPUBLISHED{killourhy2009keystroke,
   AUTHOR = {Killourhy, Kevin S. and Maxion, Roy A.},
   TITLE = {{Keystroke Dynamics---Benchmark Data Set}},
   YEAR = {2009},
   NOTE = {Carnegie Mellon University, \url{http://www.cs.cmu.edu/~keystroke}},
}

@ARTICLE{leijten2013keystroke,
   AUTHOR = {Leijten, Mariëlle and Van Waes, Luuk},
   TITLE = {{Keystroke Logging in Writing Research: Using Inputlog to Analyze and 
   Visualize Writing Processes}},
   JOURNAL = {Written Communication},
   PUBLISHER = {SAGE Publications},
   YEAR = {2013},
   VOLUME = {30},
   NUMBER = {3},
   PAGES = {358--392},
}

@INPROCEEDINGS{lozanodesigning,
   AUTHOR = {Lozano, C. and Díaz-Negrillo, A. and Callies, M.},
   EDITOR = {Bongartz, Christiane and Torregrossa, Jacopo},
   TITLE = {{Designing and Compiling a Learner Corpus of Written and Spoken Narratives: 
   COREFL}},
   BOOKTITLE = {{What's in a Narrative? Variation in Story-Telling at the Interface between 
   Language and Literacy}},
   YEAR = {to appear},
}

@INPROCEEDINGS{mahlow_learning_2015,
   AUTHOR = {Mahlow, Cerstin},
   TITLE = {{Learning from Errors: Systematic Analysis of Complex Writing Errors for 
   Improving Writing Technology}},
   BOOKTITLE = {{Language Production, Cognition, and the Lexicon}},
   MAINTITLE = {{Text, Speech and Language Technology}},
   PUBLISHER = {Springer},
   YEAR = {2015},
   VOLUME = {48},
   PAGES = {419--438},
}

@INPROCEEDINGS{monaco2013behavioral,
   AUTHOR = {Monaco, John V. and Stewart, John C. and Cha, Sung-Hyuk and Tappert, Charles C.},
   TITLE = {{Behavioral Biometric Verification of Student Identity in Online Course 
   Assessment and Authentication of Authors in Literary Works}},
   BOOKTITLE = {{2013 IEEE Sixth International Conference on Biometrics: Theory, 
   Applications and Systems (BTAS)}},
   ADDRESS = {Washington, DC},
   YEAR = {2013},
   PAGES = {1--8},
}

@BOOK{nespor_prosodic_2007,
   AUTHOR = {Nespor, Marina and Vogel, Irene},
   TITLE = {{Prosodic Phonology}},
   PUBLISHER = {de Gruyter},
   YEAR = {2007},
   VOLUME = {28},
}

@UNPUBLISHED{plank2016keystroke,
   AUTHOR = {Plank, Barbara},
   TITLE = {{Keystroke Dynamics as Signal for Shallow Syntactic Parsing}},
   YEAR = {2016},
   NOTE = {arXiv:1610.03321},
}

@INPROCEEDINGS{stewart2011investigation,
   AUTHOR = {Stewart, John C. and Monaco, John V. and Cha, Sung-Hyuk and Tappert, Charles C.},
   TITLE = {{An Investigation of Keystroke and Stylometry Traits for Authenticating 
   Online Test Takers}},
   BOOKTITLE = {{2011 International Joint Conference on Biometrics (IJCB)}},
   ADDRESS = {Washington, DC},
   YEAR = {2011},
   PAGES = {1--7},
}

@INPROCEEDINGS{tappert_keystroke_2012,
   AUTHOR = {Tappert, Charles C. and Cha, Sung-Hyuk and Villani, Mary and Zack, Robert S.},
   TITLE = {{A Keystroke Biometric System for Long-Text Input}},
   BOOKTITLE = {{Optimizing Information Security and Advancing Privacy Assurance: 
   New Technologies}},
   PUBLISHER = {IGI Global},
   ADDRESS = {Hershey, PA},
   YEAR = {2012},
   PAGES = {32--57},
}

@BOOK{vanWaes2006writing,
   AUTHOR = {Van Waes, Luuk and Leijten, Mariëlle and Neuwirth, Christine},
   TITLE = {{Writing and Digital Media}},
   PUBLISHER = {Brill},
   ADDRESS = {Leuven},
   YEAR = {2006},
}

@ARTICLE{weingarten2004,
   AUTHOR = {Weingarten, Rüdiger and Nottbusch, Guido and Will, Udo},
   TITLE = {{Morphemes, Syllables, and Graphemes in Written Word Production}},
   JOURNAL = {Trends in linguistics studies and monographs},
   YEAR = {2004},
   VOLUME = {157},
   PAGES = {529--572},
}

@INPROCEEDINGS{giot_analysis_2012,
   AUTHOR = {Giot, Romain and Ninassi, Alexandre and El-Abed, Mohamad and 
   Rosenberger, Christophe},
   TITLE = {{Analysis of the Acquisition Process for Keystroke Dynamics}},
   BOOKTITLE = {{Proceedings of the International Conference of the Biometrics 
   Special Interest Group (BIOSIG)}},
   PUBLISHER = {IEEE},
   ADDRESS = {Darmstadt},
   YEAR = {2012},
   PAGES = {1--6},
}

Nicolas Ballier, Erin Pacquetet & Taylor Arnold (2019), Investigating Keylogs as Time-Stamped Graphemics, in Proceedings of Graphemics in the 21st Century, Brest 2018 (Yannis Haralambous, Ed.), Brest: Fluxus Editions, 353–365

@INPROCEEDINGS{gla1-ball,
   AUTHOR = {Ballier, Nicolas and Pacquetet, Erin and Arnold, Taylor},
   EDITOR = {Haralambous, Yannis},
   TITLE = {{Investigating Keylogs as Time-Stamped Graphemics}},
   BOOKTITLE = {{Proceedings of Graphemics in the 21st Century, Brest 2018}},
   PUBLISHER = {Fluxus Editions},
   ADDRESS = {Brest},
   YEAR = {2019},
   PAGES = {353--365},
   DOI = {10.36824/2018-graf-ball},
}