Ghent University Academic Bibliography

Advanced

From keystrokes to annotated process data: enriching the output of Inputlog with linguistic information

Lieve Macken UGent, Veronique Hoste UGent, Mariëlle Leijten and Luuk Van Waes (2012) LREC 2012 : eight international conference on language resources and evaluation. p.2224-2229
abstract
Keystroke logging tools are a valuable aid to monitor written language production. These tools record all keystrokes, including backspaces and deletions together with timing information. In this paper we report on an extension to the keystroke logging program Inputlog in which we aggregate the logged process data from the keystroke (character) level to the word level. The logged process data are further enriched with different kinds of linguistic information: part-of-speech tags, lemmata, chunk boundaries, syllable boundaries and word frequency. A dedicated parser has been developed that distils from the logged process data word-level revisions, deleted fragments and final product data. The linguistically-annotated output will facilitate the linguistic analysis of the logged data and will provide a valuable basis for more linguistically-oriented writing process research. The set-up of the extension to Inputlog is largely language-independent. As proof-of-concept, the extension has been developed for English and Dutch. Inputlog is freely available for research purposes.
Please use this url to cite or link to this publication:
author
organization
year
type
conference
publication status
published
subject
keyword
keystroke logging, linguistic annotation, writing process
in
LREC 2012 : eight international conference on language resources and evaluation
editor
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Jan Odijk and Stelios Piperidis
pages
2224 - 2229
publisher
European Language Resources Association (ELRA)
place of publication
Paris, France
conference name
8th International conference on Language Resources and Evaluation (LREC 2012)
conference location
Istanbul, Turkey
conference start
2012-05-21
conference end
2012-05-27
Web of Science type
Proceedings Paper
Web of Science id
000323927702049
ISBN
9782951740877
language
English
UGent publication?
yes
classification
P1
copyright statement
I have transferred the copyright for this publication to the publisher
id
2128484
handle
http://hdl.handle.net/1854/LU-2128484
alternative location
http://www.lrec-conf.org/proceedings/lrec2012/pdf/161_Paper.pdf
date created
2012-06-01 10:58:58
date last changed
2014-02-05 11:41:42
@inproceedings{2128484,
  abstract     = {Keystroke logging tools are a valuable aid to monitor written language production. These tools record all keystrokes, including backspaces and deletions together with timing information. In this paper we report on an extension to the keystroke logging program Inputlog in which we aggregate the logged process data from the keystroke (character) level to the word level. The logged process data are further enriched with different kinds of linguistic information: part-of-speech tags, lemmata, chunk boundaries, syllable boundaries and word frequency. A dedicated parser has been developed that distils from the logged process data word-level revisions, deleted fragments and final product data. The linguistically-annotated output will facilitate the linguistic analysis of the logged data and will provide a valuable basis for more linguistically-oriented writing process research. The set-up of the extension to Inputlog is largely language-independent. As proof-of-concept, the extension has been developed for English and Dutch. Inputlog is freely available for research purposes.},
  author       = {Macken, Lieve and Hoste, Veronique and Leijten, Mari{\"e}lle and Van Waes, Luuk},
  booktitle    = {LREC 2012 : eight international conference on language resources and evaluation},
  editor       = {Calzolari, Nicoletta and Choukri, Khalid and Declerck, Thierry and U\u{g}ur Do\u{g}an, Mehmet and Maegaard, Bente and Mariani, Joseph and Odijk, Jan and Piperidis, Stelios},
  isbn         = {9782951740877},
  keyword      = {keystroke logging,linguistic annotation,writing process},
  language     = {eng},
  location     = {Istanbul, Turkey},
  pages        = {2224--2229},
  publisher    = {European Language Resources Association (ELRA)},
  title        = {From keystrokes to annotated process data: enriching the output of Inputlog with linguistic information},
  url          = {http://www.lrec-conf.org/proceedings/lrec2012/pdf/161\_Paper.pdf},
  year         = {2012},
}

Chicago
Macken, Lieve, Veronique Hoste, Mariëlle Leijten, and Luuk Van Waes. 2012. “From Keystrokes to Annotated Process Data: Enriching the Output of Inputlog with Linguistic Information.” In LREC 2012 : Eight International Conference on Language Resources and Evaluation, ed. Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Jan Odijk, and Stelios Piperidis, 2224–2229. Paris, France: European Language Resources Association (ELRA).
APA
Macken, L., Hoste, V., Leijten, M., & Van Waes, L. (2012). From keystrokes to annotated process data: enriching the output of Inputlog with linguistic information. In N. Calzolari, K. Choukri, T. Declerck, M. Uğur Doğan, B. Maegaard, J. Mariani, J. Odijk, et al. (Eds.), LREC 2012 : eight international conference on language resources and evaluation (pp. 2224–2229). Presented at the 8th International conference on Language Resources and Evaluation (LREC 2012), Paris, France: European Language Resources Association (ELRA).
Vancouver
1.
Macken L, Hoste V, Leijten M, Van Waes L. From keystrokes to annotated process data: enriching the output of Inputlog with linguistic information. In: Calzolari N, Choukri K, Declerck T, Uğur Doğan M, Maegaard B, Mariani J, et al., editors. LREC 2012 : eight international conference on language resources and evaluation. Paris, France: European Language Resources Association (ELRA); 2012. p. 2224–9.
MLA
Macken, Lieve, Veronique Hoste, Mariëlle Leijten, et al. “From Keystrokes to Annotated Process Data: Enriching the Output of Inputlog with Linguistic Information.” LREC 2012 : Eight International Conference on Language Resources and Evaluation. Ed. Nicoletta Calzolari et al. Paris, France: European Language Resources Association (ELRA), 2012. 2224–2229. Print.