Treebank querying with GrETEL 3 : bigger, faster, stronger
- Author
- Liesbeth Augustinus, Bram Vanroy (UGent) and Vincent Vandeghinste
- Abstract
- We describe the new version of GrETEL (http://gretel.ccl.kuleuven.be/gretel3), an online tool which allows users to query treebanks by means of a natural language example (example-based search) or via a formal query (XPath search). The new release comprises an update to the interface and considerable improvements in the back-end search mechanism. The update of the front-end is based on user suggestions. In addition to an overall design update, major changes include a more intuitive query builder in the example-based search mode and a visualizer for syntax trees that is compatible with all modern browsers. Moreover, the results are presented to the user as soon as they are found, so users can browse the matching sentences before the treebank search is completed. We will demonstrate that those changes considerably improve the query procedure. The update of the back-end mainly includes optimizing the search algorithm for querying the (very) large SoNaR treebank. Querying this 500-million word treebank was already made possible in the previous version of GrETEL, but due to the complex search mechanism this often resulted in long query times or even a timeout before the search completed. The improved version of the search algorithm results in faster query times and more accurate search results, which greatly enhances the usability of the SoNaR treebank for linguistic research.
- Keywords
- user interface, ux, design, user experience, nlp, natural language processing, treebank querying, corpus linguistics, search tool, lt3
Downloads
- Treebank querying wth GrETEL.pdf
- full text | open access | 243.16 KB
- GrETEL3-poster-clin.pdf
- full text | open access | 3.66 MB
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-8537246
- MLA
- Augustinus, Liesbeth, et al. “Treebank Querying with GrETEL 3 : Bigger, Faster, Stronger.” Computational Linguistics in the Netherlands: Abstracts, edited by Vincent Vandeghinste and Frank Van Eynde, 2017.
- APA
- Augustinus, L., Vanroy, B., & Vandeghinste, V. (2017). Treebank querying with GrETEL 3 : bigger, faster, stronger. In V. Vandeghinste & F. Van Eynde (Eds.), Computational Linguistics in the Netherlands: Abstracts.
- Chicago author-date
- Augustinus, Liesbeth, Bram Vanroy, and Vincent Vandeghinste. 2017. “Treebank Querying with GrETEL 3 : Bigger, Faster, Stronger.” In Computational Linguistics in the Netherlands: Abstracts, edited by Vincent Vandeghinste and Frank Van Eynde.
- Chicago author-date (all authors)
- Augustinus, Liesbeth, Bram Vanroy, and Vincent Vandeghinste. 2017. “Treebank Querying with GrETEL 3 : Bigger, Faster, Stronger.” In Computational Linguistics in the Netherlands: Abstracts, edited by Vincent Vandeghinste and Frank Van Eynde.
- Vancouver
- 1. Augustinus L, Vanroy B, Vandeghinste V. Treebank querying with GrETEL 3 : bigger, faster, stronger. In: Vandeghinste V, Van Eynde F, editors. Computational Linguistics in the Netherlands: Abstracts. 2017.
- IEEE
- [1] L. Augustinus, B. Vanroy, and V. Vandeghinste, “Treebank querying with GrETEL 3 : bigger, faster, stronger,” in Computational Linguistics in the Netherlands: Abstracts, Leuven, Belgium, 2017.
@inproceedings{8537246,
  abstract     = {{We describe the new version of GrETEL (http://gretel.ccl.kuleuven.be/gretel3), an online tool which allows users to query treebanks by means of a natural language example (example-based search) or via a formal query (XPath search). The new release comprises an update to the interface and considerable improvements in the back-end search mechanism. The update of the front-end is based on user suggestions. In addition to an overall design update, major changes include a more intuitive query builder in the example-based search mode and a visualizer for syntax trees that is compatible with all modern browsers. Moreover, the results are presented to the user as soon as they are found, so users can browse the matching sentences before the treebank search is completed. We will demonstrate that those changes considerably improve the query procedure. The update of the back-end mainly includes optimizing the search algorithm for querying the (very) large SoNaR treebank. Querying this 500-million word treebank was already made possible in the previous version of GrETEL, but due to the complex search mechanism this often resulted in long query times or even a timeout before the search completed. The improved version of the search algorithm results in faster query times and more accurate search results, which greatly enhances the usability of the SoNaR treebank for linguistic research.}},
  author       = {{Augustinus, Liesbeth and Vanroy, Bram and Vandeghinste, Vincent}},
  booktitle    = {{Computational Linguistics in the Netherlands: Abstracts}},
  editor       = {{Vandeghinste, Vincent and Van Eynde, Frank}},
  keywords     = {{user interface,ux,design,user experience,nlp,natural language processing,treebank querying,corpus linguistics,search tool,lt3}},
  language     = {{eng}},
  location     = {{Leuven, Belgium}},
  title        = {{Treebank querying with GrETEL 3 : bigger, faster, stronger}},
  year         = {{2017}},
}