NB!I have moved to the Dept of Swedish in Gothenburg. This site will be up until I've transferred all its content/functions to Sweden.
Homepage von|of Gerlof Bouma
I research and teach at the Linguistics Department of the University of Potsdam. Research-interest-wise, I hover somewhere between linguistics and computational linguistics and I have definitely, definitely been bitten by the corpus bug. Past work includes various aspects of word order variation (in Dutch) and the interpretation of pronominal anaphora (in Dutch and German). I have also been part of the SFB 632 D4 project, led by Prof. Jonas Kuhn.
Kontakt|Contact
Anschrift|Postal address:
Dr. G.J. Bouma
Write to me in Gothenburg!
Besuchsadresse|Visiting address:
Come look me up in Gothenburg!
Dissertation
I received my doctoral degree from the University of Groningen in 2008. In my dissertation research, I looked at preferences in argument fronting in spoken Dutch. The research was carried out under the supervision of Profs. Petra Hendriks, Jack Hoeksema and Henriëtte de Swart. A pdf of my dissertation Starting a Sentence in Dutch: A corpus study of subject- and object-fronting can be found here in two versions:
- for on-screen reading, there is a version with bookmarks;
- for printing, there is a 2-up version with the page spreads intact.
Lehre|Teaching
Wintersemester 2010/11:
- Lexikalisch-Funktionale Grammatik (Moodle site)
- Grammatikformalismen in der Computerlinguistik (Moodle site)
Sommersemester 2010:
- Computerlinguistische Techniken (Moodle site)
Wintersemester 2009/10:
- Lexikalisch-Funktionale Grammatik (Moodle site)
Sommersemester 2009:
- Computerlinguistische Techniken (with Jonas Kuhn, Moodle site)
Wintersemester 2008/9:
- Grammatikformalismen in der Computerlinguistik (Moodle site)
Sommersemester 2008:
- Einführung in die Korpuslinguistik (with Kathrin Spreyer)
- Competition-based and quantitative models of grammar
- Computerlinguistisches Kolloquium (with Jonas Kuhn)
Publikationen|Publications
To appear
Bouma, Gerlof. Expected 2011. Production and Comprehension in Context: The case of word order freezing. To appear in Benz & Mattausch (Eds), Bidirectional Optimality Theory, John Benjamins. [Draft pdf] See also Ch. 5 of my dissertation.
Bouma, Gerlof & Petra Hendriks. Expected 2012. Partial Word Order Freezing in Dutch. Journal of Logic, Language and Information. [doi]
2010
Bouma, Gerlof. 2010. Collocation Extraction beyond the Independence Assumption. In: Proceedings of the ACL 2010 Conference Short Papers, pp109–114, Uppsala. [pdf in ACL anthology]
Bouma, Gerlof. 2010. Syntactic tree queries in Prolog. In: Proceedings of the Fourth Linguistic Annotation Workshop, ACL 2010, pp212–216, Uppsala. [pdf in ACL anthology]
Bouma, Gerlof. 2010. Querying Linguistic Corpora with Prolog. In Pinkal, Rehbein, Schulte im Walde & Storrer (eds), Semantic Approaches in Natural Language Processing: Proceedings of the Conference on Natural Language Processing 2010 (KONVENS 2010), Saarbrücken, Universaar. [Final draft with a slightly spacier layout.]
Bouma, Gerlof, Lilja Øvrelid & Jonas Kuhn. 2010. Towards a Large Parallel Corpus of Cleft Constructions. In: Proceedings of LREC 2010, pp3585–3592. Malta. [abstract & link to pdf]
2009
Bouma, Gerlof & Jonas Kuhn. 2009. On the split nature of the Dutch laten-causative. In: Butt & King (eds), The Proceedings of the LFG'09 Conference, pp167–187, CSLI.
Bouma, Gerlof. 2009. Normalized (Pointwise) Mutual Information in Collocation Extraction. In: Chiarcos, Eckart de Castilho & Stede (eds), Von der Form zur Bedeutung: Texte automatisch verarbeiten / From Form to Meaning: Processing Texts Automatically, Proceedings of the Biennial GSCL Conference 2009, pp31–40, Tübingen, Gunter Narr Verlag. [prefinal draft]
2008
Bouma, Gerlof, Jonas Kuhn, Bettina Schrader & Kathrin Spreyer. 2008. Parallel LFG Grammars on Parallel Corpora: A base for practical triangulation. In: Butt & King (eds), The Proceedings of the LFG'08 Conference, pp169–189, CSLI.
Bouma, Gerlof & Helen de Hoop. 2008. Unscrambled pronouns in Dutch. Linguistic Inquiry 39, pp669–677. [doi]
Bouma, Gerlof. 2008. Starting a sentence in Dutch: A corpus study of subject- and object-fronting. Groningen Dissertations in Linguistics 66, Center for Language and Cognition, University of Groningen.
Bouma, Gerlof. 2008. Gathering corpus evidence of word order freezing in Dutch. In: Pre-proceedings of the International Conference on Linguistic Evidence: Empirical, theoretical and computational perspectives, pp23–27.
2007
Bouma, Gerlof, Irene Krämer & Joost Zwarts (eds). 2007. Cognitive Foundations of Interpretation, Amsterdam: KNAW/Edita.
Bouma, Gerlof, Petra Hendriks, Helen de Hoop, Irene Krämer, Henriëtte de Swart & Joost Zwarts. 2007. Conflicts in Interpretation. In: Bouma, Krämer & Zwarts (eds), Cognitive Foundations of Interpretation, Amsterdam: KNAW/Edita.
Bouma, Gerlof and Holger Hopp. 2007. Coreference preferences for personal pronouns in German. In: Bittner & Gagarina (eds), Intersentential Pronominal Reference in Child and Adult Language, ZASPIL 48, pp 53–75, Berlin: Zentrum für Allgmeine Sprachwissenschaft.
2006
Bouma, Gerlof. 2006. Production and Comprehension in Context: The case of word order freezing. In: Sidner, Harpur, Benz & Kühnlein (eds): Proceedings of the Workshop on Constraints in Discourse 2006, pp35–43.
Bouma, Gerlof and Holger Hopp. 2006. Effects of word order and grammatical function on pronoun resolution in German. In: Artstein & Poesio (eds), Proceedings of the Ambiguity in Anaphora Workshop, pp5–12.
2005 and before
Van der Beek, Leonoor and Gerlof Bouma. 2004. The role of the lexicon in optimality theoretic syntax. In: Butt & King (eds), The Proceedings of the LFG'04 Conference, pp167–187, CSLI.
Bouma, Gerlof. 2003. Doing Dutch pronouns automatically in Optimality Theory. In: Proceedings of the EACL 2003 Workshop on The Computational Treatment of Anaphora, Budapest.
Code & Data
- QPL: Prolog code and conversion scripts for querying linguistic corpora with Prolog.
- We've retokenized and aligned some of the languages of Europarl v3. This is available for download. The code used for the retokenization etc is also in a seperate archive. The base corpus Europarl v3 is available from statmt.org.