Publikationen
Die folgenden Angaben sind zum Teil noch auf Englisch. Ich habe sie nicht übersetzt, da sie für meine heutige Tätigkeit von untergeordneter Bedeutung sind.
Dissertation
Zitation: Matthias Büchse, 2015. Algebraic decoder specification: coupling formal-language theory and statistical machine translation. PhD Thesis. Technische Universität Dresden, Fakultät Informatik.
Abstract
The specification of a decoder, i.e., a program that translates sentences from one natural language into another, is an intricate process, driven by the application and lacking a canonical methodology. The practical nature of decoder development inhibits the transfer of knowledge between theory and application, which is unfortunate because many contemporary decoders are in fact related to formal-language theory. This thesis proposes an algebraic framework where a decoder is specified by an expression built from a fixed set of operations. As yet, this framework accommodates contemporary syntax-based decoders, it spans two levels of abstraction, and, primarily, it encourages mutual stimulation between the theory of weighted tree automata and the application.
Downloads (PDF)
- official download from the German national library (encouraged)
- download from this site (discouraged)
- defense slides (translated into English)
- original defense slides (German)
Peer-reviewed
- MB, Alexander Koller, and Heiko Vogler, 2013. Generic binarization for parsing and translation. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 145–154. (acceptance rate: 26 %)
paper (pdf) • slides (pdf) • bibtex entry - MB, Mark-Jan Nederhof, and Heiko Vogler, 2012. Tree Parsing for Tree-Adjoining Machine Translation. Journal of Logic and Computation
article - MB, Andreas Maletti, and Heiko Vogler, 2012. Unidirectional Derivation Semantics for Synchronous Tree-Adjoining Grammars. 16th International Conference on Developments in Language Theory (DLT 2012). Volume 7410 of LNCS, pages 368–379. © Springer-Verlag, 2012.
paper (pdf) • bibtex entry • Springer link - MB and Anja Fischer, 2012. Deciding the Twins Property for Weighted Tree Automata over Extremal Semifields. ATANLP 2012 (acceptance rate: 4/6)
paper (pdf) • slides (pdf) • bibtex entry • ACL link - MB, Mark-Jan Nederhof, and Heiko Vogler, 2011. Tree Parsing with Synchronous Tree-Adjoining Grammars. IWPT 2011 (acceptance rate: 28/58)
paper (pdf) • slides: part one, part two • bibtex entry • ACL link - MB, Daniel Geisler, Torsten Stüber, and Heiko Vogler, 2010. n-Best Parsing Revisited. ATANLP 2010 (acceptance rate: 6/11)
paper (pdf) • slides (pdf) • bibtex entry • ACL link - MB, Jonathan May, and Heiko Vogler, 2009. Determinization of Weighted Tree Automata using Factorizations. FSMNLP 2009 (pre-proceedings and talk; acceptance rate: 20/21), extended version appeared in Journal of Automata, Languages and Combinatorics, 15 (2010) 3/4.
workshop paper (pdf) • slides (pdf) • extended manuscript (pdf) • bibtex entry - MB and Torsten Stüber, 2009. Monadic Datalog Tree Transducers. LATA 2009 (acceptance rate: 58/121)
bibtex entry
Other Publications or Talks
- MB, 2012. As Easy As Vanda, Two, Three: Components for Machine Translation Based on Formal Grammars. Talk given (in English) at the Theorietag 2012 of the German Informatics Society (GI).
abstract (pdf) • slides (pdf) - MB, Toni Dietze, Johannes Osterholzer, Anja Fischer, and Linda Leuschner. Vanda: A Statistical Machine Translation Toolkit. Talk given at the 6th International Workshop “Weighted Automata: Theory and Applications” (WATA 2012)
abstract (pdf) • slides (pdf) - MB and Anja Fischer, 2012. Deciding the Twins Property for Weighted Tree Automata over Extremal Semifields. Talk given at the 6th International Workshop “Weighted Automata: Theory and Applications” (WATA 2012). (Paper published at ATANLP 2012.)
abstract (pdf) • slides (pdf) - MB, 2011. Produktkonstruktionen für das Maschinelle Übersetzen. (Product constructions for machine translation.) Talk (in German) given at Universität Potsdam on December 19, 2011.
abstract (in German) • slides (pdf) - MB, 2011. Statistical Machine Translation with Weighted Grammars. Talk given at IMS Stuttgart on June 15, 2011; revised version given in Dresden (Statusvortrag) on September 12, 2011.
abstract • original slides (pdf) • revised slides (pdf) • references (pdf) • IMS Stuttgart - MB, 2008. Unranked Attributed Tree Transducers. Diploma thesis defense talk.
slides (pdf)