View Full Document

PARAMOR: FROM PARA DIGM STRUCTURE TO NATURAL LANGUAGE MOR PHOLOGY INDUCTION



View the full content.
View Full Document
View Full Document

3 views

Unformatted text preview:

PARAMOR PARADIGM STRUCTURE TO NATURAL LANGUAGE MORPHOLOGY INDUCTION FROM Christian Monson Draft March 14 2008 Please do not distribute Language Technologies Institute School of Computer Science Carnegie Mellon University Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy Thesis Committee Jaime Carbonell Co Chair Alon Lavie Co Chair Lori Levin Ron Kaplan PowerSet 1 1 Introduction Most natural languages exhibit inflectional morphology that is the surface forms of words change to express syntactic features I run vs She runs Handling the inflectional morphology of English in a natural language processing NLP system is fairly straightforward The vast major ity of lexical items in English have fewer than five surface forms But English has a particularly sparse inflectional system It is not at all unusual for a language to construct tens of unique in flected forms from a single lexeme And many languages routinely inflect lexemes into hundreds thousands or even tens of thousands of unique forms In these inflectional languages computational systems as different as speech recognition Creutz 2006 machine translation Goldwater and McClosky 2005 Oflazer 2007 and information retrieval Mikko et al 2007 improve with careful morphological analysis Three broad categories encompass the wide variety of computational approaches which can analyze inflectional morphology A computational morphological analysis system can be 1 Hand built 2 Trained from examples of word forms correctly analyzed for morphology or 3 Induced from morphologically unannotated text in an unsupervised fashion Presently most computational applications take the first option hand encoding morphological facts Unfortunately manual description of morphology demands human expertise in a combina tion of linguistics and computation that is in short supply for many of the world s languages The second option training a morphological analyzer in a supervised fashion suffers from a



Access the best Study Guides, Lecture Notes and Practice Exams

Loading Unlocking...
Login

Join to view PARAMOR: FROM PARA DIGM STRUCTURE TO NATURAL LANGUAGE MOR PHOLOGY INDUCTION and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view PARAMOR: FROM PARA DIGM STRUCTURE TO NATURAL LANGUAGE MOR PHOLOGY INDUCTION and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?