Dependency based CCG derivation and application

dc.contributor.advisorBaldridge, Jasonen
dc.contributor.committeeMemberErk, Katrinen
dc.creatorBrewster, Joshua Blakeen
dc.date.accessioned2011-02-21T19:44:27Zen
dc.date.available2011-02-21T19:44:27Zen
dc.date.available2011-02-21T19:44:37Zen
dc.date.issued2010-12en
dc.date.submittedDecember 2010en
dc.date.updated2011-02-21T19:44:37Zen
dc.descriptiontexten
dc.description.abstractThis paper presents and evaluates an algorithm to translate a dependency treebank into a Combinatory Categorial Grammar (CCG) lexicon. The dependency relations between a head and a child in a dependency tree are exploited to determine how CCG categories should be derived by making a functional distinction between adjunct and argument relations. Derivations for an English (CoNLL08 shared task treebank) and for an Italian (Turin University Treebank) dependency treebank are performed, each requiring a number of preprocessing steps. In order to determine the adequacy of the lexicons, dubbed DepEngCCG and DepItCCG, they are compared via two methods to preexisting CCG lexicons derived from similar or equivalent sources (CCGbank and TutCCG). First, a number of metrics are used to compare the state of the lexicon, including category complexity and category growth. Second, to measures the potential applicability of the lexicons in NLP tasks, the derived English CCG lexicon and CCGbank are compared in a sentiment analysis task. While the numeric measurements show promising results for the quality of the lexicons, the sentiment analysis task fails to generate a usable comparison.en
dc.description.departmentLinguisticsen
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttp://hdl.handle.net/2152/ETD-UT-2010-12-2563en
dc.language.isoengen
dc.subjectCCG derivationen
dc.subjectDependency treebanken
dc.subjectComputational linguisticsen
dc.subjectCombinatory Categorical Grammaren
dc.subjectLexiconen
dc.subjectDependency grammarsen
dc.titleDependency based CCG derivation and applicationen
dc.type.genrethesisen
thesis.degree.departmentLinguisticsen
thesis.degree.disciplineLinguisticsen
thesis.degree.grantorUniversity of Texas at Austinen
thesis.degree.levelMastersen
thesis.degree.nameMaster of Artsen

Access full-text files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
BREWSTER-MASTERS-REPORT.pdf
Size:
936.26 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.13 KB
Format:
Plain Text
Description: