"Is There Choice in Non-Native Voice?" Linguistic Feature Engineering and a Variationist Perspective in Automatic Native Language Identification

dc.contributor.advisor Meurers, Detmar (Prof. Dr.)
dc.contributor.author Bich, Serhiy
dc.date.accessioned 2017-08-08T08:51:09Z
dc.date.available 2017-08-08T08:51:09Z
dc.date.issued 2017
dc.identifier.other 493236589 de_DE
dc.identifier.uri http://hdl.handle.net/10900/77443
dc.identifier.uri http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-774435 de_DE
dc.identifier.uri http://dx.doi.org/10.15496/publikation-18844
dc.description.abstract Is it possible to infer the native language of an author from a non-native text? Can we perform this task fully automatically? Interest in these questions led to the emergence of a research field called Native Language Identification (NLI) in the first decade of this century. The requirement to automatically identify a particular property from language data situates the task at the intersection of computer science and linguistics, that is, in computational linguistics, which combines both disciplines. This thesis targets several relevant research questions in the context of NLI. In particular, what roles do surface features and more abstract linguistic cues play? How can different sets of features be combined, and how can the resulting large models be optimized? Do the findings generalize across different data sets? Can we benefit from considering the task in light of language variation theory? To approach these questions, we conduct a range of quantitative and qualitative explorations, employing different machine learning techniques. We show how linguistic insight can advance technology, and how technology can advance linguistic insight, constituting a fruitful and promising interplay. en
dc.language.iso en de_DE
dc.publisher Universität Tübingen de_DE
dc.rights ubt-podok de_DE
dc.rights.uri http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de de_DE
dc.rights.uri http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en en
dc.subject.classification Computerlinguistik, Automatische Klassifikation, Fremdsprachenlernen, Variationslinguistik, Identifikation de_DE
dc.subject.ddc 400 de_DE
dc.subject.other Native Language Identification en
dc.subject.other Author Profiling en
dc.subject.other Text Classification en
dc.subject.other Second Language Acquisition en
dc.subject.other Zweitspracherwerb de_DE
dc.subject.other Variationist Sociolinguistics en
dc.subject.other Textklassifikation de_DE
dc.subject.other Automatische Muttersprachenerkennung de_DE
dc.subject.other NLI en
dc.title "Is There Choice in Non-Native Voice?" Linguistic Feature Engineering and a Variationist Perspective in Automatic Native Language Identification en
dc.type PhDThesis de_DE
dcterms.dateAccepted 2017-05-12
utue.publikation.fachbereich Allgemeine u. vergleichende Sprachwissenschaft de_DE
utue.publikation.fakultaet 5 Philosophische Fakultät de_DE
