"Is There Choice in Non-Native Voice?" Linguistic Feature Engineering and a Variationist Perspective in Automatic Native Language Identification

Bich, Serhiy


dc.contributor.advisor	Meurers, Detmar (Prof. Dr.)
dc.contributor.author	Bich, Serhiy
dc.date.accessioned	2017-08-08T08:51:09Z
dc.date.available	2017-08-08T08:51:09Z
dc.date.issued	2017
dc.identifier.other	493236589	de_DE
dc.identifier.uri	http://hdl.handle.net/10900/77443
dc.identifier.uri	http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-774435	de_DE
dc.identifier.uri	http://dx.doi.org/10.15496/publikation-18844
dc.identifier.uri	http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-774437	de_DE
dc.description.abstract	Is it possible to infer the native language of an author from a non-native text? Can we perform this task fully automatically? Interest in these questions led to the emergence of the research field of Native Language Identification (NLI) in the first decade of this century. The requirement to automatically identify a particular property from language data situates the task at the intersection of computer science and linguistics, that is, in computational linguistics, which combines both disciplines. This thesis targets several research questions in the context of NLI. In particular, what are the roles of surface features and of more abstract linguistic cues? How can different sets of features be combined, and how can the resulting large models be optimized? Do the findings generalize across data sets? Can we benefit from considering the task in the light of language variation theory? To approach these questions, we conduct a range of quantitative and qualitative explorations, employing different machine learning techniques. We show how linguistic insight can advance technology, and how technology can advance linguistic insight, constituting a fruitful and promising interplay.	en
dc.language.iso	en	de_DE
dc.publisher	Universität Tübingen	de_DE
dc.rights	ubt-podok	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en	en
dc.subject.classification	Computerlinguistik, Automatische Klassifikation, Fremdsprachenlernen, Variationslinguistik, Identifikation	de_DE
dc.subject.ddc	400	de_DE
dc.subject.other	Native Language Identification	en
dc.subject.other	Author Profiling	en
dc.subject.other	Text Classification	en
dc.subject.other	Second Language Acquisition	en
dc.subject.other	Zweitspracherwerb	de_DE
dc.subject.other	Variationist Sociolinguistics	en
dc.subject.other	Textklassifikation	de_DE
dc.subject.other	Automatische Muttersprachenerkennung	de_DE
dc.subject.other	NLI	en
dc.title	"Is There Choice in Non-Native Voice?" Linguistic Feature Engineering and a Variationist Perspective in Automatic Native Language Identification	en
dc.type	PhDThesis	de_DE
dcterms.dateAccepted	2017-05-12
utue.publikation.fachbereich	Allgemeine u. vergleichende Sprachwissenschaft	de_DE
utue.publikation.fakultaet	5 Philosophische Fakultät	de_DE

Files:	bich-nli-thesis.pdf (PDF, 1.76 MB)
