Abstract
The use of the pronoun ich (‘I’) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent different types of I usage (author I vs. narrator I) can be distinguished automatically. We additionally inspect which context words are good indicators of one type or the other. The results show that automatic classification is not straightforward: the classifier is not perfect, but it would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.
| Translated title of the contribution | Erforschung der Verwendung des Pronomen Ich in deutschen akademischen Texten mit maschinellem Lernen |
|---|---|
| Original language | English |
| Title of host publication | Informatik 2020 - Back to the future : 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual |
| Editors | Ralf H. Reussner, Anne Koziolek, Robert Heinrich |
| Number of pages | 7 |
| Place of Publication | Bonn |
| Publisher | Gesellschaft für Informatik e.V. |
| Publication date | 2020 |
| Pages | 1327-1333 |
| ISBN (Electronic) | 978-3-88579-701-2 |
| Publication status | Published - 2020 |
| Event | 50th Annual Conference of the German Informatics Society - INFORMATIK 2020: Back to the Future - Online, Karlsruhe, Germany. Duration: 28.09.2020 → 02.10.2020. Conference number: 50. https://informatik2020.gi.de/ https://gi.de/themen/beitrag/politik-in-den-social-media-herausforderung-fuer-die-webarchivierung-1 https://doi.org/10.25798/ak8r-eg12 |
Bibliographical note
Funding Information: Melanie Andresen’s work on this paper was funded by the Landesforschungsförderung Hamburg in the context of the project hermA [Ga17] (LFF-FV 35) at Universität Hamburg.
Publisher Copyright:
© 2020 Gesellschaft für Informatik (GI). All rights reserved.
Research areas and keywords
- Language Studies
- Annotation
- Academic language
- German
- Machine learning
- Classification
Fingerprint
Dive into the research topics of 'Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning'. Together they form a unique fingerprint.
Projects
- 1 Finished
- KoLaS: Kommentiertes Lernendenkorpus Akademisches Schreiben
Knorr, D. (Project manager, academic) & Andresen, M. (Partner)
30.10.11 → 31.12.16
Project: Research
Research output
- 1 Journal articles
- Zwischen Forscher-, Verfasser- und Erzähler-Ich: Eine korpuslinguistische Studie zur Konstruktion von Selbstreferenz und zu ihrer Einsatzmöglichkeit in der Schreibberatungsausbildung
Knorr, D., 04.2021, In: Zeitschrift für Interkulturellen Fremdsprachenunterricht. 26, 1, p. 137–160, 24 p.
Research output: Journal contributions › Journal articles › Research › peer-review
Open Access
Datasets
- Commented Learner Corpus Academic Writing; Kommentiertes Lernendenkorpus akademisches Schreiben
Knorr, D. (Creator) & Andresen, M. (Data Collector), Zentrum für Nachhaltiges Forschungsdatenmanagement Hamburg, 18.11.2017
DOI: 10.25592/uhhfdm.8322, https://www.fdr.uni-hamburg.de/record/8326
Dataset