Researcher biography

My name is Martin Schweinberger and I am a Lecturer in Applied Linguistics at the University of Queensland (UQ) in Australia. At UQ, I am Director of the Language Technology and Data Analysis Laboratory (LADAL), together with Michael Haugh. I consider myself a quantitative corpus linguist specializing in computational analyses of text and speech. In my research, I aim to bridge the gap between computational linguistics and corpus linguistics.

I am a steering committee member and Chief Investigator (CI) of the Australian Text Analytics Platform (ATAP), where I focus on producing resources and training through LADAL. I am also a CI and a member of the advisory committee of the Language Data Commons of Australia (LDaCA). Both ATAP and LDaCA aim to establish language data infrastructure and text analytics upskilling resources in Australia, and both have received substantial funding from the Australian Research Data Commons (ARDC). I have recently been elected as incoming Vice-President Profession of the International Society for the Linguistics of English (ISLE), and I am a board member of the International Computer Archive of Modern and Medieval English (ICAME).

Regarding my background, I have a PhD in English linguistics. I studied at the National University of Ireland, Galway, and at Universität Kassel, where I graduated in 2008 with an MA in English Philology, Philosophy, and Psychology. After my MA, I remained in Kassel for a short while but soon moved on to the University of Hamburg, where I completed my PhD.

Potential topics for supervision

I would be particularly interested in supervising theses on the following topics:

Sociolinguistics / Language Variation and Change / World Englishes

  • General extenders
  • Terms-of-address and salutations
  • Discourse particles and markers
  • Vulgarity
  • Adjective amplification

Learner Language / Applied Linguistics / Corpus Phonetics / Learner Corpus Research

  • Vowel production among L1 speakers and learners of English
  • Voice-onset-times among L1 speakers and learners of English
  • Fluency and pauses in learner and L1 speech
  • Accent and intelligibility / comprehension

Text Analytics / Digital Humanities / Corpus Linguistics

  • Applications of word embeddings in the language sciences
  • Comparison of different association / keyness measures

Featured projects

  • Australian Text Analytics Platform (ATAP): ARDC Co-Investment Project (now part of LDaCA)
  • Language Data Commons of Australia (LDaCA): ARDC Co-Investment Project