Attention!

Our new website is available here: https://www.umwelt-campus.de

ViMELF

A Corpus of Video-Mediated English as a Lingua Franca Conversations

Citation:

  • ViMELF. 2018. Corpus of Video-Mediated English as a Lingua Franca Conversations. Birkenfeld: Trier University of Applied Sciences. Version 1.0. The CASE project [umwelt-campus.de/case].

Compilation: Stefan Diemer, Marie-Louise Brunner, Caroline Collet, Selina Schmidt. The CASE project, Trier University of Applied Sciences.

Version 1.0, released 15 May 2018

Description:

  • 20 Conversations
  • Total conversation length: 744.5 min (ca. 12.5 hours)
  • Average conversation length: 37.23 min.
  • Words/Tokens: 113670 (plain text), 154472 (annotated version)
  • Participants: 40 (20 SB, 5 FL, 5 HE, 5 ST, 5 SF)
    Speaker IDs: SB - Saarland University, Saarbrücken, Germany; FL - University of Bologna, Forlì, Italy; HE - Helsinki University & Hanken School of Economics, Helsinki, Finland; ST - University of Santiago de Compostela, Spain; SF - Sofia University, Bulgaria
  • Medium: Video both sides: 11, video one side: 3, audio: 6
  • Including sociolinguistic background data
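The figures in this description can be cross-checked with a few lines of arithmetic. The sketch below only re-uses the numbers listed above; the variable names are illustrative and not part of the corpus or its tools.

```python
# Figures copied from the corpus description; variable names are illustrative.
conversations = 20
total_minutes = 744.5
participants_by_site = {"SB": 20, "FL": 5, "HE": 5, "ST": 5, "SF": 5}
medium_counts = {"video both sides": 11, "video one side": 3, "audio": 6}

# Derived values
total_hours = total_minutes / 60                  # total duration in hours
average_minutes = total_minutes / conversations   # mean conversation length
total_participants = sum(participants_by_site.values())
total_by_medium = sum(medium_counts.values())

print(f"Total duration: {total_hours:.1f} h")
print(f"Average length: {average_minutes:.3f} min")
print(f"Participants:   {total_participants}")
print(f"Conversations by medium: {total_by_medium}")
```

The participant counts add up to 40 and the medium counts to 20, matching the number of conversations. The computed average agrees with the stated 37.23 min up to rounding, and the total comes to a little under 12.5 hours, in line with the approximate figure given.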

Versions: Four versions of the corpus are available:

  • CASE transcription (as docx, rtf and txt): the basic version produced by manual transcription. CASE transcription conventions include spoken language features beyond the words, such as prosodic, paralinguistic and non-verbal features.
  • XML version (xml): a version of the annotated CASE transcription encapsulating the original information in a machine-readable form – this version is produced with XTranscript (Gee 2018)
  • Lexical version (lex): For the lexical version all annotation is removed - this version is produced with XTranscript (Gee 2018)
  • Part-of-speech tagged version (pos): a POS-tagged version of the lexical version, produced with the CLAWS POS tagger (C7 tagset).

Access ViMELF

The corpus is freely available for noncommercial research.

If you would like to obtain access to the transcripts, please complete the form below.

We will send you a link to the corpus data download site.

Name (last, first):
Institution:
Institutional or personal website:
Address:
Email:
Status (student, doctoral researcher, faculty, teacher, independent scholar):
Research Focus:

Access to ViMELF video data

The ViMELF video files are also available for noncommercial research purposes. To obtain a viewing link, please send an email to

case@umwelt-campus.de

and provide the following details:

  • Name (last, first)
  • Institution
  • Institutional or personal website
  • Address
  • Email
  • Status (student, doctoral researcher, faculty, teacher, independent scholar)
  • Research Focus, including intended use of videos

We'd love to hear from you!

Further info, questions etc.: case@umwelt-campus.de

Project Coordination

Stefan Diemer & Marie-Louise Brunner

Language & Communication, Trier University of Applied Sciences, Germany

case@umwelt-campus.de