Automated speech recognition – whisply

Automated transcription extracts language content from common audio and video files in text form for further processing and use. The Research Data Centre supports you throughout the entire workflow.

With the whisply tool, based on whisper, you can process and transcribe large amounts of data in a short time via the University Library's servers. The whisply tool is language agnostic and can transcribe a wide range of languages.

Other whisply features include:

Automatic annotation of speakers and speaker changes
Transcription output as a .txt or .rttm file
Automatic creation of subtitles for videos in the .srt and .webvtt file formats

Instructions for installation and usage can be found on whisply's GitHub page.

Services

The FDZ can support you in areas including:

Advice on transcribing multimedia content
Setting up the transcription workflow
Customise whisply's output formats
General advice on audio-to-text
Support for further processing of the transcription (e.g. transformation from unstructured to structured data)

Contact

Forschungsdatenzentrum (FDZ)

Team: Irene Schumm, Phil Kolbe, David Morgan, Thomas Schmidt, Renat Shigapov, Christos Sidiropoulos, Vasilka Stoilova, Larissa Will

University of Mannheim
Universitätsbibliothek Mannheim
Schloss Schneckenhof West
68161 Mannheim

E-mail: forschungsdatenuni-mannheim.de
Web: www.bib.uni-mannheim.de/en/teaching-and-research/research-data-center-fdz

Opening Hours

Available Seats

Information and Advice

Chat Mon–Fri 10–6