麻豆社 MakerBox

Kaldi

A speech-to-text system for quick, cost free transcription

This tool is no longer available via the 麻豆社.

How does it work?

1
Download and run the debian package
2
Upload video/speech files into the tool, using the instructions provided
3
Edit the transcription, as required

What is Kaldi?

Kaldi is a speech recognition toolkit, built upon the open source software originally developed for use by speech recognition researchers. The quickest way to search through a piece of audio or video is via a transcript, but transcription by hand is a costly and time consuming endeavour. Kaldi has been designed as a means of automating the same process for free, requiring only a small amount of installation effort from a software developer. The 麻豆社-Kaldi component provides a machine learning model built using the tools in the Open Source Kaldi toolkit and audio / text data from 麻豆社 programme and an easy to use interface that makes it simple to get up and running.

What have we learnt from Kaldi?

Kaldi has been used extensively within the 麻豆社, with a variety of learnings from each project. 麻豆社 Newslabs' project showed that speech-to-text is a tremendous timesaver for journalists who need to be across a wide number of video feeds. Their project proved that creating subtitles for viral videoclips can be done much quicker than was previously imagined. 麻豆社 Rewind also used speech-to-text to open up almost a million hours of material from the 麻豆社 Archive.

Latest Discussions

Sorry, discussions on Kaldi couldn鈥檛 be loaded at this time. Please try again later.

麻豆社