indiana-university / automated-transcription-service

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Vocabulary files

alan-walsh opened this issue · comments

Allow users to include a vocabulary file with their transcription, either list or table. This is likely to make a huge difference in recordings with a lot of very domain-specific language.

In the current implementation this would require some kind of clue in the audio filename. Either the vocab file has the same name as the recording or perhaps some kind of prefix that becomes a clue to the audio-to-transcribe Lambda function to look for the vocab file.