deepsource docker docker-compose google-speech google-speech-to-text hacktoberfest javascript poc

Proof of Concept for transcoding podcasts into text using GCP Speech2Text service, following its NODE JS tutorial.

Installation

Download this repo:

git clone https://github.com/emibcn/Podcast2Text.git

Change directory into it:

cd Podcast2Text

Create local directories:

mkdir flac credentials

Create GCP credentials for consuming Speech2Text service at GCP IAM with -at least- Service Usage Consumer permission.
Copy credentials file to ./credentials directory
Create .env file with GOOGLE_APPLICATION_CREDENTIALS=[CREDENTIALS FILENAME] (without directory)

Usage

There is a script helper to transcode any audio file into text. It's syntax is:

./transcode.sh <FILEPATH> [START]

FILEPATH: Path (relative or absolute) to podcast audio file
START: Initial start seek (transcode beginning at this position). Same syntax as FFMPEG -ss option.

This will encode the supplied file to FLAC format into ./flac directory and then use the encoded file to send it to GCP Speech2Text service and get its transcription printed on screen.

About

Proof of concept for transcribing podcasts into text using GCP Speech2Text service

deepsource docker docker-compose google-speech google-speech-to-text hacktoberfest javascript poc

GNU General Public License v3.0

Languages

Language:JavaScript 67.5%Language:Shell 32.5%