wlee88 / azure-cognitive-speech-memory-leak

POC - does MS Cognitive Speech leak memory with multiple channels

Azure STT Memory Leak Test

The aim of this repository is to investigate whether basic usage of the Azure speech-to-text (STT) provider leaks memory.

The program simply:

  • reads an example audio file from the file system and loops it using ffmpeg
  • pipes that data into an Azure push stream and starts STT recognition (one recognizer per channel)
  • records memory usage stats every 15s
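The three steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual index.ts: the helper names (buildFfmpegArgs, toArrayBuffer, streamChannel, sampleMemory, startMemoryLog), the ffmpeg flags, and the 16 kHz 16-bit mono PCM format are all assumptions. The push stream is represented by a small AudioSink interface so the sketch runs without the SDK installed; the SDK's push stream offers a compatible write(ArrayBuffer) method.

```typescript
import { spawn } from "node:child_process";

// Minimal shape of the object we write audio into. Injected so this sketch
// does not depend on microsoft-cognitiveservices-speech-sdk being installed.
interface AudioSink {
  write(data: ArrayBuffer): void;
}

// Hypothetical helper: ffmpeg arguments that loop `file` forever and emit a
// single channel as raw 16 kHz, 16-bit, mono PCM on stdout.
function buildFfmpegArgs(file: string, channel: number): string[] {
  return [
    "-re",                            // read at the input's native rate (assumption)
    "-stream_loop", "-1",             // loop the input indefinitely
    "-i", file,
    "-af", `pan=mono|c0=c${channel}`, // keep only the requested channel
    "-f", "s16le", "-ar", "16000", "-ac", "1",
    "pipe:1",                         // write the raw audio to stdout
  ];
}

// Copy a chunk of bytes into a standalone ArrayBuffer.
function toArrayBuffer(chunk: Uint8Array): ArrayBuffer {
  const copy = new ArrayBuffer(chunk.byteLength);
  new Uint8Array(copy).set(chunk);
  return copy;
}

// Spawn ffmpeg for one channel and forward every stdout chunk to the sink.
function streamChannel(file: string, channel: number, sink: AudioSink) {
  const ffmpeg = spawn("ffmpeg", buildFfmpegArgs(file, channel));
  ffmpeg.stderr.resume(); // discard ffmpeg's log output so its pipe never fills
  ffmpeg.stdout.on("data", (chunk: Buffer) => sink.write(toArrayBuffer(chunk)));
  return ffmpeg;
}

// Take one memory reading, in megabytes rounded to one decimal place.
function sampleMemory(): { rssMb: number; heapUsedMb: number } {
  const { rss, heapUsed } = process.memoryUsage();
  const toMb = (bytes: number) => Math.round((bytes / 1024 / 1024) * 10) / 10;
  return { rssMb: toMb(rss), heapUsedMb: toMb(heapUsed) };
}

// Log a memory reading every 15 seconds by default.
function startMemoryLog(intervalMs = 15000) {
  return setInterval(
    () => console.log(new Date().toISOString(), sampleMemory()),
    intervalMs,
  );
}
```

With the real SDK, the sink would presumably be the stream returned by AudioInputStream.createPushStream(), wrapped with AudioConfig.fromStreamInput() and handed to a new SpeechRecognizer per channel before calling startContinuousRecognitionAsync(). Which memory fields the repository actually records is not stated; rss and heapUsed are used here only as plausible choices.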

To Run:

  • ensure FFmpeg is installed on your system and available on your PATH.
  • run yarn start with the AZURE_REGION and AZURE_KEY environment variables set
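As a hedged illustration of the second step, a program like this one might validate the two environment variables at startup so that a missing key fails fast rather than surfacing later as an SDK error. The readAzureConfig helper and AzureConfig type are hypothetical and not taken from the repository.

```typescript
// Hypothetical shape for the two settings the README asks for.
interface AzureConfig {
  region: string;
  key: string;
}

// Read AZURE_REGION and AZURE_KEY from an environment-like object and
// throw a descriptive error if either one is missing or blank.
function readAzureConfig(env: Record<string, string | undefined>): AzureConfig {
  const region = (env.AZURE_REGION ?? "").trim();
  const key = (env.AZURE_KEY ?? "").trim();

  const missing: string[] = [];
  if (region === "") missing.push("AZURE_REGION");
  if (key === "") missing.push("AZURE_KEY");

  if (missing.length > 0) {
    throw new Error(`Missing environment variable(s): ${missing.join(", ")}`);
  }
  return { region, key };
}
```

At startup this would be called as readAzureConfig(process.env), and the resulting region and key passed to the SDK's SpeechConfig.fromSubscription(key, region).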

Notes:

  • we've provided an 8-channel audio sample; a new SpeechRecognizer is started for each channel.
  • running FFmpeg without writing to the push stream shows no leak, which points to the microsoft-cognitiveservices-speech-sdk package as the source of the leak.
    • you can test this by commenting out L46 of index.ts, so that nothing is written to the push stream.
  • we've noticed that the rate of the leak scales in proportion to the number of channels.
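One way to put a number on the leak rate from the 15s samples is a least-squares slope of memory against time. The README does not say how the rate was measured, so this is only an assumed approach; the MemorySample type and leakRateBytesPerSecond function are hypothetical names.

```typescript
// One memory reading: elapsed time in seconds and memory in bytes.
interface MemorySample {
  seconds: number;
  bytes: number;
}

// Least-squares slope of bytes over time, i.e. the average growth rate
// in bytes per second across all samples.
function leakRateBytesPerSecond(samples: MemorySample[]): number {
  if (samples.length < 2) {
    throw new Error("need at least two samples");
  }
  const n = samples.length;
  const meanT = samples.reduce((sum, s) => sum + s.seconds, 0) / n;
  const meanB = samples.reduce((sum, s) => sum + s.bytes, 0) / n;

  let covariance = 0;
  let varianceT = 0;
  for (const s of samples) {
    covariance += (s.seconds - meanT) * (s.bytes - meanB);
    varianceT += (s.seconds - meanT) * (s.seconds - meanT);
  }
  if (varianceT === 0) {
    throw new Error("samples must cover more than one point in time");
  }
  return covariance / varianceT;
}
```

Dividing the result by the number of channels and comparing runs with different channel counts gives a rough check of the proportionality noted above: if the leak scales with channel count, the per-channel rate should stay roughly constant.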

Languages

Language:TypeScript 100.0%