jbienzms / Unity-MS-SpeechSDK

Sample Unity project used to demonstrate Speech Recognition using the new Microsoft Speech Service (Preview) via WebSockets.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Unity-MS-SpeechSDK

Sample Unity project used to demonstrate Speech Recognition (aka Speech-to-Text) using the new Microsoft Speech Service (currently in Preview) via WebSockets. The Microsoft Speech Service is part of Microsoft Azure Cognitive Services. This is a work in progress.

  • Unity version: 2018.2.2f1
  • Speech Service version: 0.5.0 (Preview)
  • Target platforms tested: Unity Editor/Mono (to be tested: Windows Desktop, UWP/WMR, Android, iOS)

Implementation Notes

  • This sample uses the Speech Service WebSocket protocol to interact with the Speech Service and generate speech recognition hypotheses in real-time.
  • This sample is compatible with both the new Cognitive Services Speech Service (Preview) and the classic Bing Speech API. The default and recommended approach is the new service.
  • You will need an Azure Cognitive Services account to use this sample: Create an account here.
  • If you see any API keys in the code, these are either trial keys that will expire soon or temporary keys that may get invalidated. Please get your own keys. Get your own trial key to Bing Speech or the new Speech Service here. A free tier is available allowing 5,000 transactions per month, at a rate of 20 per minute.
  • This initial draft implementation uploads a speech audio file to perform the recognition.
  • A UI Canvas button is used to trigger the speech recognition job. The results are posted in the UI Canvas Text label as well as the Unity Debug Console window.
  • NOTE: The Microphone integration is coming soon. This project contains incomplete artifacts in progress.

Resource Links

Follow Me

About

Sample Unity project used to demonstrate Speech Recognition using the new Microsoft Speech Service (Preview) via WebSockets.


Languages

Language:C# 100.0%