elisaper / ML_Speech_Recognition_Google_API

Google Cloud Speech API Python Samples

For small files, run transcribe.py; for large files, run transcribe_async.py.

This directory contains samples for Google Cloud Speech API. The Google Cloud Speech API enables easy integration of Google speech recognition technologies into developer applications. Send audio and receive a text transcription from the Cloud Speech API service.

  • See the migration guide for information about migrating to Python client library v0.27.

Setup

Authentication

This sample requires you to have authentication set up. Refer to the Authentication Getting Started Guide for instructions on setting up credentials for applications.
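
As a quick sanity check before running the samples, the sketch below reports whether the GOOGLE_APPLICATION_CREDENTIALS environment variable points at an existing file. The function name `credentials_status` and its return strings are illustrative and not part of the samples. The client library can also find credentials in other ways, so a result of "unset" is not necessarily an error.

```python
import os


def credentials_status(env=None):
    """Describe how GOOGLE_APPLICATION_CREDENTIALS is configured.

    `env` defaults to os.environ; passing a dict makes the check testable.
    The function name and return values are illustrative only.
    """
    env = os.environ if env is None else env
    path = env.get("GOOGLE_APPLICATION_CREDENTIALS")
    if not path:
        # Variable is absent or empty; other credential sources may still apply.
        return "unset"
    if not os.path.isfile(path):
        # Variable is set but does not point at a readable key file.
        return "missing-file"
    return "ok"


if __name__ == "__main__":
    print(credentials_status())
```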

Install Dependencies

  1. Clone python-docs-samples and change directory to the sample directory you want to use.

    $ git clone https://github.com/GoogleCloudPlatform/python-docs-samples.git

  2. Install pip and virtualenv if you do not already have them. You may want to refer to the Python Development Environment Setup Guide for Google Cloud Platform for instructions.

  3. Create a virtualenv. Samples are compatible with Python 2.7 and 3.4+.

    $ virtualenv env
    $ source env/bin/activate

  4. Install the dependencies needed to run the samples.

    $ pip install -r requirements.txt

Samples

Quickstart

To run this sample:

$ python quickstart.py

Transcribe

To run this sample:

$ python transcribe.py

usage: transcribe.py [-h] path

Google Cloud Speech API sample application using the REST API for batch
processing.

Example usage:
    python transcribe.py resources/audio.raw
    python transcribe.py gs://cloud-samples-tests/speech/brooklyn.flac

positional arguments:
  path        File or GCS path for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit
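
The usage text above shows a single positional `path` that may be either a local file or a gs:// URI. A minimal sketch of how such an argument could be parsed and classified follows. It is not the code of transcribe.py itself, and `classify_path` and `build_parser` are hypothetical names introduced for illustration.

```python
import argparse


def classify_path(path):
    """Return 'gcs' for Cloud Storage URIs and 'local' for anything else."""
    return "gcs" if path.startswith("gs://") else "local"


def build_parser():
    """Build a parser that mirrors the usage text: one positional `path`."""
    parser = argparse.ArgumentParser(
        description="Sketch of a transcribe-style command line.")
    parser.add_argument(
        "path", help="File or GCS path for audio file to be recognized")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    # A real script would call a local-file or GCS transcription routine here.
    print(classify_path(args.path))
```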

Transcribe async

To run this sample:

$ python transcribe_async.py

usage: transcribe_async.py [-h] path

Google Cloud Speech API sample application using the REST API for async
batch processing.

Example usage:
    python transcribe_async.py resources/audio.raw
    python transcribe_async.py gs://cloud-samples-tests/speech/vr.flac

positional arguments:
  path        File or GCS path for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit
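
To make the choice between transcribe.py and transcribe_async.py concrete, the sketch below estimates the duration of a raw audio file from its size and picks a script. It rests on assumptions not stated in this README: that the file is headerless 16-bit mono PCM at 16000 Hz, and that about one minute of audio is the practical cutoff for synchronous recognition. Check the current API limits before relying on that threshold.

```python
import os
import sys

# Assumptions for illustration: headerless 16-bit mono PCM at 16 kHz,
# and a one-minute cutoff for synchronous recognition.
SAMPLE_RATE_HZ = 16000
BYTES_PER_SAMPLE = 2
SYNC_LIMIT_SECONDS = 60.0


def raw_duration_seconds(num_bytes,
                         sample_rate_hz=SAMPLE_RATE_HZ,
                         bytes_per_sample=BYTES_PER_SAMPLE):
    """Estimate the duration of raw PCM audio from its size in bytes."""
    return num_bytes / float(sample_rate_hz * bytes_per_sample)


def pick_script(num_bytes):
    """Suggest which sample script to run for a raw file of this size."""
    if raw_duration_seconds(num_bytes) <= SYNC_LIMIT_SECONDS:
        return "transcribe.py"
    return "transcribe_async.py"


if __name__ == "__main__":
    if len(sys.argv) == 2:
        print(pick_script(os.path.getsize(sys.argv[1])))
    else:
        print("usage: pick_script.py path")
```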

Transcribe with word time offsets

To run this sample:

$ python transcribe_word_time_offsets.py

usage: transcribe_word_time_offsets.py [-h] path

Google Cloud Speech API sample that demonstrates word time offsets.

Example usage:
    python transcribe_word_time_offsets.py resources/audio.raw
    python transcribe_word_time_offsets.py gs://cloud-samples-tests/speech/vr.flac

positional arguments:
  path        File or GCS path for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit
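
Word time offsets say when each recognized word starts and ends. The sketch below formats such offsets for display, assuming each offset arrives as a (seconds, nanos) pair in the style of a protobuf Duration. Some versions of the client library expose offsets differently, so treat the input shape and the helper names `to_seconds` and `format_word` as assumptions.

```python
def to_seconds(seconds, nanos):
    """Combine whole seconds and nanoseconds into fractional seconds."""
    return seconds + nanos * 1e-9


def format_word(word, start, end):
    """Format one word with its start and end offsets.

    `start` and `end` are (seconds, nanos) tuples; this shape is an
    assumption made for illustration.
    """
    return "Word: {}, start_time: {:.1f}, end_time: {:.1f}".format(
        word, to_seconds(*start), to_seconds(*end))


if __name__ == "__main__":
    # Hypothetical offsets, not real API output.
    print(format_word("how", (0, 0), (0, 300000000)))
```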

Transcribe Streaming

To run this sample:

$ python transcribe_streaming.py

usage: transcribe_streaming.py [-h] stream

Google Cloud Speech API sample application using the streaming API.

Example usage:
    python transcribe_streaming.py resources/audio.raw

positional arguments:
  stream      File to stream to the API

optional arguments:
  -h, --help  show this help message and exit
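
Streaming recognition sends audio as a sequence of requests instead of one payload. How transcribe_streaming.py splits its input is not shown here, so the sketch below illustrates only the general idea: a generator that yields fixed-size chunks from a file object. The name `iter_chunks` and the 4096-byte default are arbitrary choices for illustration.

```python
import io


def iter_chunks(fileobj, chunk_size=4096):
    """Yield successive chunks of at most `chunk_size` bytes from `fileobj`."""
    while True:
        chunk = fileobj.read(chunk_size)
        if not chunk:
            # An empty read means end of file.
            break
        yield chunk


if __name__ == "__main__":
    # In-memory stand-in for an audio file such as resources/audio.raw.
    fake_audio = io.BytesIO(b"\x00" * 10000)
    for index, chunk in enumerate(iter_chunks(fake_audio)):
        print(index, len(chunk))
```

Each yielded chunk would then be wrapped in one streaming request by the caller.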

Beta Samples

To run this sample:

$ python beta_snippets.py

usage: beta_snippets.py [-h] command path

Google Cloud Speech API sample that demonstrates enhanced models
and recognition metadata.

Example usage:
    python beta_snippets.py enhanced-model resources/commercial_mono.wav
    python beta_snippets.py metadata resources/commercial_mono.wav
    python beta_snippets.py punctuation resources/commercial_mono.wav

positional arguments:
  command
  path        File for audio file to be recognized

optional arguments:
  -h, --help  show this help message and exit
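
The usage text above takes a `command` followed by a `path`. One common way to route such a command to a handler is a dispatch table, sketched below with the three command names from the example usage. The handlers are placeholders that only echo their input, and the use of argparse `choices` is an assumption. beta_snippets.py may be structured differently.

```python
import argparse


# Placeholder handlers: a real script would call the Speech API here.
def enhanced_model(path):
    return "enhanced-model:" + path


def metadata(path):
    return "metadata:" + path


def punctuation(path):
    return "punctuation:" + path


# Map each command name from the example usage to its handler.
COMMANDS = {
    "enhanced-model": enhanced_model,
    "metadata": metadata,
    "punctuation": punctuation,
}


def build_parser():
    parser = argparse.ArgumentParser(
        description="Sketch of a command/path command line.")
    parser.add_argument("command", choices=sorted(COMMANDS))
    parser.add_argument("path", help="File for audio file to be recognized")
    return parser


def run(argv):
    """Parse `argv` and call the handler registered for the command."""
    args = build_parser().parse_args(argv)
    return COMMANDS[args.command](args.path)


if __name__ == "__main__":
    import sys
    print(run(sys.argv[1:]))
```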

The client library

This sample uses the Google Cloud Client Library for Python. You can read the documentation for more details on API usage and use GitHub to browse the source and report issues.
