Hypothesis

3 Matching Annotations

Apr 2020
deepspeech.readthedocs.io deepspeech.readthedocs.io

Python contributed examples — DeepSpeech 0.7.0 documentation

1
1. raj_reddy 25 Apr 2020
  
  in Public
  
  Python contributed examples¶ Mic VAD Streaming¶ This example demonstrates getting audio from microphone, running Voice-Activity-Detection and then outputting text. Full source code available on https://github.com/mozilla/DeepSpeech-examples. VAD Transcriber¶ This example demonstrates VAD-based transcription with both console and graphical interface. Full source code available on https://github.com/mozilla/DeepSpeech-examples.
  
  speech to text machine learning python deepspeech
Visit annotations in context

Tags

machine learning

speech to text

deepspeech

python

Annotators

raj_reddy

URL

deepspeech.readthedocs.io/en/v0.7.0/Python-contrib-Examples.html
deepspeech.readthedocs.io deepspeech.readthedocs.io

Python API Usage example — DeepSpeech 0.7.0 documentation

1
1. raj_reddy 25 Apr 2020
  
  in Public
  
  Python API Usage example Edit on GitHub Python API Usage example¶ Examples are from native_client/python/client.cc. Creating a model instance and loading model¶ 115 ds = Model(args.model) Performing inference¶ 149 150 151 152 153 154 if args.extended: print(metadata_to_string(ds.sttWithMetadata(audio, 1).transcripts[0])) elif args.json: print(metadata_json_output(ds.sttWithMetadata(audio, 3))) else: print(ds.stt(audio)) Full source code
  
  speech to text machine learning deepspeech python
Visit annotations in context

Tags

machine learning

speech to text

deepspeech

python

Annotators

raj_reddy

URL

deepspeech.readthedocs.io/en/v0.7.0/Python-Examples.html
github.com github.com

mozilla/DeepSpeech

1
1. raj_reddy 25 Apr 2020
  
  in Public
  
  DeepSpeech is an open source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. NOTE: This documentation applies to the 0.7.0 version of DeepSpeech only. Documentation for all versions is published on deepspeech.readthedocs.io. To install and use DeepSpeech all you have to do is: # Create and activate a virtualenv virtualenv -p python3 $HOME/tmp/deepspeech-venv/ source $HOME/tmp/deepspeech-venv/bin/activate # Install DeepSpeech pip3 install deepspeech # Download pre-trained English model files curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.7.0/deepspeech-0.7.0-models.pbmm curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.7.0/deepspeech-0.7.0-models.scorer # Download example audio files curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.7.0/audio-0.7.0.tar.gz tar xvf audio-0.7.0.tar.gz # Transcribe an audio file deepspeech --model deepspeech-0.7.0-models.pbmm --scorer deepspeech-0.7.0-models.scorer --audio audio/2830-3980-0043.wav A pre-trained English model is available for use and can be downloaded using the instructions below. A package with some example audio files is available for download in our release notes.
  
  speech to text machine learning deepspeech mozilla
Visit annotations in context

Tags

mozilla

machine learning

speech to text

deepspeech

Annotators

raj_reddy

URL

github.com/mozilla/DeepSpeech

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL