Speech Recognition Open API

Getting started guide:

Set up the gRPC server:

Without Docker

Create and activate a new conda environment:

conda create --name <env> python=3.8 && conda activate <env>

Install the required libraries using the following command:
```
pip install -r requirements.txt
```
Bootstrap the model code and the other models required as prerequisites:
```
sh model_bootstrap.sh
```
Download the models and update their paths in model_dict.json.
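
A wrong path in model_dict.json may only surface once the server tries to load a model, so it can help to check the file beforehand. The sketch below is not part of the repository. It assumes nothing about the schema of model_dict.json beyond it being valid JSON, and treats every string value containing "/" as a candidate filesystem path (so a URL stored in the file would also be reported). The names `iter_strings` and `find_missing_paths` are hypothetical.

```python
import json
import os


def iter_strings(node):
    """Yield every string value found anywhere in a parsed JSON document."""
    if isinstance(node, str):
        yield node
    elif isinstance(node, dict):
        for value in node.values():
            yield from iter_strings(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_strings(item)


def find_missing_paths(config_file):
    """Return path-like strings in the JSON file that do not exist on disk.

    Assumption: any string containing "/" is meant to be a filesystem path.
    """
    with open(config_file, encoding="utf-8") as handle:
        config = json.load(handle)
    candidates = [text for text in iter_strings(config) if "/" in text]
    return [path for path in candidates if not os.path.exists(path)]


# Example use from the repository root:
#     for path in find_missing_paths("model_dict.json"):
#         print("missing:", path)
```
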
Start the server on port 50051:
```
python server.py
```
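
To confirm that the server came up, a quick TCP check against port 50051 can be enough. The following is a minimal sketch using only the standard library. The function name `is_port_open` is hypothetical, and the check only shows that something is listening on the port, not that the gRPC service itself is healthy.

```python
import socket


def is_port_open(host="localhost", port=50051, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unresolvable hosts.
        return False


# Example use after starting server.py:
#     print(is_port_open())
```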

With Docker

docker build -t speech_recognition_model_api .

sudo docker run --cpus=6 -m 20000m -itd -p <<host_port>>:50051 --name speech_recognition_model_api -v <<host_model_path>>/deployed_models:<<container_model_path>>/deployed_models/ speech_recognition_model_api

Using the model API from client code:

In Python:

python examples/python/speech-recognition/main.py

Using the model API through REST calls via API Gateway:

Create an API config in API Gateway:

gcloud api-gateway api-configs create CONFIG_ID \
  --api=API_ID --project=PROJECT_ID \
  --grpc-files=api_descriptor.pb,api_config.yaml

Deploy the gateway in API Gateway:

gcloud api-gateway gateways create GATEWAY_ID \
  --api=API_ID --api-config=CONFIG_ID \
  --location=GCP_REGION --project=PROJECT_ID

View gateway information:

gcloud api-gateway gateways describe GATEWAY_ID \
  --location=GCP_REGION --project=PROJECT_ID

Test the REST API by sending a POST request with a JSON body such as:

{
    "config":{
        "language": {
            "value":"hi"
        },
        "transcriptionFormat": "TRANSCRIPT",
        "audioFormat": "WAV"
    },
    "audio":{
        "audioUri": "https://codmento.com/ekstep/test/changed.wav"
    }
}
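
The request body above can be sent from Python with the standard library alone. This is a sketch under stated assumptions: the source does not give the gateway hostname or the HTTP path, so `GATEWAY_URL` is a placeholder to be replaced with the hostname reported by `gcloud api-gateway gateways describe` and the path mapped in the proto's HTTP annotation. The helper names `build_request` and `recognize` are hypothetical.

```python
import json
import urllib.request

# Placeholder, not a real endpoint: substitute the deployed gateway's
# hostname and the HTTP path defined for the service.
GATEWAY_URL = "https://GATEWAY_HOSTNAME/PATH_FROM_PROTO"


def build_request(audio_uri, language="hi", url=GATEWAY_URL):
    """Build a POST request carrying the JSON body shown above."""
    payload = {
        "config": {
            "language": {"value": language},
            "transcriptionFormat": "TRANSCRIPT",
            "audioFormat": "WAV",
        },
        "audio": {"audioUri": audio_uri},
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def recognize(audio_uri, language="hi", url=GATEWAY_URL, timeout=60):
    """Send the request and return the decoded JSON response."""
    request = build_request(audio_uri, language, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example use once GATEWAY_URL points at a real gateway:
#     print(recognize("https://codmento.com/ekstep/test/changed.wav"))
```

Keeping `build_request` separate from `recognize` makes it possible to inspect the payload without any network access.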

Developer Guide

The API and protobuf definitions are taken from the google folder of the following repository:

https://github.com/googleapis/googleapis

Generate the stub files from the .proto file using the following command:

python3 -m grpc_tools.protoc \
    --include_imports \
    --include_source_info \
    --proto_path=./proto \
    ./proto/google/api/http.proto \
    ./proto/google/api/annotations.proto \
    ./proto/google/protobuf/descriptor.proto \
    -I ./proto \
    --descriptor_set_out=./proto/api_descriptor.pb \
    --python_out=./stub \
    --grpc_python_out=./stub \
    ./proto/speech-recognition-open-api.proto

To run tests, use the following command:

py.test --grpc-fake-server --ignore=wav2letter --ignore=wav2vec-infer --ignore=kenlm

DOC: https://cloud.google.com/api-gateway/docs/get-started-cloud-run-grpc#before_you_begin

Note:

If you get an error such as `ModuleNotFoundError: No module named 'speech_recognition_open_api_pb2'`, open stub/speech_recognition_open_api_pb2_grpc.py and, in the import section, change

'import speech_recognition_open_api_pb2 as speech__recognition__open__api__pb2'
to
'import stub.speech_recognition_open_api_pb2 as speech__recognition__open__api__pb2'
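
Since the generated file is overwritten every time the stubs are regenerated, the manual edit can be scripted. The sketch below is one possible way to do that and is not part of the repository. The function name `patch_stub_import` is hypothetical, and the default path assumes the script is run from the repository root.

```python
import re
from pathlib import Path

# Matches the bare generated import at the start of a line, but not an
# already patched "import stub.speech_recognition_open_api_pb2" line.
BARE_IMPORT = re.compile(
    r"^import speech_recognition_open_api_pb2 as ", re.MULTILINE
)


def patch_stub_import(grpc_file="stub/speech_recognition_open_api_pb2_grpc.py"):
    """Prefix the generated import with the stub package.

    Returns True if the file was changed, False if it was already patched.
    """
    path = Path(grpc_file)
    source = path.read_text(encoding="utf-8")
    patched = BARE_IMPORT.sub(
        "import stub.speech_recognition_open_api_pb2 as ", source
    )
    if patched != source:
        path.write_text(patched, encoding="utf-8")
    return patched != source
```

Running it twice is harmless: the second call finds nothing to replace and leaves the file untouched.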

Issue:

`AttributeError: Can't get attribute 'Wav2VecCtc' on <module '__main__' from 'server.py'>`

Solution: Import Wav2VecCtc in the file you use to start the server.
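
This kind of error is characteristic of Python's pickle mechanism, which checkpoint loaders such as `torch.load` rely on: a pickled object records only the module and name of its class, so that name must be resolvable on that module at load time. The sketch below reproduces the failure and the fix with the standard library alone. The module name `demo_entrypoint` and the empty `Wav2VecCtc` stand-in are made up for the demonstration and are not the real model code.

```python
import pickle
import sys
import types


class Wav2VecCtc:
    """Empty stand-in for the real model class (demonstration only)."""


# Pretend the class was defined in a script module named "demo_entrypoint".
Wav2VecCtc.__module__ = "demo_entrypoint"
Wav2VecCtc.__qualname__ = "Wav2VecCtc"
entrypoint = types.ModuleType("demo_entrypoint")
entrypoint.Wav2VecCtc = Wav2VecCtc
sys.modules["demo_entrypoint"] = entrypoint

# The pickle stores a reference to "demo_entrypoint.Wav2VecCtc",
# not the class body itself.
checkpoint = pickle.dumps(Wav2VecCtc())


def try_load(blob):
    """Return the unpickled object, or the AttributeError message on failure."""
    try:
        return pickle.loads(blob)
    except AttributeError as error:
        return str(error)


# Without the name on the module, loading fails as in the issue above.
del entrypoint.Wav2VecCtc
failure = try_load(checkpoint)

# Making the class available on the module again (the equivalent of
# importing it in the start-up file) lets the same bytes load.
entrypoint.Wav2VecCtc = Wav2VecCtc
restored = try_load(checkpoint)
```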

Repository: project-anuvaad/speech-recognition-open-api
