Snowboy stop development #630

pepebc · 2020-07-25T18:05:07Z

KITT.AI announces: on December 31, 2020 stop the development of all their products
Have you thought of an alternative?

Sispheor · 2020-07-25T18:33:20Z

We are evaluating some alternatives.
Fell free to propose as well.

corus87 · 2020-07-25T19:26:05Z

We don't have much options at the moment, there is still porcupine but its not open source.

With the next Kalliope release we are able to use mycroft precise trigger which is open source, I'm already played a bit with precise, but to create a good wake word, it need to take a lot of training but its not very user friendly.

We need at least a good Kalliope wake word, therefore we need to collect some voice samples from the community then it needs a lot of datasets (for example I have downloaded 50GB from the mozilla common voice datasets) to train my wake word.

I'm going to push a community based precise trigger after the next release of Kalliope.

khimaros · 2021-02-02T03:46:23Z

a few resources since i was just researching this:

you mention porcupine is not open source, do you mean the models?

Miloune · 2021-04-01T16:51:40Z

Maybe this one as well: https://github.com/alphacep/vosk-api ?

Sispheor · 2021-04-28T11:14:27Z

So, the best option is porcupine?
I have some time in the next coming days to work on Kalliope. I'll test some trigger engine myself.

corus87 · 2021-04-28T11:38:11Z

I did take a look into vosk mentioned by @Miloune, it looks promising and at least in the german dataset the word "kalliope" is recognized. I don't have much time lately but a first try with a simple script for a single keyword worked pretty good.

Sispheor · 2021-04-29T10:06:15Z

Their is also the Snips project implementation. Used by Rhasspy.

Sispheor · 2021-04-29T13:31:33Z

So for porcupine we cannot generate model for RPI, and generated models need to be recreated every 30 days if I understand well.
So for me it looks like a no-go.

khimaros · 2021-04-29T14:49:41Z

if you are considering vosk, i'd recommend taking a look at mozilla deepspeech which i found to be both easier to configure and considerably more accurate. they also have examples for streaming transcription in python and bulk transcription in python

Sispheor · 2021-04-29T14:51:33Z

I thought they were more stt engines than wake word engines.

khimaros · 2021-04-29T14:54:47Z

@Sispheor -- that's right, including vosk. i'm not sure if either is a good fit for a snowboy replacement, but if you're going to go that direction i think deepspeech is a better target from my limited experiments.

khimaros · 2021-04-29T14:58:27Z

snips looks promising but appears to have been acquired by Sonos. i couldn't find the platform source https://github.com/snipsco?q=platform&type=&language=&sort= and their usage docs have been removed. the closest available is https://github.com/snipsco/snips-record-personal-hotword

Sispheor · 2021-04-29T15:08:24Z

Another similar project has created a list of wake work engine here.
They have created their own called Raven. Not yet a python lib but it seems to be full python. Need to take a look.
For me I think the best candidate is mycroft-prcise

pepebc · 2021-04-29T19:18:18Z

I have tried all and i prefer mycroft-precise, with good accuracy.
Raven is still a very young development, with many false positives.
And the mentioned vosk is very interesting

Sispheor · 2021-04-29T19:23:31Z

The problem with Mycroft is mainly that the only supported arm proc is armv7l.
I don't find a doc with a full map of all RPI but I think it's recent ones.

corus87 · 2021-04-29T19:43:51Z

Vosk is working pretty good so far, but its slow, with a small dataset it takes about 1-1.5 seconds until the word is recognize. For an Stt it would be a good offline alternative but as a trigger, may not.

if you are considering vosk, i'd recommend taking a look at mozilla deepspeech which i found to be both easier to configure and considerably more accurate. they also have examples for streaming transcription in python and bulk transcription in python

@khimaros I already played around with deepspeech but for more than a year ago, I guess they did some improvements since then, so I guess I will take new look.
For Vosk its very easy to setup, just pip install vosk and download the language model.

The problem with Mycroft is mainly that the only supported arm proc is armv7l.
I don't find a doc with a full map of all RPI but I think it's recent ones.

@Sispheor
It should support the Rpi 2 + 3 and x86 at least those are the ones I tested, but Rpi 4 should also be supported.
Precise is pretty nice, its working good but only with a good trained wake word, and this is very hard to train.
I also read about raven (At least on the rhasspy page) and maybe it worth a shot.

Hopefully soon there will come a good alternative to snowboy or wake work training on precise will get much easier...

Edit: Not all x86 CPU's are supported by precise, AVX is required for the default tensorflow package, but most modern CPU's support AVX.

nshmyrev · 2021-04-30T06:37:36Z

Vosk is working pretty good so far, but its slow, with a small dataset it takes about 1-1.5 seconds until the word is recognize. For an Stt it would be a good offline alternative but as a trigger, may not.

@corus87 this might be an issue with setup or an old version. Which model/language are you using here? We have updated some of our models couple months ago: English, German. We have also updated the code for much faster latency of the answer. Please try to recheck with latest setup.

Hopefully soon there will come a good alternative to snowboy or wake work training on precise will get much easier...

There will be a keyword spotter in Vosk soon, couple weeks. Stay tuned.

corus87 · 2021-04-30T07:39:22Z

@nshmyrev Thanks for your input! Those are great news you are working on a native keyword spotter, I'm looking forward to it!

I just started earlier this week with Vosk, so I'm using the latest pip version and model you provide here .
For my tests, I've made a small script using pyaudio, but even with your python example there is no difference, it takes about 1 second.

I'm running those tests on x86 with a ryzen 4800h, I guess there should be enough CPU power.

nshmyrev · 2021-04-30T07:59:36Z

I'm using the latest pip version and model you provide here .

Which model version exactly please? 0.15?

For my tests, I've made a small script using pyaudio, but even with your python example there is no difference, it takes about 1 second.

We do not recommend pyaudio exactly due to latency issues.

there is no difference, it takes about 1 second.

Ok, let me check too

corus87 · 2021-04-30T08:03:42Z

Which model version exactly please? 0.15?

I tried vosk-model-de-0.6, vosk-model-small-de-0.15 and vosk-model-small-en-us-0.15.

Ok, let me check too

Thanks.

HumanG33k · 2021-11-22T03:01:14Z

hi i just try to install following website instruction.
The default setting file use snowboy, there is somewhere an alternative ?

vkuehn · 2021-11-22T16:01:39Z

that's closed since ages

corus87 · 2021-12-11T10:53:04Z

The guys from rhasspy built a docker container to run a local web server where you can easily create your own pmdl wake word for snowboy.

https://github.com/rhasspy/snowboy-seasalt

Just install docker and run the command in the readme, it will download the container and starts the web server. Then you can access the interface on http://localhost:8000 and create a wake word in 2 minutes.

R-Jurado · 2024-02-02T12:57:52Z

Sorry for refloating this old issue, but since I haven't found any recent related issue or article on the matter, may I suggest to support TeachableMachine? It's opensource and easy to use. Don't know if there is an official alternative to Snowboy, I haven't found any.

1001111github · 2025-01-21T22:12:41Z

I have been using Precise quite happily for years.

However, I currently have an openwakeword trigger, hacked from the precise code, that is sort of functional. Unfortunately, it does not seem to release the microphone after a key word is spotted. In addition to that, there seems to be a mis-match between the precise buffer system and openwakeword giving weird, but reliable, detection values. Sigh, there are advantages to analog when dealing with sound.

YMMV
Dave

1001111github · 2025-02-04T21:11:10Z

I have a fully functional openwakeword trigger for Kalliope. It is much, much better than my precise trigger. All the default wake words are easily recognized, using any voice. Prototype code can be accessed at https://github.com/1001111github/kalliope-trigger-openww. In addition Home Assistant offers a huge number of word models.

Local STT using the openai-whisper model and code at https://github.com/1001111github/kalliope-stt-whisperer. The only major drawback in Kalliope, was the dependence on "foreign" STT. CMUSphinx was never really an option. Whisper just works. I did have to remove punctuation. This module is not suitable for mobile, lol, as it does require some CPU and memory. Again prototype code, very little testing of edge cases or review

And finally, another TTS, again fully open source using piper. https://github.com/1001111github/kalliope-tts-piper. The biggest advantage is there is no need to execute a system level program anymore. An added bonus is a lot of modern voices.

While working with piper I found that most of my "Device not available OS Error 9985" was caused by pulseaudio and could be prevented by running pavucontrol in the background. I guess my next search is "python pulseaudio control"

All feedback and ideas welcome

HTH
Dave

Sispheor mentioned this issue Apr 30, 2021

Using Pocupine since Snowboy's shutdown #651

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Snowboy stop development #630

Snowboy stop development #630

pepebc commented Jul 25, 2020

Sispheor commented Jul 25, 2020

corus87 commented Jul 25, 2020

khimaros commented Feb 2, 2021

Miloune commented Apr 1, 2021

Sispheor commented Apr 28, 2021

corus87 commented Apr 28, 2021

Sispheor commented Apr 29, 2021

Sispheor commented Apr 29, 2021

khimaros commented Apr 29, 2021

Sispheor commented Apr 29, 2021

khimaros commented Apr 29, 2021

khimaros commented Apr 29, 2021

Sispheor commented Apr 29, 2021

pepebc commented Apr 29, 2021 •

edited

Loading

Sispheor commented Apr 29, 2021

corus87 commented Apr 29, 2021 •

edited

Loading

nshmyrev commented Apr 30, 2021

corus87 commented Apr 30, 2021

nshmyrev commented Apr 30, 2021

corus87 commented Apr 30, 2021

HumanG33k commented Nov 22, 2021

vkuehn commented Nov 22, 2021

corus87 commented Dec 11, 2021

R-Jurado commented Feb 2, 2024 •

edited

Loading

1001111github commented Jan 21, 2025

1001111github commented Feb 4, 2025

Snowboy stop development #630

Snowboy stop development #630

Comments

pepebc commented Jul 25, 2020

Sispheor commented Jul 25, 2020

corus87 commented Jul 25, 2020

khimaros commented Feb 2, 2021

Miloune commented Apr 1, 2021

Sispheor commented Apr 28, 2021

corus87 commented Apr 28, 2021

Sispheor commented Apr 29, 2021

Sispheor commented Apr 29, 2021

khimaros commented Apr 29, 2021

Sispheor commented Apr 29, 2021

khimaros commented Apr 29, 2021

khimaros commented Apr 29, 2021

Sispheor commented Apr 29, 2021

pepebc commented Apr 29, 2021 • edited Loading

Sispheor commented Apr 29, 2021

corus87 commented Apr 29, 2021 • edited Loading

nshmyrev commented Apr 30, 2021

corus87 commented Apr 30, 2021

nshmyrev commented Apr 30, 2021

corus87 commented Apr 30, 2021

HumanG33k commented Nov 22, 2021

vkuehn commented Nov 22, 2021

corus87 commented Dec 11, 2021

R-Jurado commented Feb 2, 2024 • edited Loading

1001111github commented Jan 21, 2025

1001111github commented Feb 4, 2025

pepebc commented Apr 29, 2021 •

edited

Loading

corus87 commented Apr 29, 2021 •

edited

Loading

R-Jurado commented Feb 2, 2024 •

edited

Loading