GLaDOS voice #187
Comments
Tomorrow I will train a Glados dataset, but what worries me is the license to publish it. |
That's what I was afraid of. Would instructions to train a dataset on one's own be more in the clear? I have no idea about hardware requirements, though. |
To make things easier, I use Colab notebooks, since I don't have the hardware. To run it locally, you would need an NVIDIA GPU, and the parameters (e.g. batch_size) can be set according to the capabilities of your GPU. |
I don't have the hardware either. And I guess if detailed instructions were published, that could still get DMCA'd as did tools like yt-dl |
@rmcpantoja any update on the Glados training? |
Hi @dnhkng, |
@rmcpantoja The English link doesn't work. I was going to try a finetune on the original game voice data. I have 2x 4090s, so I should have enough compute. I could rip the voices from https://theportalwiki.com/wiki/GLaDOS_voice_lines but is there a dataset with this already prepared? Happy to share the results! |
Hi @dnhkng, The model was trained using this dataset, but I was in charge of fixing many incorrect transcriptions. |
@rmcpantoja I scraped the GLaDOS dataset (only using the Portal 2 voice and DLC), manually filtered out all the wav files that contained extras (laughing, telephones, beeping, etc.), and also fixed all the text. That gave me about 1 hour of high-quality data. I have currently fine-tuned for 15 hours on a 4090, and it sounds very good, and the loss is still decreasing. I will train for 24 hours and see how the loss curves look. EDIT: Here are samples after 24 hours of fine-tuning. 'a' is the generated sample, 'b' is an unseen sample from the game. I think the generated sample is better! Kudos to Piper, this is amazing! |
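For anyone repeating the filtering step, a quick way to check how much audio is left is to sum the clip lengths. This is only a minimal sketch, not the script used above: the function name and the folder layout are assumptions, and Python's standard `wave` module only reads uncompressed PCM WAV files.

```python
import wave
from pathlib import Path


def total_wav_seconds(folder: Path) -> float:
    """Sum the playing time of every .wav file directly inside `folder`."""
    total = 0.0
    for path in sorted(folder.glob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            # duration of one clip = number of frames / frames per second
            total += wav.getnframes() / wav.getframerate()
    return total


# Hypothetical usage: report the dataset size in hours.
# hours = total_wav_seconds(Path("glados_wavs")) / 3600
```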
@dnhkng Hello is it possible to get the model ? |
Yes, I will share it in the next few days. Doing a big refactor on the inference code. |
Sign me up as well |
@dnhkng Any update on the model ? |
OK, the model is available here: You can find the GlaDOS model in the models directory. It includes my new code base to use the voice. Have a look in the Jupyter Notebook on how to use it. If you instead want to use it with Piper, just take a medium size model, and copy the .onnx.json file, and rename it as glados.onnx.json, and it will run with Piper. |
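The Piper route described above comes down to a single file copy. A minimal sketch, assuming the model has been saved as `glados.onnx` and that the config of some existing medium-size Piper voice is available locally; the function name and both paths are hypothetical:

```python
import shutil
from pathlib import Path


def copy_voice_config(medium_config: Path, model_path: Path) -> Path:
    """Copy an existing medium-voice config next to the model as '<model name>.json'."""
    # For a model saved as glados.onnx this produces glados.onnx.json,
    # the file name suggested in the comment above.
    target = model_path.with_name(model_path.name + ".json")
    shutil.copyfile(medium_config, target)
    return target


# Hypothetical usage (both paths are placeholders):
# copy_voice_config(Path("some-medium-voice.onnx.json"), Path("models/glados.onnx"))
```

The original config is copied rather than moved, so the donor voice keeps working.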
Thank you very much for your work @dnhkng 👍👍👍 |
For those of you who want to run GlaDOS onnx model on iOS, Android, Raspberry Pi, or use We provide a colab to show you how to convert the GlaDOS model to sherpa-onnx The following is a sample command using the converted model with sherpa-onnx
test-glados.mov |
By the way, I just managed to build Android APKs for the pre-trained GLaDOS models mentioned in this issue, i.e, for the following two models:
You can find the APKs at For your convenience, the download address is given below:
If you are interested in how we build the APK, please read the following documentation You can also try the models in the following huggingface space in your browser |
could someone also train a german version please? |
@LaneaLucy I would, but I'm not sure how to do it! I trained using the voice from the game. Is there a German voice in the German edition? (I live in Bavaria, which is almost German :) ) |
@dnhkng yes, there is german voice in portal 1 and portal 2 |
@LaneaLucy I am doing new voice training soon, join my discord https://discord.com/invite/ERTDKwpjNB and in the general section I'm discussing it now. |
@dnhkng Your GlaDos model is amazing! You should also create a Cave Johnson and Wheatley model. Can you imagine having multiple devices speaking to each other with the different voices? 🤣 |
Is there a GLaDOS voice for Piper, as there was for Larynx (rhasspy/larynx#56)? Or possibly an easy way to convert one to the other?
I added phonemes and missing entries to the JSON file, but there are still phonemes missing and errors about the model.