Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inconsistent Pronunciation of "BAN/ban" and "BUFF/buff" in GPT-SoVITS #1932

Open
Strive-for-excellence opened this issue Jan 14, 2025 · 0 comments

Comments

@Strive-for-excellence
Copy link
Contributor

I have noticed an issue with GPT-SoVITS where the pronunciation of certain words differs depending on their capitalization. Specifically:

Description:

"BAN" and "ban" are pronounced differently.
"BUFF" and "buff" are pronounced differently.
This inconsistency can affect the usability and naturalness of the model, especially when dealing with context-sensitive or case-sensitive text-to-speech tasks.

Steps to Reproduce:

Input "BAN" and "ban" in the model.
Input "BUFF" and "buff" in the model.
Compare the pronunciations for each pair.

Expected Behavior:

The pronunciation of these words should remain consistent regardless of capitalization, unless explicitly designed to convey different meanings.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant