You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to use a trained model of a speaker from well recorded sources to restore damaged or low fidelity recordings of the same person.
The timbre of the voice and the quality of the audio transfers extremely well, however I've found that the output audio tends to stray from the input audio.
So far I've noticed that:
the emphasis of words in a sentence sometimes changes
t's, b's p's and d's in particular are quite weak, and s's are a little lispy
The pitch of some words jumps up in a surprising manner
Are there any settings I could adjust to make the output audio more closely align to the input audio apart from the timbre?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I'm trying to use a trained model of a speaker from well recorded sources to restore damaged or low fidelity recordings of the same person.
The timbre of the voice and the quality of the audio transfers extremely well, however I've found that the output audio tends to stray from the input audio.
So far I've noticed that:
Are there any settings I could adjust to make the output audio more closely align to the input audio apart from the timbre?
Beta Was this translation helpful? Give feedback.
All reactions