which f0 method is better dio or crepe? #318

Meldoner · 2023-04-13T15:09:23Z

Meldoner
Apr 13, 2023

What do you think?

Apr 13, 2023

Blue did some comparison tests over there.

Personally, I avoid dio and harvest, since they're not as good as the other ones in my opinion.

parselmouth seems to have a pretty solid speed/performance/quality mix from what I noticed,
though it should be mentioned that it is struggling with certain parts.

So all in all, a mix of crepe and parselmouth is probably a good solution (if you edit them together in an audio editing program later on)

View full answer

Lordmau5 · 2023-04-13T15:51:08Z

Lordmau5
Apr 13, 2023
Maintainer

#275

Blue did some comparison tests over there.

Personally, I avoid dio and harvest, since they're not as good as the other ones in my opinion.

parselmouth seems to have a pretty solid speed/performance/quality mix from what I noticed,
though it should be mentioned that it is struggling with certain parts.

So all in all, a mix of crepe and parselmouth is probably a good solution (if you edit them together in an audio editing program later on)

0 replies

definatefilms · 2023-05-02T06:57:12Z

definatefilms
May 2, 2023

I always test them all because on different models they all perform differently.

DIO (Distributed Inline Filtering with Overlap) is an algorithm for fundamental frequency (F0) estimation in speech signals. It uses a two-step process: first, it applies a low-pass filter to the signal to extract the harmonic structure, and then it uses a peak-picking algorithm to estimate the F0.

CREPE (Convolutional REctified Phase Expressions) is a deep learning-based pitch detection algorithm that uses a convolutional neural network (CNN) to extract pitch features from the audio signal.

Harvest (Harmonic Product Spectrum) is an algorithm for pitch detection that works by computing the harmonic product spectrum of the audio signal, which is a spectral representation that emphasizes harmonic frequencies.

Parselmouth is a Python library for Praat, which is a software tool commonly used in phonetics research. Parselmouth provides an interface for accessing Praat's functionality in Python code, including functions for analyzing and synthesizing speech signals, as well as extracting features like pitch, formants, and spectrograms.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

which f0 method is better dio or crepe? #318

{{title}}

Replies: 2 comments

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

which f0 method is better dio or crepe? #318

Meldoner Apr 13, 2023

Replies: 2 comments

Lordmau5 Apr 13, 2023 Maintainer

definatefilms May 2, 2023

Meldoner
Apr 13, 2023

Lordmau5
Apr 13, 2023
Maintainer

definatefilms
May 2, 2023