Skip to content

Code for paper "Neural embeddings for audio similarity"

Notifications You must be signed in to change notification settings

marisbasha/neural_embeddings

Repository files navigation

Benchmarking embeddings for retrieval and separation of vocalizations in humans and songbirds

Install Environment and dependencies

conda env create -f environment.yml

Install either requirements_windows (for Windows) or requirements (for UNIX) for WhisperSeg

pip install -r ./WhisperSeg/requirements.txt
pip install -r ./WhisperSeg/requirements_windows.txt

Or simply run the bash script

sh setup.sh

Reproducibility

To reproduce the results obtained in the paper, first modify reproducibility_config.yaml with your specific directories. Then running:

python reproducibility_extract_features.py

will extract the mel, Whisper embeddings, and Encodec codes for all subsets.

Then:

python reproducibility_compute_distances.py

will compute the distance metrics for all the permutations.

Lastly:

python reproducibility_statistics.py

will create json files for accuracy and f-value.

Contact

Anonymus

About

Code for paper "Neural embeddings for audio similarity"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published