Benchmarking embeddings for retrieval and separation of vocalizations in humans and songbirds

Install Environment and dependencies

conda env create -f environment.yml

pip install -r ./WhisperSeg/requirements.txt

pip install -r ./WhisperSeg/requirements_windows.txt

sh setup.sh

To reproduce the results obtained in the paper, first modify reproducibility_config.yaml with your specific directories. Then running:

python reproducibility_extract_features.py

will extract the mel, Whisper embeddings, and Encodec codes for all subsets.

Then:

python reproducibility_compute_distances.py

will compute the distance metrics for all the permutations.

Lastly:

python reproducibility_statistics.py

will create json files for accuracy and f-value.

Anonymus

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
WhisperSeg		WhisperSeg
README.md		README.md
__init__.py		__init__.py
environment.yml		environment.yml
reproducibility_compute_distances.py		reproducibility_compute_distances.py
reproducibility_extract_features.py		reproducibility_extract_features.py
reproducibility_statistics.py		reproducibility_statistics.py
reproducibiliy_config.yaml		reproducibiliy_config.yaml
setup.sh		setup.sh