RPA-SCD: Rhythm and Pitch Aware Dual-Branch Network for Songs Conversion Detection

This repo contains code for our paper: RPA-SCD: Rhythm and Pitch Aware Dual-Branch Network for Songs Conversion Detection. Our MDS dataset are available at https://drive.google.com/file/d/1rFsvMYihVtk81uFbL7UpyUEs-qBgsX6H/view?usp=drive_link

Environments

The codebase is developed with Python 3.7. Install requirements as follows:

pip install -r requirements.txt

Train RPA-SCD from scratch

You can train RPA-SCD from scratch as follows.

1. Pack waveforms into hdf5 files

The scripts/2_pack_waveforms_to_hdf5s.sh script is used for packing all raw waveforms into large hdf5 files for speed up training. The packed files looks like:

workspace
└── hdf5s
     ├── indexs
          ├── train.h5
          ├── train_demucs.h5
          └── eval1.h5
          └── eval2.h5
          └── eval3.h5
     └── waveforms (1.1 TB)
          ├── train.h5
          ├── train_demucs.h5
          └── eval1.h5
          └── eval2.h5
          └── eval3.h5

2. Create training indexes

The scripts/3_create_training_indexes.sh is used for creating training indexes. Those indexes are used for sampling mini-batches.

3. Train

The scripts/4_train.sh script contains training, saving checkpoints, and evaluation.

WORKSPACE="your_workspace"
CUDA_VISIBLE_DEVICES=0 python3 pytorch/main.py train \
  --workspace=$WORKSPACE \
  --data_type='balanced_train' \
  --window_size=1024 \
  --hop_size=320 \
  --mel_bins=64 \
  --fmin=50 \
  --fmax=14000 \
  --model_type='Cnn14' \
  --loss_type='clip_bce' \
  --balanced='balanced' \
  --augmentation='mixup' \
  --batch_size=8 \
  --learning_rate=1e-3 \
  --resume_iteration=0 \
  --early_stop=1000000 \
  --cuda

Results

The RPA-SCD model is trained on a single card NVIDIA GeForce RTX 2080 Ti. The training takes around 7-9 hours.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
metadata		metadata
pytorch		pytorch
scripts		scripts
utils		utils
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RPA-SCD: Rhythm and Pitch Aware Dual-Branch Network for Songs Conversion Detection

Environments

Train RPA-SCD from scratch

1. Pack waveforms into hdf5 files

2. Create training indexes

3. Train

Results

About

Releases

Packages

Languages

Samantha-Du/RPA-SCD

Folders and files

Latest commit

History

Repository files navigation

RPA-SCD: Rhythm and Pitch Aware Dual-Branch Network for Songs Conversion Detection

Environments

Train RPA-SCD from scratch

1. Pack waveforms into hdf5 files

2. Create training indexes

3. Train

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages