Skip to content

Source code for paper "Maximizing diversity over clustered data" in SDM 2020.

Notifications You must be signed in to change notification settings

Guangyi-Zhang/clustered-max-diversity

Repository files navigation

Maximizing diversity over clustered data

This repo hosts the code for the paper "G. Zhang AND A. Gionis, Maximizing diversity over clustered data, 2020".

The main algorithms are in intra.py, and metrics are run by Jupyter notebooks that start with metric-. The repo adopts sacred and incense to manage experiment results. Therefore you will need a Mongo DB server to run experiments.

External realistic data can be downloaded from links below:

About

Source code for paper "Maximizing diversity over clustered data" in SDM 2020.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published