Code and data for paper "ApisTox: a new benchmark dataset for the classification of small molecules toxicity on honey bees"

j-adamczyk/ApisTox_dataset

ApisTox - dataset of pesticide toxicity to honey bees

This repository hosts the ApisTox dataset, intended for data analysis and ML applications in ecotoxicology and agrochemistry.

The paper is freely available (open access) in Scientific Data, and a preprint is available on arXiv.

The dataset and code are released under the CC-BY-NC-4.0 license.

Dataset files

The final dataset file is outputs/dataset_final.csv. For dataset splits, see the outputs/splits directory.

Raw input data is in the raw_data directory. Other datasets from this area are in the other_sources directory (we do not recommend using them).
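As a quick way to inspect the final dataset file, the following is a minimal sketch using only the Python standard library. The helper name load_rows is hypothetical and not part of this repository, and the sketch assumes only that outputs/dataset_final.csv is a comma-separated file with a header row. It makes no assumption about the column names.

```python
import csv
from pathlib import Path


def load_rows(path="outputs/dataset_final.csv"):
    """Read a CSV file and return (column_names, rows).

    Hypothetical helper: assumes a comma-separated file with a header row.
    Every value is returned as a string; no schema is assumed.
    """
    csv_path = Path(path)
    if not csv_path.is_file():
        raise FileNotFoundError(
            f"{csv_path} not found; run this from the repository root"
        )
    with csv_path.open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        rows = list(reader)
        columns = list(reader.fieldnames or [])
    return columns, rows
```

From the repository root, calling columns, rows = load_rows() and printing columns and len(rows) lists the available fields and the number of records.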

Reproducing dataset

Set up a virtual environment:

  • Poetry (recommended): run make install or poetry install --no-root
  • venv: run pip install -r requirements.txt

Scripts:

  • recreate dataset: python create_dataset.py
  • split dataset: python create_dataset_splits.py
  • create analyses and plots: python analyze_dataset.py
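The three scripts above can also be chained from a single Python entry point. This is a minimal sketch, assuming the scripts are run from the repository root in the order listed and that each one exits with a non-zero status on failure. The names PIPELINE_SCRIPTS, build_commands, and run_pipeline are hypothetical and not part of this repository.

```python
import subprocess
import sys

# Script names as listed above; the run order is an assumption.
PIPELINE_SCRIPTS = (
    "create_dataset.py",
    "create_dataset_splits.py",
    "analyze_dataset.py",
)


def build_commands(python=sys.executable, scripts=PIPELINE_SCRIPTS):
    """Return one argument list per script, using the given interpreter."""
    return [[python, script] for script in scripts]


def run_pipeline(repo_root=".", runner=subprocess.run):
    """Run each script in order; stop at the first failure.

    With the default runner, check=True raises CalledProcessError
    when a script returns a non-zero exit code.
    """
    for command in build_commands():
        runner(command, cwd=repo_root, check=True)
```

Calling run_pipeline() from the repository root, inside the virtual environment set up earlier, runs the three steps in sequence. The runner parameter exists only so the call can be replaced in tests.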