CleanX is an open-source Python library for exploring, cleaning, and augmenting large datasets of X-rays or certain other types of radiological images.


What cleanX can do for you

  • Includes workflow demos as Jupyter notebooks
  • Supports data augmentation
  • Processes metadata from CSV, JSON, or other formats
  • Provides a command-line interface
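
To illustrate the kind of metadata handling listed above, here is a minimal sketch that reads image metadata from CSV or JSON text into a common list-of-records form. It uses only the Python standard library and is not cleanX's own API; the function name `load_metadata` and the `image` and `view` column names are assumptions made for this example.

```python
import csv
import io
import json


def load_metadata(text, fmt):
    """Parse image metadata from CSV or JSON text into a list of dicts.

    Hypothetical helper for illustration; not part of cleanX.
    """
    if fmt == "csv":
        # DictReader uses the first row as the field names.
        return list(csv.DictReader(io.StringIO(text)))
    if fmt == "json":
        records = json.loads(text)
        if not isinstance(records, list):
            raise ValueError("expected a JSON array of records")
        return records
    raise ValueError(f"unsupported format: {fmt}")


# Made-up metadata; the column names are illustrative only.
csv_text = "image,view\nscan_001.png,PA\nscan_002.png,lateral\n"
json_text = '[{"image": "scan_001.png", "view": "PA"}]'

csv_rows = load_metadata(csv_text, "csv")
json_rows = load_metadata(json_text, "json")
```

Normalising both formats to the same record shape means later exploration steps, such as counting images per view, do not need to know where the metadata came from.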

Images can be extracted from DICOM files or used directly. CleanX supports many of the data exploration and preprocessing steps needed to prepare images for machine learning algorithms. The primary authors are Candace Makeda H. Moore, Oleg Sivokon, and Andrew Murphy.

Programming languages
  • Jupyter Notebook 98%
  • Python 2%

License
  • GPL-3.0

Participating organisations

Netherlands eScience Center


Candace Makeda Moore
Netherlands eScience Center