To capture and capitalize on genetic variation, the field of comparative genomics is shifting from reference-based to pangenomic approaches. The main aim of this project was to improve the scalability of our pangenomics software, PanTools, for large-scale applications in plant sciences and biotechnology. We envisioned improvements in the data representation, in the construction and annotation algorithms, and in the use of novel technologies such as Apache Spark. In addition, we aimed to improve our development practices and make the code base more sustainable by reorganizing and refactoring the code, writing proper unit tests, and improving the documentation.