Welcome to EnGens documentation!

EnGens is a Python library for anyone interested in studying protein structures and their flexibility. EnGens provides a computational framework for generation and analysis of representative protein conformational ensembles.

EnGens Big Logo

With EnGens you will be able to process your datasets of protein structure both:

  • static (gathered from the PDB databank or modelled)

  • dynamic (generated by MD simulation)

EnGens will analyze your dataset and extract a representative ensemble with the following steps:

  1. Data loading and preprocessing

  2. Data featurization

  3. Dimensionality reduction

  4. Clustering and ensemble extraction

  5. Visualization and further analysis

You can use the resulting ensemble for downstram tasks such as docking, structural drug design or other analysis!

Contents

Check out the Installation instructions section for further information regarding the Installation of the project. Check out the Workflow Description section for detailed description of the workflows. Finally, check out EnGens Usage for the instructions on how to use EnGens.

Acknowledgments

This work was built up on a large amount of community effort including the following tools:

We would like to thank the authors and the maintainers of the tools!