Tutorials
Getting Started
The A minimal CPP analysis notebook is the shortest complete loop — load a dataset, run CPP, read out the signature — and pairs with the Prediction tasks concept page. For a fuller introduction, explore our Quick start and Slow start tutorials, both offering the same examples with the latter explaining the conceptual background. The Plotting Prelude tutorial can help you to create publication-ready plots.
Data Handling
Learn how to load protein benchmarking datasets and amino acid scale sets in the Data Loader and Scale Loader tutorials.
Feature Engineering
Explore interpretable feature engineering, the core of AAanalysis, with the AAclust, SequenceFeature,
and CPP tutorials, then see how CPP turns different data representations (scales, embeddings, structure)
into features. Because SequenceFeature.feature_matrix returns a plain numeric matrix, these features
drop directly into a stock scikit-learn Pipeline — the prediction protocol demonstrates this end to end.
PU Learning
Start positive-Unlabeled (PU) learning to tackle unbalanced and small data through our dPULearn tutorial.
Explainable AI
Explaining sample level predictions at single-residue resolution is introduced in our ShapModel tutorial.