AAclust: Selecting Redundancy-Reduced Scale Sets

AAclust (Amino Acid clustering) is a clustering wrapper framework for selecting redundancy-reduced sets of amino acid scales, introduced in [Breimann24a]. Using Pearson correlation, AAclust optimizes the number of clusters (k) and selects one representative scale per cluster, as illustrated in the figure below:

AAclust algorithm for clustering amino acid scales.

Scheme of AAclust algorithm with clustering of amino acid scales and scale selection, from [Breimann24a].

AAclust introduces two modes for defining the number of clusters (k):

  • k-optimized: AAclust automatically optimizes k, streamlining the scale selection process.

  • k-based: The user specifies k, allowing for custom configurations.

The distinctions between these modes and their respective AAclust options are depicted below:

Operational modes of AAclust.

Operational modes of AAclust for determining the number of clusters, from [Breimann24a].

Generally, AAclust works not only with amino acid scales sets but also with any set of numerical scales.

See our AAclust Tutorial for hands-on examples.