aaanalysis.SequencePreprocessor

class aaanalysis.SequencePreprocessor[source]

Bases: object

Utility preprocessing class to encode and represent protein sequences [Breimann25a].

Added in version 1.0.0.

Methods

encode_integer([list_seq, alphabet, gap, pad_at])

Integer-encode a list of protein sequences into a feature matrix.

encode_one_hot([list_seq, alphabet, gap, pad_at])

One-hot-encode a list of protein sequences into a feature matrix.

get_aa_window([seq, pos_start, pos_stop, ...])

Extracts a window of amino acids from a sequence.

get_sliding_aa_window([seq, slide_start, ...])

Extract sliding windows of amino acids from a sequence.