aaanalysis.AnnotationPreprocessor.fetch_uniprot

AnnotationPreprocessor.fetch_uniprot(df_seq=None, features=None, evidence='manual', timeout=30.0, verbose=None)[source]

Fetch UniProt features for every entry and map to df_annot.

Parameters:
  • df_seq (pd.DataFrame, shape (n_samples, n_seq_info)) – DataFrame containing an entry column with unique protein identifiers (UniProt accessions). The entry values are used as the UniProtKB accessions to fetch.

  • features (list of str, optional) – Registry keys to keep (e.g. ['phospho', 'disulfide']). None keeps every built-in key.

  • evidence ({'experimental', 'manual', 'all'}, default='manual') – Evidence allow-set. 'experimental' keeps only ECO:0000269; 'manual' also keeps ECO:0007744 (combinatorial, manual); 'all' disables evidence filtering. Raw ECO codes are retained in the evidence column regardless.

  • timeout (float, default=30.0) – Per-request timeout in seconds.

  • verbose (bool, optional) – Override instance verbosity for this call only.

Returns:

df_annot – Canonical per-residue annotation schema (see class Notes).

Return type:

pd.DataFrame

Raises: