aaanalysis.AnnotationPreprocessor.fetch_uniprot
- AnnotationPreprocessor.fetch_uniprot(df_seq=None, features=None, evidence='manual', timeout=30.0, verbose=None)[source]
Fetch UniProt features for every entry and map to
df_annot.- Parameters:
df_seq (pd.DataFrame, shape (n_samples, n_seq_info)) – DataFrame containing an
entrycolumn with unique protein identifiers (UniProt accessions). Theentryvalues are used as the UniProtKB accessions to fetch.features (list of str, optional) – Registry keys to keep (e.g.
['phospho', 'disulfide']).Nonekeeps every built-in key.evidence ({'experimental', 'manual', 'all'}, default='manual') – Evidence allow-set.
'experimental'keeps only ECO:0000269;'manual'also keeps ECO:0007744 (combinatorial, manual);'all'disables evidence filtering. Raw ECO codes are retained in theevidencecolumn regardless.timeout (float, default=30.0) – Per-request timeout in seconds.
verbose (bool, optional) – Override instance verbosity for this call only.
- Returns:
df_annot – Canonical per-residue annotation schema (see class Notes).
- Return type:
pd.DataFrame
- Raises:
ValueError – On invalid arguments.
RuntimeError – On UniProt network / response failure.