EFI - Enzyme Similarity Tool
This web resource is supported by a Research Resource from the National Institute of General Medical Sciences (R24GM141196-01).
The tools are available without charge or license to both academic and commercial users.
RadicalSAM.org, our resource for investigating sequence-function space in the radical SAM superfamily, has been updated with sequences from UniProt Release 2024_01 and InterPro Release 98 (January 24, 2024).
https://radicalsam.org
Dataset Completed
Submission Name: IPR004184_IP74_UniProt
A minimum sequence similarity threshold is needed to generate the SSN: it
specifies which sequence pairs are connected by edges and therefore how the
proteins segregate into clusters. The threshold is applied to the edges in the
SSN using the alignment score, an edge attribute that measures the similarity
between the two sequences in a pair.
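The effect of the threshold on clustering can be illustrated with a minimal Python sketch. It is not the EFI-EST implementation: the function name `cluster_by_threshold`, the toy node IDs, and the scores are hypothetical, and the sketch assumes an edge is kept when its alignment score is greater than or equal to the threshold.

```python
from collections import defaultdict

def cluster_by_threshold(nodes, edges, min_score):
    """Keep edges whose alignment score is >= min_score (assumed rule)
    and return the resulting connected components (clusters)."""
    parent = {n: n for n in nodes}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, score in edges:
        if score >= min_score:
            ra, rb = find(a), find(b)
            if ra != rb:
                parent[rb] = ra

    clusters = defaultdict(set)
    for n in nodes:
        clusters[find(n)].add(n)
    # Largest clusters first, ties broken alphabetically.
    return sorted((sorted(c) for c in clusters.values()),
                  key=lambda c: (-len(c), c))

# Hypothetical toy data: (sequence A, sequence B, alignment score).
nodes = ["A", "B", "C", "D"]
edges = [("A", "B", 80), ("B", "C", 40), ("C", "D", 75)]

low = cluster_by_threshold(nodes, edges, 30)   # permissive threshold
high = cluster_by_threshold(nodes, edges, 60)  # stricter threshold
print(low)
print(high)
```

In this toy case the permissive threshold leaves all four sequences in one cluster, while the stricter threshold removes the B-C edge and splits the network into two clusters.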
The parameters for generating the initial dataset are summarized in the table.
Job Number | 29535 |
Database Version | UniProt: 2019-04 / InterPro: 74 |
Input Option | Families (Option B) |
Job Name | IPR004184_IP74_UniProt |
E-Value for SSN Edge Calculation | 5 |
Pfam / InterPro Family | IPR004184 |
Number of IDs in Pfam / InterPro Family | 20,232 |
Fraction Option | off |
Domain Option | off |
Exclude Fragments | No |
Total Number of Sequences in Dataset | 20,232 |
Total Number of Edges | 166,127,802 |
Convergence Ratio | 0.812 |
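The convergence ratio in the table can be checked against the other values with a short Python sketch. It assumes the ratio is the number of edges divided by the number of possible unordered sequence pairs, N(N-1)/2; that definition is not stated on this page, and the function name `convergence_ratio` is hypothetical.

```python
def convergence_ratio(num_sequences, num_edges):
    """Edges found divided by all possible unordered sequence pairs
    (assumed definition, not stated on this page)."""
    possible_pairs = num_sequences * (num_sequences - 1) // 2
    return num_edges / possible_pairs

# Values taken from the table above.
ratio = convergence_ratio(20_232, 166_127_802)
print(round(ratio, 3))  # 0.812
```

Under this assumption, 20,232 sequences give 204,656,796 possible pairs, and 166,127,802 edges correspond to a ratio of about 0.812, which agrees with the reported value.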
This tab provides histograms and box plots with statistics about the sequences
in the input dataset as well as the BLAST all-by-all pairwise comparisons that
were computed.
The descriptions for the histograms and plots guide the choice of the values
for the "Alignment Score Threshold" and the Minimum and Maximum "Sequence
Length Restrictions" that are applied to the sequences and edges to generate
the SSN.
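As a rough illustration of how the sequence length restrictions act on both sequences and edges, the following Python sketch drops sequences outside a length window and then drops any edge that touches a removed sequence. The function name `apply_length_restrictions`, the toy IDs, lengths, and scores are hypothetical, and inclusive bounds are an assumption.

```python
def apply_length_restrictions(lengths, edges, min_len=None, max_len=None):
    """Return the sequence IDs within [min_len, max_len] (inclusive, assumed)
    and the edges whose two endpoints both survive the filter."""
    kept = {
        seq_id
        for seq_id, n in lengths.items()
        if (min_len is None or n >= min_len)
        and (max_len is None or n <= max_len)
    }
    kept_edges = [(a, b, s) for a, b, s in edges if a in kept and b in kept]
    return kept, kept_edges

# Hypothetical toy data: sequence lengths and (A, B, alignment score) edges.
lengths = {"A": 310, "B": 295, "C": 120, "D": 900}
edges = [("A", "B", 80), ("B", "C", 40), ("C", "D", 75)]

kept, kept_edges = apply_length_restrictions(lengths, edges,
                                             min_len=200, max_len=600)
print(sorted(kept))
print(kept_edges)
```

Here the short sequence C and the long sequence D are excluded, so only the A-B edge remains.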
Portions of these data are derived from the Universal Protein Resource (UniProt) databases.