EFI - Enzyme Similarity Tool

Download Network Files

Submission Name: IPR004184_IP74_UniRef50

Network Name: IPR004184_IP74_UniRef50_Minlen650_AS240

The parameters used for the initial submission and the finalization are summarized in the table below.

Analysis Summary

Analysis Job Number: 35895
Network Name: IPR004184_IP74_UniRef50_Minlen650_AS240
Alignment Score: 240
Minimum Length: 650
Maximum Length: 50,000
Total Number of Sequences After Length Filtering: 520

Dataset Summary

EST Job Number: 29537 (Original Dataset)
Database Version: UniProt: 2019-04 / InterPro: 74
Input Option: Families (Option B)
Job Name: IPR004184_IP74_UniRef50
Pfam / InterPro Family: IPR004184
Number of IDs in Pfam / InterPro Family: 20,232
Domain Option: off
UniRef Version: 50
Number of Cluster IDs in UniRef50 Family: 1,365
Exclude Fragments: No
Total Number of Sequences in Dataset: 1,365
Total Number of Edges: 449,488
Convergence Ratio: 0.483
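
The convergence ratio in the Dataset Summary can be checked against the other reported values. The sketch below assumes the ratio is the number of edges divided by the number of distinct sequence pairs in the dataset; this page does not state the definition, and the function name is hypothetical. Under that assumption the numbers above reproduce the reported value.

```python
def convergence_ratio(num_sequences: int, num_edges: int) -> float:
    """Edges divided by the number of distinct sequence pairs.

    Assumption: this is the definition behind the reported value;
    the page itself does not spell it out.
    """
    possible_pairs = num_sequences * (num_sequences - 1) // 2
    if possible_pairs == 0:
        raise ValueError("need at least two sequences")
    return num_edges / possible_pairs


# Values taken from the Dataset Summary: 1,365 sequences and 449,488 edges.
ratio = convergence_ratio(1365, 449_488)
print(f"{ratio:.3f}")  # prints 0.483
```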

The panels below provide files for full and representative node SSNs for download with the indicated numbers of nodes and edges. As an approximate guide, SSNs with ~2M edges can be opened with 16 GB RAM, ~5M edges can be opened with 32 GB RAM, ~10M edges can be opened with 64 GB RAM, ~20M edges can be opened with 128 GB RAM, ~40M edges can be opened with 256 GB RAM, and ~120M edges can be opened with 768 GB RAM.
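
The memory guide above can be expressed as a small lookup. This is a rough sketch: the tiers are copied from the sentence above, but treating each edge count as a hard upper bound is an assumption (the guide is approximate), and the names `RAM_GUIDE` and `suggested_ram_gb` are hypothetical.

```python
from bisect import bisect_left

# (approximate edge count, RAM in GB) pairs copied from the guide above.
RAM_GUIDE = [
    (2_000_000, 16),
    (5_000_000, 32),
    (10_000_000, 64),
    (20_000_000, 128),
    (40_000_000, 256),
    (120_000_000, 768),
]


def suggested_ram_gb(num_edges):
    """Return the smallest listed RAM tier that covers num_edges.

    Returns None when the network is larger than the largest tier.
    Assumption: each tier is treated as an upper bound on edges.
    """
    limits = [edges for edges, _ in RAM_GUIDE]
    idx = bisect_left(limits, num_edges)
    if idx == len(RAM_GUIDE):
        return None
    return RAM_GUIDE[idx][1]


# The full network on this page has 179 edges, well within the first tier.
print(suggested_ram_gb(179))
```

By this guide, every network offered on this page fits in the smallest tier.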

Files may be transferred to the Genome Neighborhood Tool (GNT), the Color SSN utility, the Cluster Analysis utility, or the Neighborhood Connectivity utility.

Full Network

Each node in the network represents a single protein sequence.

# Nodes # Edges
520 179


Representative Node Networks

In representative node (RepNode) networks, each node represents a collection of proteins grouped according to percent identity. For example, in a 75% identity RepNode network, all connected sequences that share 75% or more identity are grouped into a single node (meta node). Collapsing sequences in this way reduces the overall number of nodes, producing simpler networks that are easier to load in Cytoscape.

The cluster organization is not changed, and the clustering of sequences remains identical to the full network.
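
The grouping described above can be illustrated with a small union-find sketch. It is only an illustration of the idea, not necessarily how EFI-EST builds its RepNode files; the function name, the edge format `(id_a, id_b, percent_identity)`, and the sequence IDs are all hypothetical.

```python
def collapse_to_repnodes(node_ids, edges, min_identity):
    """Merge connected sequences sharing >= min_identity percent identity.

    edges: iterable of (id_a, id_b, percent_identity) tuples.
    Returns (meta_nodes, meta_edges): meta_nodes maps a representative ID
    to its member IDs, and meta_edges is a set of frozenset pairs of
    representatives that remain connected after collapsing.
    """
    parent = {n: n for n in node_ids}

    def find(n):
        # Follow parent links to the representative, compressing the path.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    # Union the endpoints of every edge at or above the identity threshold.
    for a, b, identity in edges:
        if identity >= min_identity:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    meta_nodes = {}
    for n in node_ids:
        meta_nodes.setdefault(find(n), []).append(n)

    # Keep one edge between each pair of distinct meta nodes.
    meta_edges = set()
    for a, b, _ in edges:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            meta_edges.add(frozenset((root_a, root_b)))
    return meta_nodes, meta_edges


# Hypothetical four-sequence cluster collapsed at 75% identity.
nodes = ["A", "B", "C", "D"]
edges = [("A", "B", 80.0), ("B", "C", 60.0), ("C", "D", 90.0)]
meta_nodes, meta_edges = collapse_to_repnodes(nodes, edges, 75.0)
print(len(meta_nodes), len(meta_edges))  # prints 2 1
```

In this toy input the four sequences collapse into two meta nodes that are still joined by an edge, so the cluster stays intact, in line with the statement above that clustering is unchanged.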

% ID # Nodes # Edges
100 520 179
95 519 174
90 519 174
85 518 165
80 517 164
75 517 164
70 516 163
65 516 163
60 516 163
55 516 163
50 515 160
45 511 158
40 496 146


Portions of these data are derived from the Universal Protein Resource (UniProt) databases.

If you use the EFI web tools, please cite us.
