The Data File Download tab provides the Color SSN with the nodes colored according to node.fillColor (Cluster Sequence Count).
Six node attributes were added to the input SSN: Cluster Sequence Count, Sequence Count Cluster Number, Cluster Node Count, Node Count Cluster Number, node.fillColor (according to Cluster Sequence Count, hexadecimal), and Node Count Fill Color (according to Cluster Node Count, hexadecimal).
To change the node colors in Cytoscape to Node Count Fill Color: 1) select all nodes; 2) on the Style Panel, click on the "?" in the Fill Color Property; 3) select "Remove Bypass"; 4) deselect the nodes (default node color); and 5) open the Fill Color Property and select "Node Count Fill Color" as the Column and "Passthrough Mapping" as the Mapping Type. The nodes will be recolored.
The Data File Download tab also provides files for 1) UniProt ID-Color-Cluster Number mapping table, 2) ID Lists and FASTA Files for each cluster, 3) cluster sizes, and 4) SwissProt annotations for clusters and singletons. The number of UniRef/UniProt IDs for each cluster is displayed in the WebLogos, HMMs, and Length Histograms tabs.
The WebLogos tab provides the WebLogo (generated using http://weblogo.threeplusone.com/) and MSA (generated using MUSCLE) for the node IDs in each SSN cluster containing at least the specified "Minimum Node Count". The MSA can be viewed with Jalview (https://www.jalview.org/). This tab also provides the percent identity matrix for the multiple sequence alignment, as computed by Clustal-Omega.
The Consensus Residues tab provides a tab-delimited text file with the number of the conserved residues and their MSA positions for each specified residue in each SSN cluster (numbered by Cluster Sequence Count) containing at least the specified "Minimum Node Count".
The HMMs tab provides the HMM for each SSN cluster containing at least the specified "Minimum Node Count". The Skylign download provides the image of the HMM generated from the MSA (https://skylign.org/). The HMM text file can be viewed interactively by uploading to https://skylign.org/ and selecting "Information Content – Above Background"; the probability of each amino acid residue and probability and length of an insert at each position is provided. The p
The Length Histograms tab provides length histograms for each cluster containing at least the specified "Minimum Node Count".
Job Number | 26236 |
Input Option | Cluster Analysis |
Uploaded Filename | 26123_IP91_IPR004184_UniRef90_NoFragments_Fungi_Minlen650_AS240_PAPBPFPu_full_ssn.xgmml |
Database Version | UniProt: 2022-04 / InterPro: 91 |
Analysis Options | Weblogo, HMM, Consensus Residue, Length Histogram (AAs=C; Thresholds=0.9,0.8,0.7,0.6,0.5,0.4,0.3,0.2,0.1; ) |
Number of SSN clusters | 2 |
Number of SSN singletons | 1 |
SSN sequence source | UniRef90 |
Number of SSN (meta)nodes | 28 |
Number of accession IDs in SSN | 113 |
Each cluster in the submitted SSN has been identified and assigned a unique number and color.
If the WebLogo is missing for Node Cluster 1 (and additional clusters with large numbers of nodes), repeat the job with a "Maximum Node Count" in the Sequence Filter input window. MUSCLE can fail with a "large" number of sequences (variable, anywhere from >750 to >1500).
WebLogos for FASTA UniProt cluster (full length sequences) | 1 MB |
Percent Identity Matrix for FASTA UniProt cluster (full length sequences) | <1 MB |
If the HMM is missing for Node Cluster 1 (and additional clusters with large numbers of nodes), repeat the job with a "Maximum Node Count" in the Sequence Filter input window. MUSCLE can fail with a "large" number of sequences (variable, anywhere from >750 to >1500).
HMMs for FASTA UniProt cluster (full length sequences) | 2 MB |
Click here to contact us for help, reporting issues, or suggestions.