This web resource is supported by a Research Resource from the National Institute of General Medical Sciences (R24GM141196-01).
The tools are available without charge or license to both academic and commercial users.
Important Notice
The UniProtKB database used by the EFI tools is undergoing major reorganization
starting with the just-released version 2025_04
(https://www.uniprot.org/help/refprot_only_changes).
When the reorganization is
fully implemented (2026_02 release, Spring 2026), the number of proteins in
UniProtKB will decrease from ~253M accessions in the previous 2025_03 release
to ~141M accessions in the 2026_02 release.
In response to these changes, we will provide the previous 2025_03 release
until the 2026_02 release is available.
The current 2025_04 release removed 82M UniProt IDs; the UniProt pages
providing functional annotation for these IDs are no longer active. A new
Metadata Tool
provides access to the node attribute metadata for all UniProt
IDs in the 2025_03 release that the tools continue to use during the UniProtKB
reorganization. The Tool is available using the tab at the top of each page.
More information about the reorganization is located here.
Markers that uniquely define clusters in the submitted SSN have been identified.
Files detailing the identities of the markers and which sequences they represent are available for download.
SSN With Marker Identification Results
The SSN submitted has been edited to include the marker ID and type
and the number of markers that were identified.
File
Size
SSN with identify results (ZIP)
CGFP Family and Marker Data
The CD-HIT ShortBRED families by cluster file contains mappings of ShortBRED
families to SSN cluster number as well as a color that is assigned
to each unique ShortBRED family. The ShortBRED marker data file
lists the markers that were identified.
File
Size
CD-HIT ShortBRED families by cluster
ShortBRED marker data
Submission is currently disabled due to site maintenance.
Submission is currently disabled due to site maintenance.