EFI - Chemically-Guided Functional Profiling

This web resource is supported by a Research Resource from the National Institute of General Medical Sciences (R24GM141196-01).
The tools are available without charge or license to both academic and commercial users.
Important Notice

The UniProtKB database used by the EFI tools is undergoing major reorganization starting with the just-released version 2025_04 (https://www.uniprot.org/help/refprot_only_changes). When the reorganization is fully implemented (2026_02 release, Spring 2026), the number of proteins in UniProtKB will decrease from ~253M accessions in the previous 2025_03 release to ~141M accessions in the 2026_02 release.

In response to these changes, we will provide the previous 2025_03 release until the 2026_02 release is available.

The current 2025_04 release removed 82M UniProt IDs; the UniProt pages providing functional annotation for these IDs are no longer active. A new Metadata Tool provides access to the node attribute metadata for all UniProt IDs in the 2025_03 release that the tools continue to use during the UniProtKB reorganization. The Tool is available using the tab at the top of each page.

More information about the reorganization is located here.

Markers Computation Results

Submitted SSN: 27072_26148_IP91_IPR004184_NoFragments_Bacteroidetes_UniRef90_NoFragments_IPR004184_Bacteroidetes_full_ssn_coloredssn

Submission Summary Table

Input filename27072_26148_IP91_IPR004184_NoFragments_Bacteroidetes_UniRef90_NoFragments_IPR004184_Bacteroidetes_full_ssn_coloredssn.zip
Identify ID836
Minimum sequence length650
Identify search typeDIAMOND
Reference databaseUNIREF90
CD-HIT identity for ShortBRED family definition85

Markers that uniquely define clusters in the submitted SSN have been identified.

Files detailing the identities of the markers and which sequences they represent are available for download.

SSN With Marker Identification Results

The SSN submitted has been edited to include the marker ID and type and the number of markers that were identified.

FileSize
SSN with identify results (ZIP)

CGFP Family and Marker Data

The CD-HIT ShortBRED families by cluster file contains mappings of ShortBRED families to SSN cluster number as well as a color that is assigned to each unique ShortBRED family. The ShortBRED marker data file lists the markers that were identified.

FileSize
CD-HIT ShortBRED families by cluster
ShortBRED marker data
Submission is currently disabled due to site maintenance. Submission is currently disabled due to site maintenance.

Click here to contact us for help, reporting issues, or suggestions.