UniProt
Universal Protein Resource. Comprehensive protein sequence and functional annotation database. The canonical reference for protein information.
What It Does
- Protein sequences (reviewed SwissProt + unreviewed TrEMBL)
- Functional annotations: domains, sites, modifications
- Cross-references to 200+ databases (PDB, Pfam, InterPro, GO)
- Ortholog clusters for conservation analysis
- Variant annotations
How to Use
Web
- Go to https://www.uniprot.org
- Search: Q7RTU9 (STRC human) or gene name “STRC”
- Browse: Function, Structure, Disease, Cross-references tabs
REST API
# Get protein entry
curl "https://rest.uniprot.org/uniprotkb/Q7RTU9.json"
# Get FASTA sequence
curl "https://rest.uniprot.org/uniprotkb/Q7RTU9.fasta"
# Search orthologs
curl "https://rest.uniprot.org/uniprotkb/search?query=gene:STRC+AND+reviewed:true&format=json"
Python
import requests
r = requests.get("https://rest.uniprot.org/uniprotkb/Q7RTU9.json")
data = r.json()
print(f"Protein: {data['proteinDescription']['recommendedName']['fullName']['value']}")
print(f"Length: {data['sequence']['length']} aa")
# Stereocilin, 1775 aa
Verified Status
VERIFIED: STRC = Q7RTU9, Stereocilin, 1775 amino acids (mature chain 23-1775 after signal peptide cleavage). E1659 falls within the main chain, with no domain or site annotation at that position; it sits in the core scaffolding region. Per UniProt, E1659 is therefore not in a known binding site or catalytic domain, but its high conservation (100% across mammals) and AlphaMissense score (0.9016) suggest functional importance that is not yet annotated. 9 ortholog sequences were retrieved for conservation analysis.
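The position check described above can be sketched in Python. This is a minimal illustration, assuming the parsed entry JSON carries a `features` list whose items have a `type`, a `description`, and a `location` with `start.value` and `end.value`. The helper name `features_at` and the `sample_entry` dictionary are hypothetical; the sample is hand-written to mirror the signal peptide and chain boundaries quoted in this note and is not real API output.

```python
def features_at(entry, position):
    """Return (type, start, end, description) for every feature covering `position`.

    `entry` is assumed to be a parsed UniProtKB entry (a dict) with a
    "features" list; the field layout used here is an assumption.
    """
    hits = []
    for feat in entry.get("features", []):
        loc = feat.get("location", {})
        start = loc.get("start", {}).get("value")
        end = loc.get("end", {}).get("value")
        if start is None or end is None:
            continue  # skip features without usable coordinates
        if start <= position <= end:
            hits.append((feat.get("type"), start, end, feat.get("description", "")))
    return hits


# Hand-written stand-in for a parsed entry (illustrative only).
sample_entry = {
    "features": [
        {"type": "Signal", "description": "",
         "location": {"start": {"value": 1}, "end": {"value": 22}}},
        {"type": "Chain", "description": "Stereocilin",
         "location": {"start": {"value": 23}, "end": {"value": 1775}}},
    ]
}

# Only the chain feature spans residue 1659 in this sample.
print(features_at(sample_entry, 1659))
```

To run this against the live entry, pass the dictionary returned by `r.json()` from the Python snippet in the How to Use section instead of `sample_entry`.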
STRC Research Usage
- STRC Gene — canonical sequence reference (Q7RTU9)
- STRC Cross-Species Conservation Analysis — ortholog sequences from UniProt
- STRC Signal Peptide Validation — signal peptide annotation
- STRC GPI-Anchor Analysis — GPI anchor prediction from annotations
Results (April 2026)
- Domain architecture: mature chain residues 23-1775 (after signal peptide). No specific domain annotation at E1659 — it’s in the main scaffolding body.
- 14 N-linked glycosylation sites across the protein
- Natural variant rs2920791 at position 1640 (near E1659) — context for the region
- Still untapped: IntAct interactome, detailed PTM mapping around E1659
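The counts and neighbourhood checks in these results could be reproduced with a short sketch like the one below. It assumes the same `features` layout as the entry JSON and the feature type labels `Glycosylation` and `Natural variant`; the helper names `count_features` and `features_near`, the 25-residue window, and the glycosylation positions in `sample_entry` are placeholders chosen for illustration, not values taken from UniProt.

```python
from collections import Counter


def count_features(entry):
    """Count features per type in a parsed entry dict (assumed layout)."""
    return Counter(f.get("type", "unknown") for f in entry.get("features", []))


def features_near(entry, position, window=25):
    """Return (type, start, end) for features within `window` residues of `position`."""
    hits = []
    for feat in entry.get("features", []):
        loc = feat.get("location", {})
        start = loc.get("start", {}).get("value")
        end = loc.get("end", {}).get("value")
        if start is None or end is None:
            continue  # skip features without usable coordinates
        # Overlap test between [start, end] and [position - window, position + window]
        if start <= position + window and end >= position - window:
            hits.append((feat.get("type"), start, end))
    return hits


# Hand-written sample; the glycosylation positions are placeholders.
sample_entry = {
    "features": [
        {"type": "Glycosylation",
         "location": {"start": {"value": 100}, "end": {"value": 100}}},
        {"type": "Glycosylation",
         "location": {"start": {"value": 200}, "end": {"value": 200}}},
        {"type": "Natural variant",
         "location": {"start": {"value": 1640}, "end": {"value": 1640}}},
    ]
}

print(count_features(sample_entry)["Glycosylation"])
print(features_near(sample_entry, 1659))
```

Widening `window` trades specificity for context: a larger value also picks up long features such as the mature chain, so filtering on `type` afterwards may be useful.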
Connections
- AlphaFold Database [depends-on] — uses UniProt IDs
- Clustal Omega [see-also] — for aligning UniProt sequences
- OrthoDB [see-also] — ortholog clusters
- STRC Gene [about]
- STRC Cross-Species Conservation Analysis [used-in]
STRC E1659A Results
STRC: Q7RTU9, Stereocilin, 1775 aa. Precursor protein. Tested: 2026-04-08