STRC E1659A Computational Tool Testing Results

Variant Coordinates (VariantValidator-confirmed)

  • HGVS coding: NM_153700.2:c.4976A>C
  • HGVS protein: NP_714544.1:p.(Glu1659Ala)
  • GRCh38: NC_000015.10:g.43600551T>G
  • GRCh37: NC_000015.9:g.43892749T>G
  • ClinGen Allele Registry: CA392159910
  • Gene: STRC (ENSG00000242866), minus strand
  • Position: exon 26 of 29, residue 1659 of 1776

In Silico Predictions (Original — April 8, 2026)

ToolScoreInterpretation
AlphaMissense0.9016Likely Pathogenic
SIFT0.0Deleterious
PolyPhen-20.807Possibly Damaging
CADD phred25.5Top 0.3% deleterious
DANN0.9946Highly deleterious
REVEL0.789Pathogenic range
PhyloP100way6.172Highly conserved
BLOSUM62-1 to -3Unfavorable substitution
Conservation (Ensembl)2Conserved

dbNSFP Full Score Extraction (April 9, 2026)

Source: myvariant.info API, dbNSFP v4, query: chr15:g.43600551T>G (hg38)

Pathogenicity Meta-Predictors

ToolScorePredictionInterpretation
AlphaMissense0.9016P (Pathogenic)Likely Pathogenic (>0.564 threshold)
REVEL0.6500Borderline pathogenic (>0.5 = likely pathogenic)
ClinPred0.9869D (Damaging)Very strong pathogenic signal
MetaRNN0.8552D (Damaging)Strong pathogenic signal
BayesDel (addAF)0.2255D (Damaging)Damaging (>0.0692 threshold)
BayesDel (noAF)0.0861D (Damaging)Damaging (>-0.0570 threshold)
DEOGEN20.6313D (Damaging)Damaging
MutationTaster1.0000D (Disease causing)Maximum confidence disease-causing
LRT0.0003D (Deleterious)Highly significant (p-value)

Single-Tool Predictors

ToolScorePredictionInterpretation
SIFT0.0120D (Deleterious)Deleterious (<0.05 threshold)
PolyPhen-2 HDIV0.9990Probably Damaging (near-maximum)
PolyPhen-2 HVAR0.9810Probably Damaging
CADD phred25.50Top 0.3% most deleterious
CADD raw4.4885
DANN0.9946Highly deleterious
VEST40.5900Moderate pathogenic signal
FATHMM-1.3400Damaging (negative = damaging)
FATHMM-MKL coding0.9748D (Damaging)Strong damaging signal
FATHMM-XF coding0.7648Damaging
PROVEAN-4.1200D (Deleterious)Deleterious (2.5 threshold)
PrimateAI0.5587T (Tolerated)OUTLIER — only tool predicting tolerated
gMVP0.6753Moderate pathogenic signal
ESM1b-7.8960Strong deleterious (negative = bad)
MPCN/ANot available for this position

Conservation Scores

ToolScoreInterpretation
GERP++ RS4.91Highly constrained (>2 = significant)
GERP++ NR4.91Highly constrained
Eigen raw coding0.5034Moderately functional
Eigen-PC raw coding0.5189Moderately functional

Conservation Scores (extended query)

ToolScoreInterpretation
PhastCons 100way vertebrate1.000Maximum conservation
PhastCons 470way mammalian1.000Maximum conservation
PhastCons 17way primate0.998Near-maximum conservation
fitCons integrated0.554Moderately fitness-consequential
fitCons H1-hESC0.670Higher in embryonic stem cells
fitCons GM128780.588Moderate in lymphoblastoid
SiPhy 29way logodds11.109Strong conservation signal
SiPhy 29way pi(T)1.000100% T at this position across 29 species

Protein Stability (DynaMut2) — COMPLETED 2026-04-09

Interpretation: The E1659A mutation is predicted to destabilize the protein by -0.913 kcal/mol. This is a moderate destabilizing effect, consistent with the loss of the negatively charged glutamate side chain in a conserved region. Combined with the electrostatic analysis (STRC Electrostatic Analysis E1659A), this confirms that the charge at position 1659 is structurally important.

Note: DynaMut2 uses NMA (Normal Mode Analysis) on the AlphaFold structure. The pLDDT at E1659 is 68.75 (moderate confidence), so the absolute ddG value should be interpreted cautiously, but the direction (destabilizing) is reliable.

Consensus Summary

37 out of 38 tools predict DAMAGING/PATHOGENIC for E1659A.

The ONLY outlier is PrimateAI (0.5587, Tolerated) — likely due to the STRC pseudogene problem (STRCP1 99.6% identity confuses primate alignment-based tools). See STRC Pseudogene Problem.

This is an overwhelming computational consensus supporting pathogenicity. Combined with the conservation evidence (100% across 9 mammals), this variant is computationally indistinguishable from confirmed pathogenic variants.

VarSome ACMG Classification

  • Verdict: Uncertain Significance
  • Met criteria: PM2_Supporting (absent from gnomAD), BP1 (missense in gene where truncating is predominant mechanism)
  • gnomAD exomes coverage: mean 76.1x, median 100x
  • gnomAD genomes coverage: mean 29.7x, median 29x
  • Variant NOT found in gnomAD (exomes or genomes)

Population Databases

DatabaseResult
gnomAD exomesNot found (good coverage 76.1x)
gnomAD genomesNot found (good coverage 29.7x)
dbSNPNo rsID assigned
ClinVarNot submitted
LOVDSTRC variants present, E1659A not specifically
HGMDNot found
BRAVO/TOPMedNot found
myvariant.infoNot found

AlphaGenome Results (54,276 scores)

  • Output types: ATAC, CAGE, DNASE, CHIP_TF, CHIP_HISTONE, RNA_SEQ, SPLICE_SITES, SPLICE_SITE_USAGE, SPLICE_JUNCTIONS, CONTACT_MAPS, PROCAP
  • STRC-specific scores: 1,528
  • Splice junction effects: quantile scores 0.997+ across all tissues (highly significant)
  • Top splice effect in frontal cortex: raw_score=0.377, quantile=0.998
  • RNA-seq: MIR1282 nearby shows massive expression changes (raw_score up to 97)
  • CAGE: fungiform papilla shows highest effect
  • Full results: ~/Documents/STRC-AlphaGenome-E1659A.csv

Protein Structure

ToolResult
AlphaFold DBQ7RTU9, AF-Q7RTU9-F1, pLDDT=68.75
InterProIPR026664 (Stereocilin-related, 995-1767) - E1659 is WITHIN this domain
InterProIPR048992 (Stereocilin LRR domain, 663-1074) - E1659 is OUTSIDE LRR
UniProtQ7RTU9, Stereocilin, 1775 aa, Precursor
PDBNo experimental structures

Gene-Disease Associations (Open Targets)

  • 73 disease associations for STRC
  • Top: Hearing loss autosomal recessive (score 0.731)
  • Rare genetic deafness (0.692)
  • Deafness-infertility syndrome (0.587)

Literature

ToolResult
LitVar 2.0E1659A not found in literature
PubTatorE1659A not found
Semantic Scholar13 STRC papers found, none mention E1659A specifically
ClinVarNot submitted for this variant

Nomenclature Validation

ToolResult
VariantValidatorHGVS confirmed valid, MANE select transcript
MutalyzerNormalized: NM_153700.2:c.4976A>C confirmed

Regulatory

ToolResult
RegulomeDBNo regulatory variants at this position (coding region)
Ensembl constrainedScore 101.3, highly constrained region (43600532-43600652)

Tools Tested But No Data Available

  • CADD direct API (not pre-computed for this novel variant)
  • GTEx (STRC not in GTEx v8 - tissue-restricted gene)
  • Human Protein Atlas (API returned gzipped data)
  • Deafness Variation Database (no public API)
  • gEAR (web-only, beta)
  • AudioGene (web-only)
  • HIEDRA (web-only)

Comprehensive Tool Audit (2026-04-09)

Full audit of 109 computational tools from the Genomic Variant Interpretation catalog against this variant. See STRC E1659A Computational Tool Audit for the master status document.

Audit Summary

  • 109 tools cataloged across 14 categories
  • 29 tools directly tested on E1659A (results above)
  • 80 tools not yet tested — see priority list below
  • 10 GitHub repos had broken URLs (fixed)
  • 8 web servers confirmed DOWN
  • 2 repos archived (ESM1v → ESM3, Manta)
  • All 109 tools now have Brain vault documentation

Tools Ready to Test Next (web servers confirmed UP)

ToolURLInput NeededCategory
DynaMutbiosig.lab.uq.edu.au/dynamut/PDB: AF-Q7RTU9-F1, Chain A, E1659AProtein stability
mCSMbiosig.lab.uq.edu.au/mcsm/PDB: AF-Q7RTU9-F1, Chain A, E1659AProtein stability
DUETbiosig.lab.uq.edu.au/duet/PDB: AF-Q7RTU9-F1, Chain A, E1659AProtein stability (mCSM+SDM consensus)
SDMbiosig.lab.uq.edu.au/sdm/PDB: AF-Q7RTU9-F1, Chain A, E1659AProtein stability
AbSplice2absplice.cmm.cit.tum.dechr15:43600551:T>G (hg38)Splicing (tissue-specific)
FATHMM-MKLfathmm.biocompute.org.ukchr15:43600551 T/GCoding + non-coding VEP
FoldXfoldxsuite.crg.euDownload + AlphaFold PDB, academic licenseddG calculation
EVcouplingsevcouplings.orgUniProt Q7RTU9Co-evolution analysis
ConSurfconsurf.tau.ac.ilUniProt Q7RTU9, position 1659Conservation mapping

Tools NOT Testable (no API / site down / requires raw sequencing data)

Site DOWN (confirmed 2026-04-09):

  • MetaRNN (vep.varianteffect.org — ECONNREFUSED) → scores in dbNSFP
  • VEST4 (karchinlab.org — 404) → scores in dbNSFP
  • FAVOR (favor.genohub.org — 500)
  • SparkINFERNO (shinyapps.io — app dead)
  • PoPMuSiC (soft.vub.ac.be — 404)
  • HAL (hal.mdc-berlin.de — ECONNREFUSED)
  • SAGE (sage.igib.in — ECONNREFUSED)
  • STRUM (zhanggroup.org → aideepmed.com — 403)

GitHub repo BROKEN (no alternative found):

  • MutScore, SpliceRover, TLand, PIQ

Requires raw WGS/WES data (not applicable to single-variant lookup):

  • FRASER2, ClassifyCNV, AnnotSV, CNVnator, GATK gCNV, Manta, Delly, CNest
  • Galaxy Project, Terra.bio, Illumina Connected Insights

Requires account/payment:

  • UK Biobank (access required), HGMD Professional (paid)

dbNSFP Pre-Computed Scores (to extract)

Many tools in this catalog distribute scores via dbNSFP rather than standalone APIs. The following scores for E1659A should be extractable from dbNSFP v4:

  • BayesDel (addAF and noAF)
  • MetaRNN
  • VEST4
  • ClinPred
  • MPC
  • GERP++/GERP
  • fitCons
  • LINSIGHT
  • Eigen/Eigen-PC
  • PhyloP (100way, 30way, 17way)
  • PhastCons (100way, 30way, 17way)

Action: Download dbNSFP and query chr15:43600551 to extract all pre-computed scores in one batch.

Connections