STRC STRCP1 expression in GTEx — co-expressed at 4.5:1 with STRC, not silent
Question
STRC PE Phase4 STRCP1 Paralog Off-Target killed every PE3b pegRNA on STRCP1 paralog hits. STRC ASO Phase2 STRCP1 Paralog Cross-Hybridization killed every Phase-1 splice-switch ASO on the same paralog. Both atomic notes flagged a shared escape path: if STRCP1 is transcriptionally silent in OHCs, the off-target has no RNA substrate and the paralog risk drops. This proof asks: is that escape path biologically plausible?
GTEx is the cheapest first read. It covers 54 human tissues but no cochlea / inner ear, so we use highest-STRC tissues (cerebellum, testis, brain regions, skin) as proxies. Logic: if STRCP1 is co-expressed with STRC in tissues where STRC is abundant, then the OHC (the highest-STRC tissue in the body) almost certainly co-expresses STRCP1 too.
Method
ohc_strcp1_expression_check.py — single-shot GTEx v8 API query for both genes:
| Gene | GENCODE v26 ID | GTEx tissues |
|---|---|---|
| STRC | ENSG00000242866.9 | 54 |
| STRCP1 | ENSG00000166763.7 | 54 |
Median TPM per tissue pulled via gtexportal.org/api/v2/expression/medianGeneExpression?gencodeId=...&datasetId=gtex_v8. Verdict gate computed as the median STRC:STRCP1 ratio in the top 5 STRC-expressing tissues:
| Median ratio | Verdict | Implication |
|---|---|---|
| no STRCP1 detected (<0.05 TPM) | STRCP1_SILENT_PROBABLE | Escape path PLAUSIBLE for both PE+ASO |
| ≥10:1 | STRCP1_LOW_FAVORABLE | ASO loses ~10% dose, tolerable |
| 3:1–10:1 | STRCP1_CO_EXPRESSED_PARTIAL | ASO loses 25-50% dose, significant |
| <3:1 | STRCP1_CO_EXPRESSED_KILL | ASO loses >50% dose, kills splice ROI |
Result — STRCP1 transcribed in all 54 GTEx tissues, ratio 4.5:1 in STRC-expressing tissues
Verdict: STRCP1_CO_EXPRESSED_PARTIAL. Median STRC:STRCP1 ratio across top-5 STRC-expressing tissues = 4.5:1.
Top 10 by STRC TPM (proxy for OHC where direct data is missing):
| Tissue | STRC TPM | STRCP1 TPM | STRC:STRCP1 ratio |
|---|---|---|---|
| Brain Cerebellum | 15.17 | 2.56 | 5.93 |
| Brain Cerebellar Hemisphere | 12.34 | 2.49 | 4.96 |
| Testis | 4.94 | 1.11 | 4.46 |
| Brain Cortex | 1.21 | 0.38 | 3.17 |
| Pituitary | 1.03 | 0.43 | 2.38 |
| Brain Frontal Cortex BA9 | 0.85 | 0.38 | 2.23 |
| Skin Sun Exposed | 0.58 | 0.52 | 1.11 |
| Brain Nucleus Accumbens | 0.57 | 0.27 | 2.11 |
| Skin Not Sun Exposed | 0.56 | 0.45 | 1.26 |
| Prostate | 0.43 | 0.36 | 1.21 |
Distribution:
- STRC > 0.1 TPM: 40 / 54 tissues
- STRCP1 > 0.1 TPM: 37 / 54 tissues
- Both > 0.1 TPM: ~36 / 54 tissues (essentially co-expressed wherever either is detected)
- Tissues where STRCP1 is undetectable (TPM = 0): 0 / 54
The pattern is unambiguous: STRCP1 is not silent. It transcribes wherever STRC does. The Ensembl biotype transcribed_unprocessed_pseudogene and the NCBI note (“a previously suppressed RefSeq was removed because it is now thought that this locus is transcribed”) corroborate independently.
Why this matters more than typical paralog-expression questions
For a typical processed pseudogene, “is it transcribed?” is binary — most processed pseudogenes are not transcribed at all and the off-target paralog risk is moot. STRCP1 is an unprocessed pseudogene retaining the parent’s intron-exon architecture (30 exons vs STRC’s 29; the same splice sites that killed every ASO design). Unprocessed pseudogenes far more often get co-transcribed by the same regulatory elements as the parent gene.
GTEx confirms this empirically. In every STRC-expressing tissue, STRCP1 is co-expressed at 1:1 to 6:1 STRC:STRCP1 ratio. This is not “rare leaky transcription” — STRCP1 reaches 2.6 TPM in cerebellum, comparable to many protein-coding genes there.
The OHC is the body’s highest-expression-per-cell-type tissue for STRC. By extrapolation:
- If OHC shows the same pattern as cerebellum (the highest-STRC GTEx tissue), STRCP1 expression in OHC is ~2-3 TPM, roughly 16% of the STRC level.
- If OHC has STRC-supremacy similar to testis/brain (mean ratio 4.5:1), then ~18% of total STRC + STRCP1 mRNA pool is STRCP1.
- An RNase-H1 gapmer ASO with ≤2 mm match to STRCP1 will cleave 50-80% of STRCP1 transcripts as a function of single dose — dose loss to the paralog is therefore in the 10-20% range, not the >50% range.
This is not the worst-case scenario. The earlier worry that “every ASO molecule binds STRCP1 first” is overstated by the data. But it is also not free — the splice modulation hypothesis was already rated B because of weak inner-ear delivery; losing another ~15% to paralog cleavage is a real (if non-fatal) ROI hit.
For PE the calculus is different. Even rare STRCP1 editing events create permanent off-target genotypes. Unlike ASO (which cleaves transient mRNA and is dose-dependent), PE permanently writes a sequence change. So the GTEx-based dilution argument doesn’t apply to PE — every STRCP1-expressing OHC is at risk for an off-target edit, with unknown phenotypic consequence.
Implications for escape paths
The five escape paths from STRC ASO Phase2 STRCP1 Paralog Cross-Hybridization:
- ✗ STRCP1 silent in OHC — falsified by GTEx. STRCP1 transcribed in every STRC-expressing tissue at non-trivial levels. Cannot rely on this.
- ✓ Morpholino chemistry (no RNase-H cleavage) — more attractive now. Steric block on STRCP1 splicing has no protein-level consequence (STRCP1 makes no functional protein anyway), and morpholino doesn’t deplete drug at the target. The 4.5:1 GTEx ratio means morpholino dose is barely affected by STRCP1 binding.
- △ 3’UTR exonic targets with STRC vs STRCP1 divergence — pseudogene 3’UTRs often diverge faster, but this changes mechanism from splice modulation to steady-state knockdown. Different drug.
- △ Whole-genome BLAST + high-Tm discrimination — possible, requires re-design at higher Tm windows (24-26 nt).
- ✓ Allele-specific gapmer at c.4976 variant directly — also more attractive now. The c.4976A>C variant is on STRC mutant only (STRCP1 has the WT base). 1-bp ΔTm of ~1.5-2.5 °C is borderline for RNase-H discrimination, but a carefully designed 18-nt gapmer with the variant at the central RNase-H cleavage site could achieve 5-10× preference for the mutant allele. This is the most rigorous chemistry-aware path forward.
For PE Phase 3.5 STRCP1-aware redesign: still required. The GTEx data does NOT relax the need to design pegRNAs that discriminate STRC vs STRCP1 in the seed region.
Limitations
- No direct OHC data. GTEx covers 54 tissues but not cochlea / inner ear. The proxy assumption (“OHC behaves like cerebellum / testis for STRCP1 ratio”) is conservative but unverified. Direct human OHC RNA-seq would resolve. Closest published: Hoa et al. 2023 PNAS mouse cochlea atlas (PMC10293812) — uses mouse, and their paper does not mention Strcp1 (mouse may lack the ortholog or it was filtered as a pseudogene from standard scRNA-seq pipelines).
- GTEx v8 = bulk RNA-seq. Median TPM at the tissue level mixes many cell types; per-cell ratio in OHC could differ substantially. Some tissues (cerebellum) have many neuronal subtypes; STRCP1 might be enriched in specific cells while STRC is in others.
- One snapshot, no developmental data. STRCP1 may have age-dependent expression; if it is high in embryonic OHC and silent in postnatal OHC (or vice versa), the gate decision depends on therapeutic window timing.
- Allelic balance not assayed. STRCP1 has known alleles with varying integration into the STRC genomic region; some haplotypes may have higher / lower STRCP1 expression. Misha-specific genotype not analyzed here.
- TPM ≠ functional substrate availability. mRNA stability, accessibility, and ribosome occupancy may differ between STRC and STRCP1; functional substrate for ASO-mediated cleavage tracks bound mRNA, not total TPM.
Files
- Driver:
~/STRC/models/ohc_strcp1_expression_check.py(stdlib + urllib only, no compute) - Output JSON:
~/STRC/models/ohc_strcp1_expression_check.json - GTEx API:
gtexportal.org/api/v2/expression/medianGeneExpression?...&datasetId=gtex_v8
Ranking delta
- STRC ASO Exon Skipping: no tier change (stays C). Mechanism axis unchanged. Evidence depth +1 (escape path 1 falsified, escape paths 2 and 5 promoted to “more attractive”). Next step refined from “verify STRCP1 OHC expression first” → “design morpholino splice-switch + allele-specific gapmer at c.4976 in parallel as Phase 3” (escape paths 2 and 5 from the Phase 2 paper). The OHC RNA-seq direct verification is desirable for IND but no longer the gating step — even if STRCP1 is 50% silent in OHC, the GTEx baseline shows we cannot count on full silence.
- Prime Editing for STRC: no tier change (stays B). Mechanism still 4/5. Evidence depth +1. The STRCP1 paralog off-target stays a per-cell risk regardless of dose dilution — escape path “STRCP1 silent in OHC” was never directly applicable to PE (PE writes permanent edits, not dose-dependent transient cleavage). Next step unchanged: Phase 3.5 STRCP1-discriminating pegRNA redesign is required.
- All other S/A/B/C tier hypotheses: no change.
The shared “OHC STRCP1 expression check” gate from the prior two notes is now closed (with negative result). Both PE and ASO must proceed via design-side escape paths, not via biology-permission. The cross-cutting next step from the inventory (“ohc_strcp1_expression_check.py”) is now complete — no follow-up RNA-seq scrape needed unless a wet-lab RNA-FISH experiment is later proposed.
Connections
- STRC PE Phase4 STRCP1 Paralog Off-Target
- STRC ASO Phase2 STRCP1 Paralog Cross-Hybridization
- STRC ASO Exon Skipping
- Prime Editing for STRC
- GTEx v8 Median TPM API (would need its own reference note if querying again)
[see-also]STRC Hypothesis Ranking[applies]Misha