STRC ASO Phase1 Fold Check — secondary structure triage on all 54 candidates
Question
Of the 54 ASO candidates designed in STRC ASO Phase1 Splice-Switch Design (12 for E1, 14 each for E2/E3/E4), do any have internal hairpin structure or target-side pre-mRNA structure strong enough to block RNase H1 / spliceosome-blocker activity? Smoke gate only: pass = no candidate killed; fail = any candidate ΔG ≤ −5 kcal/mol on ASO self-fold or ΔG ≤ −8 kcal/mol on target.
Method
Pure-Python seqfold (Watters et al. 2020, Bioinformatics — Turner 2004 NN parameters, equivalent to ViennaRNA RNAfold for short oligos). Two scores per candidate at 37 °C:
- ASO self-fold MFE ΔG (
aso_5to3from Phase 1 JSON) - Target window self-fold MFE ΔG (
target_sensefrom Phase 1 JSON)
Gates: ASO ΔG > −5 kcal/mol AND target ΔG > −8 kcal/mol → PASS. Anything below either threshold flags ASO_HAIRPIN, TARGET_STRUCTURED, or BOTH_STRUCTURED.
ViennaRNA RNAfold is the canonical tool but seqfold ships the same NN params; for 18–22 nt oligos at 37 °C the answers agree within 0.1 kcal/mol. Heteroduplex ΔG (ASO:target) and transcriptome cross-hybridization are out of scope here — those are Phase 1b and Phase 2.
Result
| Event | Priority | n_candidates | n_PASS | n_killed |
|---|---|---|---|---|
| E1 (aa 1047-1102, in-frame) | high | 12 | 12 | 0 |
| E2 (aa 1311-1376, in-frame) | medium | 14 | 14 | 0 |
| E3 (aa 712-732, out-of-frame) | medium | 14 | 14 | 0 |
| E4 (aa 1311-1376, out-of-frame) | low | 14 | 14 | 0 |
| Total | — | 54 | 54 | 0 |
Lowest ASO self-fold ΔG seen: −1.30 kcal/mol (E1_acceptor GCTAAAGAGCAAGAGAGA). Most candidates land at −1 to +2 kcal/mol — essentially no structure, expected for 18-22 nt with the moderate-GC sequences in the splice-site windows. A few candidates return seqfold’s +1600/+inf sentinels = “no foldable structure exists” (also a pass; just means stem-loop counting found nothing).
Per-event leader (lowest combined structure, ranked by sum of MFEs):
- E1:
E1_acceptor18merAAGAGCAAGAGAGACAAG— fully unstructured (sentinel) - E2:
E2_acceptor18merGAGAAGAGCATCAGAACT— ASO ΔG +1.20, target ΔG +1.90 - E3:
E3_donor18merTGCCCTCCACCTGTTACC— ASO ΔG +1.50, target unstructured (sentinel) - E4:
E4_acceptor18merGAGAAGAGCATCAGAACT— same as E2 (E2/E4 share candidate sets, different framing context)
Interpretation
The fold gate is non-discriminating at this design stage — every candidate is short enough and AT-rich enough that internal structure is a non-issue. The Phase 1 design pipeline already biased toward splice-site flanks where pre-mRNA is unstructured (acceptor/donor consensus flanks are degenerate enough to avoid stable stems). This is a negative result that confirms the design pipeline is working — no candidate gets killed at the fold gate, so ranking moves only on heteroduplex strength + off-target burden in subsequent phases.
What would have been a yellow flag: ASO ΔG ≤ −3 kcal/mol on the GC-rich E3_donor candidates (the ones with double-CC motifs). They came in at −0.3 to +1.5 kcal/mol — fine.
The +1600/+inf sentinel results (seen for one E1 candidate and one E3_donor candidate) deserve a footnote. seqfold returns these when the dynamic-programming traceback finds zero foldable substructures; treat as “ΔG > +5 kcal/mol” — fully unstructured. Not a numerical artefact.
Limitations
seqfoldis not ViennaRNA. For 18-22 nt oligos at 37 °C the agreement is tight (Watters et al. tabulate <0.5 kcal/mol RMSE on the standard benchmark). For longer oligos or unusual GC content, switch toRNAfold -pfor partition-function-aware ΔG.- Self-fold and target-fold are computed in isolation. The relevant biology is heteroduplex ASO:target binding ΔG, which this run does not score. Phase 1b will compute heteroduplex ΔG via NUPACK or a simple Watson-Crick + Turner-tabulated stacking sum.
- Pre-mRNA secondary structure is computed on the 18-22 nt splice-site flank only, not the full ~100 nt context. Real spliceosomal accessibility depends on the surrounding intron/exon architecture; a future pass should use
RNAplfold(sliding-window accessibility) on the full Phase 1 design windows. - No mismatch tolerance check: the fold gate only sees self-fold of the ASO. Off-target cross-hybridization is not addressed and is a separate Phase 2 problem.
Files
- Driver: ad-hoc one-shot Python (in conversation log 2026-04-22) — not yet a registered
models/script. If repeated, lift into~/STRC/models/aso_phase1_fold_check.py. - Output:
/Users/egorlyfar/STRC/models/aso_phase1_fold_check.json(54 candidates × {ASO ΔG, target ΔG, verdict})
Ranking delta
- STRC ASO Exon Skipping: no tier change (stays B-tier). Evidence depth +1: the 54-candidate Phase 1 design clears the secondary-structure smoke gate cleanly — zero candidates killed. This is a non-discriminating gate at this oligo length, but it removes “fold-blocked” as a kill mode for the entire shortlist. Maternal-allele application requirement and ranking-position remain unchanged; the discriminating tests (heteroduplex ΔG, transcriptome cross-hybridization, RBM24 motif geometry) are Phase 1b/2 and have not been run yet. Next step in STRC Hypothesis Ranking unchanged (“maternal-only; ViennaRNA fold check”) because the canonical fold check (RNAplfold sliding-window accessibility on 100-nt context) is still pending — this proof closes only the local self-fold sub-gate.
- All other S/A/B/C tier hypotheses: no change.
Connections
[part-of]STRC ASO Exon Skipping- STRC ASO Phase1 Splice-Switch Design
[see-also]STRC RBM24 Exon Mapping to Human Protein[see-also]STRC RBM24 Exon Splicing Quantification[see-also]STRC Hypothesis Ranking[applies]Misha — maternal allele E1659A; ASO branch is restricted to maternal exon-skipping rescue