STRC h01 Phase 5 MD Ensemble Rescoring 2026-04-23

TL;DR. Phase 5a OpenMM MD pipeline validated on local Mac (62 ns/day OpenCL Metal on 164 k-atom Ultra-Mini × TMEM145 chain A; 2 ns production delivered 20 snapshots). Phase 5b ensemble re-docking of Phase 4b top-5 leads + diflunisal positive control against each snapshot → all f_PC estimates at [L]=10 μM are BELOW 0.10, far below NORMAL-rescue threshold of 0.50. Best binder (diflunisal positive control) at f_PC = 0.083; best lead (naphthalene-2-COOH) at f_PC = 0.047. Phase 4b single-structure rankings were over-optimistic by ~0.5-1 kcal/mol: ensemble MD-relaxed receptor reveals the K1141 pocket is LESS favorable for carboxylate ligands than the static snapshot suggested. RED-LIGHT: current shortlist cannot deliver monotherapy NORMAL rescue for Misha via h01; need second-wave virtual screen (DrugBank FDA + ZINC22 carboxylate tranche + non-carboxylate scaffolds) OR de novo design. h01 tier A held; next-step reordered from Phase 5c HEK293 wet-lab to Phase 3c v2 expanded virtual screen. h03 S-tier primary path fully confirmed — remains the near-term Misha lever for MILD rescue (clinically meaningful 20-30 dB ABR improvement). Full Misha NORMAL cure requires longer-horizon h01 lead discovery extending project timeline by 1-2 years.

Why this replaces the deferred Phase 5 GROMACS scaffold

The inventory’s prior pharmacochaperone_phase5_md.py scaffold assumed GROMACS + AmberFF19SB + A100 rental + 50 ns × 3 replicates × 5 ligands in explicit-ligand solvated MD with MM-PBSA — weeks of setup, expensive hardware, untested on Mac.

Phase 5a + Phase 5b split keeps the information gain from ensemble sampling but drops the hardware dependency:

Axis	Deferred Phase 5	Phase 5a + 5b (delivered)
Software stack	GROMACS + AmberTools + MMPBSA.py + OpenFF conda	OpenMM (pip) + PDBFixer (pip) + existing Vina/obabel
Receptor MD	50 ns explicit ligand	2 ns apo-receptor (ensemble-averaged receptor conformations)
Scoring	MM-PBSA absolute ΔG	Vina ensemble mean — captures conformer variance, drops single-pose bias
Runtime	~24-48 h on A100	2 ns MD ~45 min + 6 leads × 20 snapshots Vina ~20 min on Mac
Precision	±1 kcal/mol absolute	±1 kcal/mol relative (bias-aware, order-of-magnitude Kd)
Cost	$100-500 cloud	local only

We lose absolute ΔG calibration quality but gain usability without conda + hardware dependency and avoid the Phase 4f implicit-solvent absolute-energy failure mode. This is Method B of the three possible ways to sample conformational noise; Method A (full MM-GBSA) and Method C (FEP+ / TI) remain on the table for later if Phase 5b signal warrants wet-lab follow-through.

Method

Phase 5a — apo MD ensemble

Target: Ultra-Mini × TMEM145 CIF, chain A only (STRC Ultra-Mini 1075-1775 residues).
Prep: PDBFixer (add missing atoms, protonate at pH 7.4).
Force field: AMBER14SB + TIP3P water; 0.15 M NaCl neutralisation.
Box: cubic, 1 nm pad → 164 k atoms solvated.
Protocol: 5 k-step minimization → 50 ps NVT @ 310 K Langevin → 50 ps NPT @ 1 bar Monte Carlo barostat → 2 ns NPT production → 20 snapshots every 100 ps.
Platform: OpenMM OpenCL (Apple Metal backend).
Performance: 62 ns/day measured on SMOKE; 2 ns production in ~45 min wall time.
Artifacts: models/artifacts/phase5a_snapshots/snap_000.pdb through snap_019.pdb.

Phase 5b — ensemble re-docking

Ligand set: Phase 4b top-5 leads (indole-3-acetic, naphthalene-2-COOH, cyclopropane-phenyl-COOH, salicylic, nicotinic) + diflunisal positive control.
Per-snapshot pipeline: strip water + ions → chain A protein PDB → obabel → PDBQT (Gasteiger charges).
Docking: Vina with Phase 4b box (centre [-22.027, -18.547, 2.215], 18×18×18 Å), exhaustiveness 16, 5 modes, 8 CPU.
Aggregation: per-ligand mean ± std ΔG across 20 snapshots → Kd = exp(ΔG/RT) μM → bound fraction at [L] = 1, 10 μM.
f_PC estimate: bound_fraction × 0.5 (conservative rescue efficiency η; Phase 6 would refine via mechanism QSAR).

Results

Phase 5a pipeline validation

Metric	Value
Solvated system	164 305 atoms (164 k)
SMOKE production (100 ps) wall time	2.3 min
Full production (2 ns) wall time	~45 min
Extrapolated ns/day	62
10 ns production ETA (if needed for finer ensemble)	3.9 h
Snapshots saved	20

Pipeline is production-ready on local Mac. No cloud compute required for h01 Phase 5b.

Phase 5b ensemble affinity and f_PC

Sorted by mean ensemble ΔG. “Phase 4b single” = single-structure Vina score from 2026-04-21; “Δ (kcal/mol)” = Phase 5b mean − Phase 4b single (positive = Phase 5b less favorable).

Ligand	mean ΔG (kcal/mol)	±std	Kd (μM)	bound @ 1 μM	f_PC @ 10 μM	Phase 4b single	Δ
diflunisal (positive control)	−5.865	0.564	50	0.020	0.083	−6.396	+0.53
naphthalene-2-COOH	−5.472	0.470	97	0.010	0.047	−5.980	+0.51
indole-3-acetic-acid	−5.141	0.446	170	0.006	0.028	−6.050	+0.91
cyclopropane-phenyl-COOH	−5.022	0.535	207	0.005	0.023	−5.716	+0.69
salicylic-acid	−4.665	0.230	379	0.003	0.013	−5.024	+0.36
nicotinic-acid	−4.098	0.280	987	0.001	0.005	−5.018	+0.92

Observations:

All Phase 5b numbers are less favorable than Phase 4b — Δ = +0.36 to +0.92 kcal/mol. This is the expected direction of ensemble correction: single-structure docking captures the best lucky pose; ensemble reveals the average over thermally-fluctuating receptor conformations, which is always less favorable than the best single-conformer fit.
Diflunisal (positive control) is the best binder — but its f_PC is only 0.083. The original top-5 leads are worse than the control on ensemble rescoring. This means the leads selected in Phase 3 virtual screen were structurally similar to diflunisal (carboxylate + aromatic) but did not improve on its affinity against the E1659A pocket.
Std ranges 0.23-0.56 kcal/mol — tighter than expected for 2 ns MD; suggests the K1141 pocket is relatively rigid (not a cryptic allosteric site). More MD won’t dramatically change ranks.
Kd 50-1000 μM for all leads — order-of-magnitude too weak for pharmacochaperone therapy. Target: Kd ≤ 10 μM for f_PC ≥ 0.50.

Misha cure threshold check

Per Misha Compound-Het Therapy Stack Model, f_PC required for NORMAL hearing (≤ 25 dB ABR) monotherapy by E1659A severity:

Mild E1659A (f_mat=0.40): f_PC ≥ 0.50
Moderate (f_mat=0.25): f_PC ≥ 0.60
Severe (f_mat=0.10): f_PC ≥ 0.67

None of the six ligands (5 leads + diflunisal) clears f_PC ≥ 0.50 in any severity scenario at therapeutic [L]=10 μM. Best available: diflunisal 0.083, which is ~6× too low for mild-E1659A NORMAL rescue.

Implication for Nobel-path ladder: Ступенька 1 (this Phase 5b compute) returned RED. Steps 2-3 (HEK293 in vitro + SPR/BLI Kd measurement) are not justified for the current shortlist — would just confirm insufficient binding. Real forward path requires backtracking to Phase 3c v2 (expanded virtual screen) to find stronger binders before advancing to wet-lab.

Nobel-path interpretation

This run is ступенька 1 of the 8-step ladder laid out in the preceding session’s Nobel-argument discussion. Its role is: narrow the Phase 4b ±2 kcal/mol noise band down to ±1 kcal/mol via ensemble averaging, and deliver a credible f_PC point estimate per lead. This estimate then feeds the Misha stack model’s NORMAL-cure threshold check.

Green-light signal (→ wet-lab step 2): at least one lead delivers f_PC ≥ 0.50 at [L]=10 μM, supporting NORMAL-hearing monotherapy for mild E1659A. Authorises HEK293 in vitro validation (Phase 5c): transfect E1659A STRC + GFP-actin, treat with lead compound at 10 μM, confocal image to check clustering restoration.

Yellow-light signal (→ second virtual screen): best lead delivers f_PC 0.25-0.50. Stack support exists (h01+h03 combined) but monotherapy insufficient. Authorises Phase 3c v2 virtual screen expansion (DrugBank FDA + ZINC22 carboxylate tranche) to find stronger binder.

Red-light signal (→ downgrade h01 Misha-fit, strengthen h03): all leads deliver f_PC < 0.25. h01 drug-development path becomes longer-horizon (needs de novo design); Misha’s near-term hope shifts primarily to h03 AAV + h09 hydrogel (MILD rescue; no NORMAL unless longer-term pharmacochaperone development pays off).

FINAL: RED-LIGHT. All leads deliver f_PC < 0.10 at therapeutic [L]=10 μM. Interpretation: the Phase 3b→4b shortlist was tuned against a single AF3 conformer of the K1141 pocket; ensemble MD-relaxed receptor reveals the binding pocket is less accommodating of small carboxylate ligands than static analysis suggested. The ligand set is also chemically monotonous (5/6 are small aromatic carboxylates in the 100-250 Da range), so the “best of six” plateau is probably not the true ceiling of what the pocket can accept — larger or chemically diverse scaffolds may bind more strongly.

Forward path (reordered):

Phase 3c v2 expanded virtual screen (next compute move): DrugBank FDA library (~2500 approved molecules) + ZINC22 carboxylate-enriched tranche (~50 k) + fragment-based screen at K1141 pocket on ensemble of 5 receptor conformers (sub-sample from Phase 5a snapshots). Filter by ensemble Vina ΔG ≤ −7.5 kcal/mol (target Kd ≤ 5 μM → f_PC ≥ 0.67 at 10 μM). If hits appear → re-run Phase 5b on new shortlist.
De novo pharmacophore-based design if v2 screen also returns weak: use STRC Pharmacophore Model K1141 Pocket to synthesise or commission virtual libraries targeting the specific pocket hot-spots (K1141-NZ, loop 1642-1651 H-bond acceptors).
Covalent strategies: K1141 has a nucleophilic amine; covalent inhibitor/chaperone class (e.g., acrylamide warhead + carboxylate recognition) may achieve Kd ~nM where reversible binders plateau at μM. New scaffold class for Phase 3c v3.
Path through h03 alone: remains fully viable for MILD rescue. h01 Nobel-path is not killed, just extended by 1-2 years of lead discovery.

Parameter provenance

All Phase 5a/5b parameters are lit-anchored or explicitly docstring-flagged:

AMBER14SB + TIP3P: Maier 2015, Jorgensen 1983. Standard.
Box padding 1 nm, 0.15 M NaCl: matches physiological perilymph [Na⁺] = 140-150 mM.
NVT 50 ps + NPT 50 ps: standard equilibration; 50 ps NVT may be short for large systems but our minimization was deep (5 k steps), reducing NVT demand.
Production 2 ns: SHORT compared to typical 10-50 ns; sufficient for sampling near-native conformational fluctuations of the K1141 pocket but may miss large loop rearrangements (e.g., loop 1642-1651). Extendable to 10 ns (~4 h) if Phase 5b std is too high to distinguish leads.
Vina exhaustiveness 16 per snapshot vs Phase 4b 32: reduced for ensemble speed; 16 × 20 snapshots = 320 “exhaustiveness-equivalents” vs 32 single, so total sampling is richer.
Rescue efficiency η = 0.5: conservative assumption. Tafamidis-TTR η in lit ≈ 0.7; ivacaftor-CFTR G551D η ≈ 0.5-0.6. Our 0.5 is cautious; Phase 6 QSAR refinement would tighten.

What this doesn’t resolve

Absolute Kd: Vina has ±1-2 kcal/mol systematic bias per ligand class. Our ensemble reduces random noise but not bias. Absolute Kd is wet-lab gated (SPR/BLI or ITC).
Rescue mechanism: binding ≠ function. Drug may bind the K1141 pocket without actually restoring the TMEM145 interface charge loss. Mechanism requires MD of full complex with ligand (Phase 5c) OR wet-lab HEK293 phenotype assay.
Off-target: Phase 4e off-target selectivity soft-failed. Ensemble rescoring doesn’t address this. Wet-lab panel against 5-10 human proteins with similar pockets is needed before clinical advance.
PKPD: Does the drug reach [L]=10 μM in cochlea at tolerable systemic dose? This is Phase 6 (oto-PK modelling — analogous to Phase 4e for hydrogel but for small molecule oral/IT route).

Ranking delta

#1 (h01) Pharmacochaperone E1659A: A held. Mech 3 held. Deliv 4 held. Misha-fit 4 held. (Mechanism still biology-plausible; patient still compound-het compatible; only the current ligand shortlist is insufficient — not the hypothesis itself.) Next step reordered from “Phase 5 MD ensemble MM-GBSA” (now DELIVERED with red-light) to “Phase 3c v2 expanded virtual screen” (DrugBank FDA + ZINC22 carboxylate tranche + fragment-based, filter at ensemble ΔG ≤ −7.5 kcal/mol). Phase 5a/5b pipeline remains active for future shortlist rescoring; no reinvention needed.
#3 (h03) Mini-STRC AAV: S held, scores unchanged. Position as primary Misha near-term lever further reinforced: h01 can’t deliver NORMAL cure short-term, so h03’s MILD rescue (20-30 dB ABR improvement, clinically significant per OTOF trial responder criterion) is the operative Misha benefit for the next 2-3 years while h01 lead discovery expands.
Meta — compound-het stack framework: still structurally correct. h01+h03 stack path to NORMAL rescue remains theoretically achievable but now requires h01 second-wave lead discovery. Misha stack model’s “low-burden stack NORMAL” conclusion at mild E1659A preserved conditional on future h01 leads.

The Vina ensemble re-docking confirmed what Phase 4c and Phase 4e soft-hinted: the current leads are not strong enough pharmacochaperones for clinical-grade rescue. This is a HONEST failure that narrows the project scope and points to a concrete next move (expanded virtual screen) without killing any hypothesis.

Connections

[supports] STRC Pharmacochaperone Virtual Screen E1659A — Phase 5 compute ladder
Misha Compound-Het Therapy Stack Model — f_PC parameter feed
[applies] STRC Pharmacochaperone Phase 4b Smoke Test — Phase 4b ligand shortlist
[applies] STRC Pharmacochaperone Phase 4c WT Decoy — selectivity context
[applies] STRC AlphaFold3 Computational Experiments — Ultra-Mini × TMEM145 receptor input
[see-also] STRC Pharmacochaperone Phase 4f Interface Rescue SMOKE — method-inadequate precursor (single-point GBn2); Phase 5a/5b replaces
[see-also] STRC Hypothesis Ranking
[part-of] STRC Hypothesis Ranking Log
[about] Misha

STRC Research

Explorer

STRC h01 Phase 5 MD Ensemble Rescoring 2026-04-23

STRC h01 Phase 5 MD Ensemble Rescoring 2026-04-23

Why this replaces the deferred Phase 5 GROMACS scaffold

Method

Phase 5a — apo MD ensemble

Phase 5b — ensemble re-docking

Results

Phase 5a pipeline validation

Phase 5b ensemble affinity and f_PC

Misha cure threshold check

Nobel-path interpretation

Parameter provenance

What this doesn’t resolve

Ranking delta

Connections

Graph View

Table of Contents

Backlinks