CHIA Grant Architecture Walkthrough

Seattle Children's Hospital โ€” Pediatric Epilepsy Research Platform

re:Invent 2025 Enhanced
UC1: Data Lake
UC2: AI Neuroimaging
Combined
Security & Governance
Cost Summary
UC1: Pediatric Epilepsy Data Lake Architecture
Unified health data platform integrating EHR, imaging, genomics, and wearable data for epilepsy research
Existing AWS Service
re:Invent 2025 โ€” NEW
Data Sources
๐Ÿ’—
Epic FHIR
EHR Records
๐Ÿ“ก
PACS / DICOM
MRI, EEG Images
โฑ๏ธ
Wearables
IoT Seizure Data
๐Ÿงฌ
Genomics
VCF/CRAM Files
Ingestion
โค๏ธ
HealthLake
FHIR Data Store
๐Ÿง 
HealthImaging
Medical Images
๐Ÿงฌ
HealthOmics
Genomic Workflows
NEW
โšก
Lambda Durable
Long-running ETL
Storage
๐Ÿ—„๏ธ
Amazon S3
Data Lake (HIPAA)
NEW
๐Ÿ“
S3 Vectors
Embeddings Store
๐Ÿ›ก๏ธ
Lake Formation
Governance
Processing
๐Ÿ”€
AWS Glue
ETL & Catalog
๐Ÿง 
SageMaker
HyperPod Cluster
NEW
๐Ÿ’พ
Checkpointless
Training Resilience
Analytics & AI
๐Ÿ”
Athena
SQL Analytics
โœจ
Bedrock
Foundation Models
NEW
โœจ
Nova Forge
Custom Model Fine-tune
NEW
๐Ÿค–
AgentCore
Autonomous Agents
Consumption
๐Ÿ“Š
QuickSight
Dashboards
๐ŸŒ
API Gateway
Research APIs
โ–ฆ
Research Portal
Clinician UI
UC2: AI-Enhanced Neuroimaging Pipeline
Super-resolution MRI enhancement model trained on multi-institutional data, deployed as custom Bedrock model
Existing Service
re:Invent 2025 โ€” NEW
Research Environment
๐Ÿ“
Public Datasets
HCP, fastMRI, BraTS
๐Ÿข
UPMC Collaboration
Paired MRI Dataset
โš™๏ธ
Preprocessing
DICOM โ†’ NIfTI, QC
๐Ÿ—„๏ธ
S3 Research Bucket
Curated Training Data
Training Environment
๐Ÿ’ป
HyperPod Cluster
p4d.24xlarge ร— 4
NEW
๐Ÿ’พ
Checkpointless Training
-10% training cost
NEW
๐Ÿ’ป
Trainium3
Next-gen ML silicon
๐Ÿ“ˆ
W&B / MLflow
Experiment Tracking
Production (Inference)
โœ…
Model Evaluation
PSNR, SSIM, Clinical
โœจ
Custom Model Import
Bedrock Inference
๐Ÿ–ฅ๏ธ
Inference Endpoint
Real-time Enhancement
๐Ÿฉบ
Clinical Integration
PACS Viewer Plugin
๐Ÿ”€ Data Flow: UC1 โ†” UC2 Integration
UC1: Data Lake
Neuroimaging + EHR
โ†’
S3 Staging
MRI DICOM Export
โ†’
UC2: Model Training
Super-Resolution
โ†’
Enhanced Images
4ร— Resolution
โ†’
UC1: Data Lake
Research Corpus

The Data Lake (UC1) provides neuroimaging data to train the AI model (UC2). Enhanced images flow back into the lake, enriching the research corpus and enabling clinician access via the Research Portal.

๐Ÿ”— Shared Infrastructure Services
๐Ÿ—„๏ธ
Amazon S3
Data Lake
๐Ÿ”‘
AWS KMS
Encryption
๐Ÿ“œ
CloudTrail
Audit
๐Ÿ›ก๏ธ
Lake Formation
Governance
๐Ÿ“Š
Security Hub
Compliance
๐ŸŒ
VPC + PrivateLink
Networking
๐Ÿ‘ค
IAM + SSO
Identity
๐Ÿ“Š
CloudWatch
Monitoring
โšก re:Invent 2025 Enhancements (Shared)
NEW
๐Ÿ“
S3 Vectors
Replaces OpenSearch โ€” $500/mo savings, simpler architecture
NEW
๐Ÿ’พ
Checkpointless Training
10% compute savings โ€” resilience without checkpoint overhead
NEW
๐Ÿ’ป
Trainium3
40% inference cost reduction โ€” purpose-built for ML
NEW
โšก
Lambda Durable + AgentCore + Nova Forge
ETL orchestration, autonomous agents, custom fine-tuning
Security & Governance โ€” HIPAA-First Architecture
Defense-in-depth controls enabled from Day 1 of the POC. All services are HIPAA-eligible with BAA coverage.
๐Ÿ” Encryption & Access Control
๐Ÿ”‘
AWS KMS
Customer-managed CMKs, automatic key rotation every 365 days
๐Ÿ›ก๏ธ
Lake Formation
Tag-based access: per-site, per-study, column-level PHI masking
๐Ÿ‘ค
IAM + Identity Center
Least-privilege, federated SSO with SCH AD, no long-lived creds
๐ŸŒ
VPC + PrivateLink
All PHI traffic on private network, no internet-facing endpoints
๐Ÿ” Detection & Monitoring
๐Ÿ”Ž
Amazon Macie
Continuous S3 scanning for PHI leakage, alerts on unencrypted PII
โš ๏ธ
Amazon GuardDuty
Threat detection: anomalous API calls, credential compromise, exfiltration
๐Ÿ“œ
AWS CloudTrail
Full API audit trail, 7-year retention for compliance
๐Ÿ“Š
AWS Security Hub
HIPAA conformance pack, continuous compliance scoring
๐Ÿค– AI Governance
๐Ÿค–
AgentCore Policy (Cedar)
Deterministic enforcement: default-deny, forbid-wins-over-permit, automated reasoning
๐Ÿ›ก๏ธ
Bedrock Guardrails
Content filtering, PII redaction, topic denial โ€” applied regardless of model
Policy Authoring
Cedar policies define what AI agents can access. Automated reasoning validates policies are well-formed before deployment.
Runtime Enforcement
Default-deny, forbid-wins-over-permit. Every agent action evaluated against governance policies in milliseconds. Deterministic โ€” not probabilistic.
Audit & Compliance
Full decision log for every agent action โ€” approved or denied. Maps directly to AI Review Board governance requirements.
๐Ÿ’ก For the AI Review Board: Cedar policies are the technical enforcement layer for the governance rules your board defines. "No AI agent may access genomic data without IRB approval tag" becomes a Cedar policy that is enforced deterministically at the gateway.
๐Ÿ“… Security Deployment Timeline
Week 1-2
Security baseline: KMS, VPC, IAM roles, CloudTrail, Security Hub HIPAA pack, GuardDuty
Week 3-4
Data governance: Lake Formation tags, Macie scanning, de-identification pipeline validation
Week 5-8
AI governance: AgentCore policies, Bedrock Guardrails, model access controls
Week 9-12
Penetration testing, compliance audit, AI Review Board sign-off, production readiness
UC1 โ€” Year 1 Estimate
$58,000
Covered by CHIA credits
UC2 โ€” Year 1 Estimate
$66,000
Covered by CHIA credits
Combined Total
$124,000
~$26K headroom on $150K award
CHIA Award Coverage
$150,000
AWS credits โ€” full coverage โœ“

Year 1 Cost Breakdown

UC1 + UC2 Combined = ~$124K (CHIA Award: $150K)

UC1: Data Lake
$58K
UC2: Neuroimaging
$66K
Combined
$124K
CHIA Award $150K
Storage Compute Platform AI/ML

re:Invent 2025 Cost Savings

Waterfall: Original โ†’ Optimized

$140K
Pre-Optimization
-$6K
S3 Vectors
-$4.2K
Checkpointless
-$5.8K
Trainium3
$124K
Post-Optimization

re:Invent 2025 Cost Savings

S3 Vectors
Replaces OpenSearch Serverless
-$500/mo
Checkpointless Training
Eliminates checkpoint I/O
-10% compute
Trainium3 Inference
vs GPU-based inference
-40% inference

Post Year 1: Steady-State Operations

Monthly Operational
$8-10K
After POC optimization
Annual Steady-State
~$100K
With Savings Plans
Comparison
<1 FTE
Less than one research coordinator salary
Year 2+ cost drivers: Storage grows ~20%/year as cohort expands. Compute stays flat (inference only, no retraining). Savings Plans (1-year commit) reduce compute 30-40%. Right-sizing after POC usage patterns are established typically saves an additional 15-20%.
๐Ÿ’ก Budget context: $100K/year supports a platform serving 6,000+ epilepsy patients across 30+ PERC sites โ€” less than the cost of one FTE research coordinator doing manual chart abstraction.

Cost of Inaction โ€” What Manual Alternatives Cost

Manual Chart Abstraction (UC1)
$450K+/yr
3 FTE research coordinators ร— $85K salary + benefits
6,000 patients ร— 4 modalities = 24,000 records to abstract
Timeline: 18-24 months vs. 12 weeks with automation
Commercial 7T Scanning (UC2)
$1.5M+
500 research scans ร— $3,000/scan = $1.5M
Plus: scheduling delays (2-4 weeks), travel to 7T site
Not validated for pediatric routine use
โœ…
AWS Platform: $124K Year 1 (covered by CHIA credits)
Automated NLP + AI imaging enhancement โ€” 10ร— faster, 10ร— cheaper than manual alternatives

Next Steps

1
Confirm genomic data format
WGS vs WES vs targeted panel โ€” biggest cost variable for storage planning
2
Align on de-identification approach
Safe Harbor vs Expert Determination โ€” determines FHIR export pipeline design (needs IRB input)
3
Schedule technical kickoff โ€” Week of May 19
90-minute session with research informatics + engineering. 12-week clock starts โ†’ POC results by mid-June for CHIA submission (July 1 deadline).