ChapterDatasets UsedPurpose / Machine Learning Techniques
1. Introduction to ML in UtilitiesEIA-930 (regional demand), Synthetic SCADA snapshotsSimple regression and visualization of utility data trends
2. Utility Data FoundationsSynthetic SCADA, AMI smart meter data, GIS feeder layoutsData cleaning, resampling, joining telemetry and asset layers
3. ML Fundamentals for Grid ApplicationsSynthetic load curves (AMI), SCADA readingsRegression (load prediction), classification (asset health labels), clustering (AMI load profiles)
4. Load Forecasting and Demand AnalyticsEIA-930 demand data, NOAA weather dataTime series forecasting (SARIMA), regression (weather vs. load)
5. Predictive Maintenance for Grid AssetsSCADA telemetry (temperature, current), Synthetic EAM recordsClassification (failure prediction), anomaly detection (early degradation signals)
6. Outage Prediction and Reliability AnalyticsNOAA storm data, PUC outage reports, GIS feeder mapsClassification (storm outage risk), geospatial joins for feeder exposure
7. Grid Operations OptimizationSCADA voltage/reactive power data (synthetic IEEE feeders)Reinforcement learning (voltage control), optimization for VAr dispatch
8. Renewable Integration and DER ForecastingNREL NSRDB solar irradiance, PV generation profilesRegression (PV power modeling), time series forecasting (PV output)
9. Customer Analytics and Demand ResponseAMI smart meter data, synthetic DR event logsClustering (customer segments), classification (participation prediction)
10. Computer Vision for InspectionsSynthetic drone imagery of lines/substations, NDVI vegetation mapsCNN-based defect detection, segmentation (vegetation encroachment)
11. NLP for Maintenance and ComplianceSynthetic inspection logs, NERC CIP textNLP: text classification (routine vs. failure), entity extraction (assets, failure modes)
12. MLOps for UtilitiesPredictive maintenance model outputs (Chapter 5)Model versioning, automated retraining workflows (MLflow)
13. Cybersecurity AnalyticsCICIDS2017 network traffic dataset, synthetic SCADA logsAnomaly detection (unsupervised), intrusion classification (supervised ML)
14. Integrated Analytics PipelinesSCADA, outage risk models (Ch. 6), maintenance scores (Ch. 5)Orchestration (Prefect pipelines) combining outputs for operational dashboards
15. AI Ethics and GovernancePredictive maintenance risk outputs (urban vs. rural segmentation)Fairness audits (performance parity), explainability (SHAP analysis)
16. Workflow OrchestrationCombined datasets (load, outage, maintenance)Automated scheduling of multi-model analytics
17. Large Language Models and Multimodal AIMaintenance logs (NLP) + SCADA telemetry + drone imageryMultimodal AI combining text, structured data, and image insights
18. AI Roadmap for UtilitiesAggregate datasets from prior chaptersScenario modeling of AI maturity impacts (cost savings, SAIDI/SAIFI improvement)
19. Enterprise Integration (SCADA/GIS/EAM)SCADA telemetry, GIS asset layers, EAM maintenance recordsUnified data pipelines linking IT/OT for analytics consumption
20. AI Platform DeploymentPredictive maintenance and outage prediction modelsReal-time model deployment (API endpoints, streaming inference)