Open Source

Practical tools and reference implementations.

Open Source Data Engineering & Azure Modernization Projects

I maintain a set of public open-source projects focused on data engineering, financial data access, Azure modernization, SQL Server performance, and production observability. These repositories are built as practical tools and reference implementations, with tests, documentation, CI, and clear legal/usage boundaries.

Canada Valet CLI

Bank of Canada Valet API Tool

An unofficial open-source Python CLI for the public Bank of Canada Valet API. It lets users fetch, cache, validate, search, and export public financial data from the terminal. Supports CSV, JSON, and Parquet export, plus local caching, retry/backoff, metadata search, validation checks, and a doctor command.

Unofficial project. Not affiliated with, endorsed by, or sponsored by the Bank of Canada. Data source: Bank of Canada Valet API. Does not provide financial, trading, investing, tax, or legal advice.
Python, Typer, httpx, Pydantic, Rich, Pandas, Open Data, CLI
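
To illustrate the retry/backoff idea mentioned above, here is a minimal sketch using only the standard library. It is not the CLI's actual code: the function name `fetch_with_backoff`, its parameters, and the choice to retry on `OSError` are assumptions made for this example. The caller supplies any zero-argument `fetch` callable, so the sketch stays independent of the HTTP client.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def fetch_with_backoff(
    fetch: Callable[[], T],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fetch() until it succeeds, doubling the wait after each failure.

    Hypothetical helper: names and defaults are illustrative only.
    """
    for attempt in range(max_attempts):
        try:
            return fetch()
        except OSError:
            # Network-style errors are retried; the last failure is re-raised.
            if attempt == max_attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)
    raise ValueError("max_attempts must be at least 1")
```

Passing `sleep` as a parameter keeps the helper testable: a test can record the requested delays instead of actually waiting.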

Azure EDM Reference Data Platform

Reference Architecture Simulator

An EDM-style reference data platform simulator for Azure. Demonstrates ingestion, staging, validation, survivorship, golden source, audit, lineage, monitoring, and downstream distribution using Python, SQL Server/Azure SQL, Azure Data Factory sample pipelines, Bicep templates, Docker Compose, pytest, and ruff.

EDM-style simulator. Not Markit EDM. Does not use proprietary vendor code, SDKs, schemas, or real financial data.
Azure, SQL Server, ADF, Bicep, Python, Data Quality, Lineage, Reference Data
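
As a rough illustration of the survivorship and golden-source steps, the sketch below picks one surviving record from several competing sources. It is an assumption-laden example, not the simulator's actual rules: the `SourceRecord` type, the `SOURCE_PRIORITY` ranking, and the source names are all hypothetical, and the data is made up.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class SourceRecord:
    source: str
    updated: date
    value: Optional[str]


# Hypothetical source ranking: lower number wins.
SOURCE_PRIORITY = {"vendor_a": 0, "vendor_b": 1, "manual": 2}


def golden_value(records: list[SourceRecord]) -> Optional[SourceRecord]:
    """Pick the surviving record: best-ranked source first, newest update as tie-break."""
    # Records with no usable value cannot survive.
    candidates = [r for r in records if r.value not in (None, "")]
    if not candidates:
        return None
    unknown_rank = len(SOURCE_PRIORITY)  # unranked sources sort last
    return min(
        candidates,
        key=lambda r: (
            SOURCE_PRIORITY.get(r.source, unknown_rank),
            -r.updated.toordinal(),
        ),
    )
```

Here a blank value from the top-ranked source does not block a lower-ranked source from supplying the golden value. Real survivorship rules are usually defined per attribute rather than per record.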

SQL EDM Performance Tuning Toolkit

Database Optimization Utilities

A SQL Server performance tuning toolkit for EDM-style batch data platforms. Helps analyze technical debt, slow procedures, missing indexes, blocking, wait stats, Query Store, tempdb pressure, batch SLA breaches, and Azure migration readiness. Includes bad vs tuned SQL examples, diagnostic scripts, synthetic data generation, benchmark reports, and a static SQL technical debt scanner.

Generic open-source SQL tuning toolkit. Uses synthetic data only. Does not include proprietary EDM internals or real banking data.
SQL Server, Performance Tuning, Query Store, Azure Migration, Python, Batch Processing
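
A static SQL technical debt scanner can be as simple as a set of patterns applied line by line. The sketch below is one possible shape, not the toolkit's real rule set: the `RULES` dictionary, the rule names, and the `scan_sql` function are hypothetical.

```python
import re

# Hypothetical rule set; each pattern flags one common T-SQL smell.
RULES = {
    "select_star": re.compile(r"\bselect\s+\*", re.IGNORECASE),
    "nolock_hint": re.compile(r"\bnolock\b", re.IGNORECASE),
    "cursor": re.compile(r"\bdeclare\s+\w+\s+cursor\b", re.IGNORECASE),
}


def scan_sql(sql: str) -> list[tuple[int, str]]:
    """Return (line_number, rule_name) for every rule that matches a line."""
    findings = []
    for number, line in enumerate(sql.splitlines(), start=1):
        code = line.split("--", 1)[0]  # ignore trailing line comments
        for name, pattern in RULES.items():
            if pattern.search(code):
                findings.append((number, name))
    return findings
```

This line-based approach is deliberately naive: it does not handle block comments, statements that span lines, or `--` inside string literals. A production scanner would more likely work from a parsed syntax tree.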

EDM Observability Azure Monitor Kit

Telemetry & Alerting Patterns

An observability starter kit for EDM-style Azure data platforms. Shows how to monitor batch jobs, Azure Data Factory pipelines, Python integrations, SQL jobs, missing files, data quality exceptions, SLA breaches, and downstream distribution failures. Includes structured JSON logging, correlation IDs, W3C traceparent, KQL queries, SQL monitoring tables, Bicep alert templates, dashboard notes, runbooks, and Dynatrace coexistence guidance.

Open-source observability pattern kit. Uses fake examples only. No real production telemetry or confidential data.
Azure Monitor, Application Insights, KQL, Observability, Python, SQL, Dynatrace, Runbooks
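
The combination of structured JSON logging, correlation IDs, and W3C traceparent can be sketched in a few lines of standard-library Python. The function names and log field names below are assumptions for illustration and are not taken from the kit. The `traceparent` layout (version, 32-hex trace ID, 16-hex parent ID, 2-hex flags) follows the W3C Trace Context format.

```python
import json
import secrets
import uuid
from datetime import datetime, timezone


def new_traceparent() -> str:
    """Build a W3C traceparent value: version-traceid-parentid-flags."""
    trace_id = secrets.token_hex(16)   # 32 hex characters
    parent_id = secrets.token_hex(8)   # 16 hex characters
    return f"00-{trace_id}-{parent_id}-01"  # 01 = sampled


def log_event(event: str, correlation_id: str, traceparent: str, **fields) -> str:
    """Serialize one log record as a single JSON line (hypothetical field names)."""
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "event": event,
        "correlation_id": correlation_id,
        "traceparent": traceparent,
        **fields,
    }
    return json.dumps(record, sort_keys=True)


# Example with fake values only: one correlation ID shared across a batch run.
correlation_id = str(uuid.uuid4())
print(log_event("batch_started", correlation_id, new_traceparent(), pipeline="demo"))
```

Emitting one JSON object per line keeps the records easy to ingest and query. Reusing the same correlation ID across the Python, SQL, and pipeline steps lets a single batch run be followed end to end.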

These projects are designed as reusable open-source tools and reference implementations for developers working with data platforms, financial data, Azure modernization, and production operations.

Ready to optimize your data platforms?

Partner with us to reduce technical debt and build scalable architectures.
