Data Engineering Consulting

Turn raw data into
reliable infrastructure.

I design and build data pipelines, cloud data platforms, and analytics engineering solutions that give teams the data they need — on time, at scale, and without the toil.

Start a project See what I do

AWS Certified dbt Apache Spark Snowflake Apache Airflow

What I build

End-to-end data engineering — from raw ingestion to analytics-ready models.

▶

Data Pipeline Development

Batch and streaming pipelines built with Airflow, dbt, Spark, and Kafka. Idempotent, observable, and easy for your team to maintain.

☁

Cloud Data Platform Design

Architecture and implementation on AWS (Glue, Redshift, Lake Formation, Athena), Snowflake, and BigQuery — right-sized for your workload and budget.

☷

Data Warehouse & Modeling

Dimensional models, data vault, and analytics-layer transformations with dbt. Clean, documented, tested models your analysts can trust.

⇋

ETL / ELT Migration

Lift legacy SSIS, Informatica, or custom scripts to modern ELT patterns. Less fragility, faster iteration, lower ops overhead.

📈

Analytics Engineering

Semantic layer design, metric definitions, and self-serve analytics foundations. Bridges the gap between raw data and BI tools your stakeholders actually use.

🔒

Data Governance & Quality

Automated data quality tests, lineage documentation, access control patterns, and HIPAA/compliance-ready data architectures.

Tech stack

Tools I use daily — not checkbox certifications.

Cloud & Infra

AWS (Glue, Redshift, S3, Lambda, Athena, Lake Formation, EMR)
Azure (Data Factory, Synapse, Databricks)
Terraform / CloudFormation

Orchestration

Apache Airflow (MWAA, Astronomer)
Prefect
AWS Step Functions
dbt Cloud + Core

Processing

Apache Spark (PySpark)
Apache Kafka + Kinesis
Pandas / Polars
AWS Glue + EMR

Storage & Warehousing

Snowflake
Amazon Redshift
Google BigQuery
Delta Lake / Apache Iceberg

Languages

Python
SQL
TypeScript / Node.js
Bash / Shell

BI & Visualization

Power BI
Tableau
Looker / LookML
Apache Superset

About

I'm Scott Merklinger, an independent data engineering consultant with experience designing and delivering data infrastructure across healthcare, enterprise SaaS, and cloud-native startups.

I work with engineering teams and data leaders who need to move fast — building pipelines that are correct, observable, and maintainable by the people who inherit them. No bloat, no over-engineered abstraction layers.

When I'm not building pipelines I'm thinking about AI-augmented data workflows, agentic automation, and the next generation of data tooling.

Work with me

10+ Years experience

AWS Certified

50+ Projects delivered

∞ TB of data processed

Turn raw data intoreliable infrastructure.