Data Engineering Consulting

Turn raw data into reliable infrastructure.

I design and build data pipelines, cloud data platforms, and analytics engineering solutions that give teams the data they need — on time, at scale, and without the toil.

AWS Certified · dbt · Apache Spark · Snowflake · Apache Airflow

What I build

End-to-end data engineering — from raw ingestion to analytics-ready models.

Data Pipeline Development

Batch and streaming pipelines built with Airflow, dbt, Spark, and Kafka. Idempotent, observable, and easy for your team to maintain.

Cloud Data Platform Design

Architecture and implementation on AWS (Glue, Redshift, Lake Formation, Athena), Snowflake, and BigQuery — right-sized for your workload and budget.

Data Warehouse & Modeling

Dimensional models, data vault, and analytics-layer transformations with dbt. Clean, documented, tested models your analysts can trust.

ETL / ELT Migration

Migrate legacy SSIS, Informatica, or custom-script workloads to modern ELT patterns. Less fragility, faster iteration, lower ops overhead.

Analytics Engineering

Semantic layer design, metric definitions, and self-serve analytics foundations that bridge the gap between raw data and the BI tools your stakeholders actually use.

Data Governance & Quality

Automated data quality tests, lineage documentation, access control patterns, and data architectures built for HIPAA and other compliance requirements.

Tech stack

Tools I use daily — not checkbox certifications.

Cloud & Infra

  • AWS (Glue, Redshift, S3, Lambda, Athena, Lake Formation, EMR)
  • Azure (Data Factory, Synapse, Databricks)
  • Terraform / CloudFormation

Orchestration

  • Apache Airflow (MWAA, Astronomer)
  • Prefect
  • AWS Step Functions
  • dbt Cloud + Core

Processing

  • Apache Spark (PySpark)
  • Apache Kafka + Kinesis
  • Pandas / Polars
  • AWS Glue + EMR

Storage & Warehousing

  • Snowflake
  • Amazon Redshift
  • Google BigQuery
  • Delta Lake / Apache Iceberg

Languages

  • Python
  • SQL
  • TypeScript / Node.js
  • Bash / Shell

BI & Visualization

  • Power BI
  • Tableau
  • Looker / LookML
  • Apache Superset

About

I'm Scott Merklinger, an independent data engineering consultant with experience designing and delivering data infrastructure across healthcare, enterprise SaaS, and cloud-native startups.

I work with engineering teams and data leaders who need to move fast — building pipelines that are correct, observable, and maintainable by the people who inherit them. No bloat, no over-engineered abstraction layers.

When I'm not building pipelines, I'm thinking about AI-augmented data workflows, agentic automation, and the next generation of data tooling.

Work with me
10+ Years experience
AWS Certified
50+ Projects delivered
TB of data processed

Start a conversation

Tell me about your data challenge. I'll respond within one business day.

Or email directly: merkasoft@comcast.net