Skip to content
Reliable Data Engineering

About

I am Amin Siddique, a data engineer focused on building reliable pipelines, migrating legacy systems, and making sense of the rapidly evolving AI-meets-data landscape.

I have spent years working with production data systems across industries — dealing with SQL migrations between dialects (Oracle, Exasol, Databricks), building dbt models at scale, optimizing Spark jobs that process billions of rows, and designing architectures that survive real-world load.

Why This Blog Exists

Most data engineering content falls into two categories: vendor marketing or academic theory. Neither helps when your pipeline fails at 2 AM or when you need to migrate 500 stored procedures to a new platform.

Reliable Data Engineering fills that gap. Every article is grounded in hands-on experience with production systems. I write about what actually works, what breaks, and what I wish someone had told me before I learned the hard way.

The data engineering landscape is shifting fast. AI agents are starting to write SQL, manage pipelines, and automate the grunt work that used to define the job. I started this blog to document that transition honestly — not with hype, but with benchmarks, code, and real-world results.

What You Will Find Here

My Background

I have worked across the full data stack — from writing ETL jobs in Python and Spark to designing warehouse schemas and managing production Databricks environments. My day-to-day involves building and maintaining data platforms that serve downstream analytics, machine learning models, and business-critical reporting.

Before data engineering, I spent time in software development, which gives me a strong opinion about code quality in data work. I believe data pipelines deserve the same engineering rigor as application code: version control, testing, code review, and CI/CD are not optional.

My Approach

I do not write sponsored content. When I recommend a tool, it is because I have used it in production. When I criticize something, I explain why with specifics. Every article includes limitations and honest assessments, not just the highlight reel.

I test everything I write about. If an article includes a benchmark, I ran it. If it includes a code snippet, I executed it. If it covers a tool, I installed it and used it on real data before forming an opinion.

Some articles on this site contain affiliate links to books I genuinely recommend. These are clearly disclosed and do not influence what I write. The small commission helps keep the site running without ads cluttering the reading experience.

Connect

Have questions or feedback? Visit the Contact page.