Data Engineering

Aug
12
Apache Iceberg vs Apache Parquet: Metadata Deep Dive

Apache Iceberg vs Apache Parquet: Metadata Deep Dive

Apache Iceberg is a table format for huge analytic tables. Iceberg brings the ability to treat your cloud storage data
7 min read
Jun
24
Tiny Thoughts: Cutting Back on Snowflake

Tiny Thoughts: Cutting Back on Snowflake

* Snowflake is a cloud data warehouse to store all your analytics data.  To anyone that uses Snowflake, you’re likely
1 min read
Jan
25
Understanding Access Control in Snowflake

Understanding Access Control in Snowflake

Snowflake is a pretty awesome data warehouse.  I recently implemented the access control policy for my organization.  Access control often
3 min read
Dec
04
Should you take the AWS Solutions Architect Exam?

Should you take the AWS Solutions Architect Exam?

TLDR: If you want to learn about cloud computing or AWS specifically, then yes.  Here are some lessons learned on
3 min read
Oct
02
Book Summary: Designing Data-Intensive Applications

Book Summary: Designing Data-Intensive Applications

TLDR: Data-Intensive applications should be reliable, scalable, and maintainable.  A big challenge is understanding the tradeoffs to pick the right
4 min read
Aug
07
Shell command line essentials for engineers

Shell command line essentials for engineers

TLDR: Learn a few keyboard shortcuts for BASH and get a crash course in the simplest of BASH scripting that
4 min read
Jul
03
The busy developer’s guide to Terraform

The busy developer’s guide to Terraform

TLDR:  Terraform is a tool to manage your infrastructure as code.  You define configuration files that describe your infrastructure.  Terraform
3 min read
Jun
04
The busy developer’s guide to Helm and Helm Charts

The busy developer’s guide to Helm and Helm Charts

TLDR: Helm is a package manager for Kubernetes (think what brew is for MacOs, pip for Python).  Helm Charts are
3 min read
May
01
The busy developer's guide to Apache NiFi

The busy developer's guide to Apache NiFi

TLDR: Nifi is an open-source tool used to create workflows for moving and transforming data. Using NiFi’s user interface
5 min read
Apr
04
The busy developer’s guide to Kubernetes

The busy developer’s guide to Kubernetes

TLDR:  Kubernetes is a container orchestration system.  You define K8’s API objects in a configuration file and then the
4 min read