Apache Iceberg vs Apache Parquet: Metadata Deep Dive
Apache Iceberg is a table format for huge analytic tables. Iceberg brings the ability to treat your cloud storage data
Tiny Thoughts: Cutting Back on Snowflake
Snowflake is a cloud data warehouse for storing all your analytics data. If you use Snowflake, you’re likely
Understanding Access Control in Snowflake
Snowflake is a pretty awesome data warehouse. I recently implemented the access control policy for my organization. Access control often
Should you take the AWS Solutions Architect Exam?
TLDR: If you want to learn about cloud computing or AWS specifically, then yes. Here are some lessons learned on
Book Summary: Designing Data-Intensive Applications
TLDR: Data-intensive applications should be reliable, scalable, and maintainable. A big challenge is understanding the tradeoffs to pick the right
Shell command line essentials for engineers
TLDR: Learn a few keyboard shortcuts for Bash and get a crash course in the simplest of Bash scripting that
The busy developer’s guide to Terraform
TLDR: Terraform is a tool to manage your infrastructure as code. You define configuration files that describe your infrastructure. Terraform
The busy developer’s guide to Helm and Helm Charts
TLDR: Helm is a package manager for Kubernetes (think of what brew is for macOS, or pip for Python). Helm Charts are
The busy developer’s guide to Apache NiFi
TLDR: NiFi is an open-source tool used to create workflows for moving and transforming data. Using NiFi’s user interface
The busy developer’s guide to Kubernetes
TLDR: Kubernetes is a container orchestration system. You define K8s API objects in a configuration file and then the