Data Engineering Archives

_January 27, 2025_ Chris King

The Art of Data Engineering: Applying Sun Tzu’s Principles

How Ancient Wisdom Can Transform Modern Data Practices. Sun Tzu’s “The Art of War” offers timeless wisdom that transcends the battlefield, providing insights applicable to various domains, including data engineering. In this field, success is not merely about coming up with innovative ideas or implementing initial solutions. True success is measured by how effectively these […]


Data Engineering, Databricks

_September 5, 2024_ Chris King

Databricks First Look: August 2024 Release, Deep Dive into Lakehouse Federation

Exploring the Power of Seamless Data Integration and Enhanced Security with Databricks. Introduction: In the fast-evolving landscape of data analytics, staying updated with the latest platform enhancements is crucial. The August 2024 release from Databricks brings a suite of impactful updates designed to boost security, compliance, and performance. Among these, Lakehouse Federation stands out, offering […]


Data Engineering, Project Management

_June 6, 2024_ Chris King

Risk Management in Cloud Data Projects: Strategies for Success

Introduction to Risk Management in Cloud Data Projects: Risk management is a critical component of any cloud data project. As organizations increasingly rely on cloud technologies to store, process, and analyze data, understanding the unique risks associated with these projects becomes essential. Cloud data projects involve various stakeholders and technologies, which introduce complexities in data […]


Data Engineering, Project Management

_May 30, 2024_ Chris King

Effective Project Planning for Cloud Data Engineering

Introduction to Project Planning in Cloud Data Engineering: Project planning is a critical component of successful data engineering projects, especially when these projects are executed in the cloud. The unique characteristics of cloud computing, such as scalability, on-demand resources, and geographic distribution, offer both opportunities and challenges that must be carefully managed through effective planning. […]


AWS, Data Engineering

_April 11, 2024_ Chris King

Accurate by Design: Advanced Data Quality on AWS

Introduction: Data pipelines in AWS orchestrate the movement and transformation of data across various AWS services. The core objective of these pipelines is to enable efficient data processing, analysis, and storage, ensuring that data is available where and when it is needed. Maintaining high data quality throughout this process is critical; it ensures reliability, accuracy, […]


Data Engineering, Leadership, Life Sciences

_March 28, 2024_ Chris King

Unleashing Potential: High-Performing Cloud Data Engineering Teams

Introduction: The ability to efficiently process, store, and analyze vast amounts of data in real time is not just a competitive advantage but a necessity for survival and growth. This transformative potential, however, hinges on the capabilities of the teams at the helm, tasked with navigating the intricacies of cloud data systems to unlock valuable insights […]


AWS, Data Engineering, Data Science, Python

_March 7, 2024_ Chris King

Turbocharge your functional tests with LocalStack for AWS

A deep dive into functional testing for AWS development. Introduction: In our exploration of advanced testing techniques for AWS development, we’ve delved into powerful tools like moto for unit testing and pytest.mark.parametrize for enhancing test coverage and efficiency. Building on this foundation, we turn our focus to a pivotal tool that bridges the gap between […]


Data, Data Engineering

_December 12, 2023_ Chris King

Data Engineering Methodology: From requirements to hand-off

Introduction: Joining or starting data projects in large enterprise environments with many stakeholders can be stressful, not to mention a technical implementation nightmare. When the primary stakeholders can’t (or won’t) give the project team clear requirements, the onus falls to the technical implementation team to create order from the chaos and organize the delivery team […]


Data Engineering

_November 28, 2023_ Scott Peters

The Data Journey

Many organizations share similar challenges in growing their operational capabilities with data. I have given several talks on data lake design and avoiding the “swampiness” of your data lake; invariably, there are various pockets of mess or a “junk drawer” where people hide little bits of critical information. A complex data environment with myriad source […]


Data Engineering

_October 17, 2023_ Taylor Feinstein

Running DBT on Databricks while using dbt_external_tables package to utilize Snowflake Tables

This article highlights a specific use case where one might need to run dbt on Databricks while utilizing tables in Snowflake. Typically, dbt runs on top of the database where it is instantiated. However, if a table needed to run dbt in Databricks does not exist in the hive-metastore and instead exists in an external […]

