Who we are

Contacts

1815 W 14th St, Houston, TX 77008

281-817-6190

Data Engineering Leadership Life Sciences

Unleashing Potential: High-Performing Cloud Data Engineering Teams

Introduction The ability to efficiently process, store, and analyze vast amounts of data in real-time is not just a competitive advantage but a necessity for survival and growth. This transformative potential, however, hinges on the capabilities of the teams at the helm, tasked with navigating the intricacies of cloud data systems to unlock valuable insights […]

Leadership Life Sciences

The Art of Collaboration in Distributed Teams

Introduction In today’s rapidly evolving technological landscape, cloud data engineering projects are at the forefront of innovation, driving businesses towards unprecedented levels of efficiency, scalability, and data-driven decision-making. Central to the success of these projects is the concept of distributed teams – groups of individuals who work together from various geographical locations, leveraging the power […]

AWS Data Engineering Data Science Python

Turbocharge your functional tests with LocalStack for AWS

A deep dive into functional testing for AWS development Introduction In our exploration of advanced testing techniques for AWS development, we’ve delved into powerful tools like moto for unit testing and pytest.mark.parametrize for enhancing test coverage and efficiency. Building on this foundation, we turn our focus to a pivotal tool that bridges the gap between […]

AWS Python

Advanced Unit Testing in AWS

Leveraging Moto and Pytest Introduction In the world of AWS development, ensuring the reliability, efficiency, and correctness of your cloud-based applications is paramount. As cloud solutions grow increasingly complex, so too does the challenge of effectively testing these systems. Traditional testing methods often fall short in the face of AWS’s vast and intricately interconnected services. […]

Agile Python Testing

Reliability by design: Implementing Test Driven Development Strategies in Python Data Engineering

Introduction In the rapidly evolving field of data engineering, maintaining high-quality, reliable, and efficient data pipelines is crucial for businesses to make informed decisions and stay competitive. One methodology that has been instrumental in achieving these objectives is Test-Driven Development (TDD). At its core, TDD involves a simple, yet powerful cycle: write a failing test […]

Bot Chatbot LLM Machine Learning

LLMs and chatbots: a brief update

Generally and historically, data engineering, analytics, and science efforts focused on progressing from data to knowledge/wisdom. The emergence of LLMs allows for the decomposition of wisdom/knowledge back down to data. This can enable novel discovery, integrate with information systems, and drive automated processes. GenAI Categories Generation: Use bedrock models to create code, text, or images […]

Agile Leadership Project Management

Agile ‘thin slice’ technique: Explained

Introduction In today’s fast-paced development environment, the Agile methodology stands out for its emphasis on delivering functional features to users as early as possible. This approach challenges traditional, lengthy development cycles by advocating for the incremental release of a product’s most essential functionalities. By prioritizing early delivery, Agile aims to provide immediate value to users, […]

Data

Data Modeling for Developers

An Introduction to Data Modeling and Why it Matters for Development Teams Data modeling is a critical yet often underrated skill in technology development and within development teams. This article aims to teach you the basics of it and why it is important. This article will introduce you to the concept of a data model, […]

AWS

Mounting EFS Volume to Batch Jobs in AWS

Introduction In the realm of distributed computing and batch processing, operational challenges frequently arise that necessitate innovative solutions. A particular challenge we encountered involved a scenario where multiple jobs within our AWS environment were generating tens of thousands of files and storing them in an Amazon S3 bucket. Subsequently, a specific job was tasked with […]