Wrangling Geospatial Data with GeoPandas and Apache Sedona on AWS

Introduction: This article will walk through two leading solutions for processing and analyzing geospatial data: GeoPandas for efficient in-memory processing on a single machine, and Apache Sedona, which is built to run on distributed compute clusters like Apache Spark, Flink, and Databricks. We will show coding examples of both and how to set up GeoPandas locally and Apache Sedona in an AWS […]
A Vision for Application Observability

A ChatGPT-genarated image of a man observing an application Contextualizing Your Application Logs and Metrics to Understand Your Applications Better There are many platforms for logging, like Splunk, Dynatrace, and Prometheus, among others. But what do we really need to understand if our application is performing as we need it to? This depends on where the application lives and runs, […]
Building Intelligent Data Query Systems with Amazon Bedrock Knowledge Base and Redshift

Introduction In today’s data-driven world, organizations are constantly seeking ways to make their data more accessible to non-technical users. Traditional SQL queries require specialized knowledge and can be intimidating for business users who need quick insights from their data. Amazon Bedrock Knowledge Base with Redshift integration offers a revolutionary solution that bridges this gap by enabling natural […]
Hard‑Mind, Soft‑Voice: Execute First, Feel After

Plan, Execute, Analyze, Improve (repeat) TL;DR Decide the frame up front: park emotions, run the plan, debrief after. Be firm inside, kind outside: strict mindset, respectful tone. Works across contexts: endurance events, negotiations, and breakups. The Principle Hard work collapses when emotion floods the system. The goal isn’t to suppress feelings forever; it’s to sequence them. During the event, disable […]
Building Enterprise Agents

A Practical Playbook for Design, Rollout, and Metrics TL;DR — Start with a small set of high-value workflows. Use serverless primitives to control cost, MCP connectors for system access, and a Human-in-the-Loop (HITL) gate for risky actions. Ship via IaC + CI/CD. Prove safety and UX in a pilot/UAT, then scale. Instrument everything: errors → Jira, approvals/rejections → […]
Stream CDC data with Amazon Redshift streaming, Amazon MSK and Debezium Connector

Episode 3: Redshift Serverless Streaming Ingestion Introduction In the previous episodes, I covered the overall architecture design for this project and Debezium connector configuration for our CDC streaming pipeline. Now I’ll complete the series by diving deep into Amazon Redshift Serverless streaming ingestion — the final piece that enables real-time analytics on your CDC data. This episode focuses on the practical […]
Case Study: Automating Google and Jira Admin Tasks with AWS AgentCore

Executive Summary Administrative tasks in Google Workspace and Jira have consumed hundreds of staff hours for our team each quarter. These tasks — ranging from user provisioning to project setup — were repetitive, error-prone, and often delayed due to backlogs. By building a Slack-first automation interface powered by AWS AgentCore primitives (Gateway, Memory, Identity), we achieved a 90% reduction in admin time for common […]
Culture is the Code

Organizational change, rather than technical performance, will drive successful adoption of Agentic AI. Enterprise software has always carried an implicit theory of organization. Mainframes embodied centralization. ERPs encoded bureaucratic workflows. Rule engines and decision trees promised certainty by reducing ambiguity to fixed logic. These systems were designed for a world that valued compliance, stability, and efficiency over adaptability. […]
Stream CDC data with Amazon Redshift streaming, Amazon MSK and Debezium Connector

Episode 2: Configuring Debezium Connector for Reliable CDC Introduction In the previous episode, I covered the overall architecture and infrastructure setup for our CDC streaming pipeline. Now I’ll dive deep into Debezium — the open-source platform that captures row-level database changes in real-time and streams them to Kafka topics through MSK Connect. This episode focuses on the […]
Stream CDC data with Amazon Redshift streaming, Amazon MSK and Debezium Connector

Episode 1: Designing the End-to-End CDC Architecture and IaC Setup Introduction In today’s data-driven landscape, organizations need real-time insights from their operational databases to make informed decisions quickly. I developed a comprehensive Change Data Capture (CDC) streaming pipeline that captures database changes from Aurora MySQL and streams them in real-time to Amazon Redshift data warehouse for analytics. This solution […]