Author: Sean Cahill

SageMaker Training and Deployment with Custom Images

How and Why to Use Custom SageMaker Images
If you have used SageMaker for data science modeling work, you have likely used the AWS-provided images to train your models and possibly deploy them to an endpoint. These include images for scikit-learn, XGBoost, and deep learning, among other common frameworks. This article will address an issue we […]

The Problem of Underfitting in Machine Learning

What Underfitting Is and How to Test for It and Minimize Its Impact
Machine learning stands as a pivotal element in contemporary data science, fundamentally altering the landscape of predictive analytics and decision-making across various domains. We have written many articles on machine learning concepts, but this one takes us back to […]

S3 Custom Lifecycles: An AWS Glue solution to incorporate read operations into S3 Lifecycles

Introduction
S3 Lifecycles provide a fantastic way to manage the lifetimes of S3 objects. Working in AWS and in bigger organizations, you will inevitably have objects lingering (often in standard storage), costing you money. Lifecycle rules offer an easy way to trim some fat, which can both help you save money and exist in a […]

Data Quality Monitoring in AWS SageMaker

First things first, what is data quality monitoring? Data quality monitoring for machine learning can generally be thought of from two perspectives. One perspective is that of traditional data engineering. This type of monitoring is concerned with the “physical” characteristics of the data and with ensuring they are what you expect them to be. It involves criteria […]

Remote Development in SageMaker Studio with VS Code

Disclaimer about Changes to SageMaker Studio
As of Nov. 30, 2023, there have been major changes to SageMaker Studio. Existing customers of SageMaker Studio will get the default experience now called SageMaker Studio Classic; this is the Studio experience this article was written for. New SageMaker Studio customers (and existing customers that choose to […]
