Who we are

Contacts

1815 W 14th St, Houston, TX 77008

info@newmathdata.com

281-817-6190

GenAI General

Navigating the Ethical Landscape: Generative AI in Data Engineering

Exploring the integration of generative AI in data engineering and its ethical implications. Introduction The integration of generative AI into data engineering marks a significant evolution in the way data ecosystems are managed and utilized. As these advanced technologies take on more complex tasks traditionally performed by humans, they bring forth not only enhanced efficiency […]

GenAI General

Exploring the Future: Generative AI Webinars by Industry Vertical

Introduction to Generative AI and Its Impact Across Different Industries Generative AI is revolutionizing industries by automating creative processes and enhancing decision-making capabilities. This technology uses algorithms to generate content, solve problems, and derive insights from vast datasets, becoming a crucial tool across various sectors. As industries aim to harness the power of generative AI, […]

GenAI General

Augment Your Retrieval: LLMs with Python LangChain and AWS OpenSearch VectorSearch database

Introduction In this blog post we will introduce vector databases and some of the algorithms used for indexing and show examples of how to work with AWS OpenSearch vector database using python and LangChain library. Vector databases A vector database is a type of database that stores high-dimensional vectors for fast retrieval and similarity search. […]

GenAI General

Streamlining Talent Acquisition with HireBot, your AI Powered Recruiter

Introduction HireBot is a chatbot designed to streamline the initial screening process of candidate resumes and profiles for recruiters and hiring managers. It leverages natural language processing to interpret resumes, allowing users to query candidate information through a natural language interface. The primary issue HireBot addresses is the time-consuming nature of manual resume screening, reducing […]

General

Data Quality Monitoring in AWS SageMaker

First things first, what is data quality monitoring? Data quality monitoring for machine learning can generally be thought of from two perspectives. One perspective is that of traditional data-engineering. This type of monitoring is concerned with the “physical” characteristics of the data and ensuring they are what you expect them to be. It involves criteria […]

General

Accurate by Design: Advanced Data Quality on AWS

Introduction Data pipelines in AWS orchestrate the movement and transformation of data across various AWS services. The core objective of these pipelines is to enable efficient data processing, analysis, and storage, ensuring that data is available where and when it is needed. Maintaining high data quality throughout this process is critical; it ensures reliability, accuracy, […]

General

The Problem of Overfitting in Machine Learning

By Lena Qian Introduction Machine learning stands as a pivotal element in contemporary data science, fundamentally altering the landscape of predictive analytics and decision-making across various domains. Despite its widespread adoption, a significant impediment persists in the form of overfitting, wherein machine learning models have high accuracy with training data but fail when presented with […]

General

The Art of Collaboration in Distributed Teams

Introduction In today’s rapidly evolving technological landscape, cloud data engineering projects are at the forefront of innovation, driving businesses towards unprecedented levels of efficiency, scalability, and data-driven decision-making. Central to the success of these projects is the concept of distributed teams – groups of individuals who work together from various geographical locations, leveraging the power […]