Using Core Concepts of Computational Linguistics with RAG

How linguistic fundamentals shape the design and performance of retrieval-augmented generation systems
In a previous article, I discussed the core concepts of computational linguistics that every language model, big or small, needs to have. This article will discuss the core components of retrieval-augmented generation (RAG) models, show […]
How to: AWS OAM Sink and Link

How and Why to Set Up Cross-Account Logging in AWS
I recently implemented cross-account logging using AWS Observability Access Manager (OAM) for a comprehensive monitoring application my team and I are building. With it, we aim to create a single UI for understanding performance and costs across the products in our domain. When setting up cross-account […]
Core Concepts for Small Language Models

Foundational Computational Linguistics Concepts and How They Relate to Language Models
LLMs are all the rage, and to use them effectively, engineers and users of any kind can benefit from thinking about them in the context of some core Natural Language Processing (NLP) concepts. At New Math Data, we’ve been building a lot […]
Data Modeling for Developers

An Introduction to Data Modeling and Why It Matters for Development Teams
Data modeling is a critical yet often underrated skill in technology development and within development teams. This article aims to teach you the basics and explain why they matter. It will introduce you to the concept of a data model, […]