Core Concepts for Small Language Models

Foundational Computational Linguistics Concepts and how they relate to Language Models. LLMs are all the rage, and to use them effectively as engineers and users of any kind, we could benefit from thinking about them in the context of some core Natural Language Processing (NLP) concepts. At New Math Data, we’ve been building a lot […]
Nurturing a Better World: Lessons in Stewardship and Compassion

From Ancient Wisdom to Thriving in Modern Times. Introduction: In a world where ownership and control often dominate our thoughts and actions, it’s easy to forget the transient nature of our existence. We spend so much time acquiring and holding onto things that we lose sight of what truly matters. Instead of owning people, places, […]
The Melody of Data: Turning Unary Encoding into Bird-like Audio

Exploring the Intersection of Information Theory and Natural Sound. Introduction: In our increasingly digital world, the way we store and transmit information is becoming ever more crucial. Imagine trying to pack a suitcase with as much efficiency as possible, fitting in everything you need without wasting any space. In information theory, this idea translates into […]
Experimenting with the Databricks Mosaic AI Vector Search using Python: a beginner example

Introduction: After exploring how vector databases work and trying the AWS OpenSearch vector database in a previous blog post, the goal of this article is to experiment with a similar product from Databricks – Mosaic AI Vector Search. We will create a new Vector Search endpoint and index, load the toy dataset, try to perform similarity […]
Unlocking Efficiency: Proven Strategies for Getting Things Done

Practical Techniques for a Balanced and Productive Workflow. Introduction: Productivity and efficiency are critical components of professional success. A structured and disciplined approach to managing tasks can significantly enhance both individual and team performance. The implementation of well-established productivity techniques allows for better time management, clearer prioritization, and improved focus. This article explores several proven […]
September Databricks Updates

Clean Room Monitoring, VS Code Extension, and System Tables GA. Introduction: As organizations continue to scale their data and AI operations, effective monitoring, streamlined development, and transparent billing have become crucial for maintaining efficiency. This September, Databricks has introduced several powerful updates designed to address these needs. From the ability to track clean room usage […]
Designing High-Quality Code Screens for AWS Data Engineers

Identifying Top Talent with Focused Assessments. Introduction: Hiring the right AWS data engineer can often feel like a daunting task. The stakes are high, and the margin for error is slim. In this competitive field, we need more than just a résumé filled with buzzwords. We need effective code screens that can identify candidates who […]
The Hidden Drivers of Workplace Behavior

Breaking the Chains of On-the-Job Anxiety. Introduction: Understanding what drives our behavior at work can unlock greater productivity and satisfaction. The fight-or-flight response, an ancient survival mechanism, has transformed in our professional lives. While our ancestors faced physical dangers, we now encounter psychological challenges: deadlines, performance reviews, and career advancement pressures. This […]
What’s New in AWS: Highlighting Key September 2024 Releases

Introduction: AWS continues to lead the cloud industry with a series of powerful updates designed to enhance efficiency, scalability, and creativity. From groundbreaking improvements in storage management to cutting-edge AI tools, the latest releases showcase AWS’s commitment to evolving business needs. In this post, we’ll explore key updates, including Amazon S3’s new conditional writes, advanced […]
Databricks Notebooks Reimagined: Simplicity Meets Power

Streamlined UX, Powerful Tools, and AI Integration. Introduction: Having powerful and intuitive tools is crucial for success. Databricks has recently unveiled the next generation of its Notebooks, bringing a host of new features designed to enhance productivity and ease of use. This update includes a modernized user interface, advanced Python capabilities, and AI-powered authoring tools, […]