Theoretical Deep Dive Into Linear Regression
Learn about why linear regression is how it is, and how to naturally extend it in various ways - 24842 Murphy 2025-03-23
The Docker Compose of ETL: Meerschaum Compose
This article is about Meerschaum Compose, a tool for defining ETL pipelines in YAML and a plugin for the data engineering framework...
Unsupervised Learning Series - Exploring Hierarchical Clustering
Let's explore how hierarchical clustering works and how it builds clusters based on pairwise distances.
GPT vs BERT: Which is Better?
Comparing two large language models: Approach and example
Bootstrap Tests for Beginners
Part 2 of Non-parametric tests for beginners
AI Frontiers Series: Human Resources
An introduction to the AI puzzle in untapped territory
Imperfections Unveiled: The Intriguing Reality Behind Our MLOps Course Creation
Bonus Lesson: Behind the Scenes of an 'Imperfect' ML Project - Lessons and Insights
Spider and Parallel Charts in R with the ggvanced Package
An R package for effective visualization of multiple variables
Rotating On-Call for Operational and Support: A Must for Data Teams
A rotating on-call schedule for operations, support, and tech debt frees the rest of the team to do great development
Is Your LLM Application Ready for the Public?
Key concerns when productionizing LLM-based applications
Breaking boundaries in protein design with a new AI model that understands interactions with anyR
This new model could help expand the applicability of ML models for engineering proteins with desired functions by tuning their specific...
Building Machine Learning Operations for Businesses
A Blueprint for Effective MLOps to Support Your AI Strategy
5 Signs That Your Data is Modeled Poorly
Common Challenges in the Cloud Era
Using Plotly 3D Surface Plots to Visualise Geological Surfaces
Visualising the Subsurface using Python Data Visualisation Libraries
Too Many Features? Let's Look at Principal Component Analysis
Home-brewed machine learning model series
Say Once! Repeating Words Is Not Helping AI
How and why is repeating tokens harming LLMs? Why is this a problem?
Understanding Bayesian Marketing Mix Modeling: A Deep Dive into Prior Specifications
Exploring model specification with Google's LightweightMMM
All You Need to Know to Build Your First LLM App
A step-by-step tutorial to document loaders, embeddings, vector stores and prompt templates
The Power of OpenAI's Function Calling in Data Pipelines: A Comprehensive Guide
Transforming Data Pipelines with OpenAI's Function Calling Feature: Implementing an Email Sending Workflow Using PostgreSQL and FastAPI
To Really Learn a New Topic, Take Your Time
Our weekly selection of must-read Editors' Picks and original features
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?
Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.