The SQL Unit Testing Landscape: 2023
Advancing speed and safety in SQL development- 29054Murphy2025-03-23
Geospatial Data Analysis with GeoPandas
Learn how to manipulate and visualize vector data with Python's GeoPandas- 20405Murphy2025-03-23
A Comprehensive Overview of Regression Evaluation Metrics
An extensive reference into commonly used regression evaluation metrics and their practical applications across various scenarios- 24940Murphy2025-03-23
Catch Me If You Can: A Guide to Exception Handling in Python
Unlock Python’s full potential through smart exception management Photo by Cookie the Pom on Unsplash As a software developer, dealing with exceptions is often seen as a necessary evil. Yet, mastery of Python’s exception handling system can ma- 24089Murphy2025-03-23
Non-Negative Matrix Factorization (NMF) for Dimensionality Reduction in Image Data
Discussing theory and implementation with Python and Scikit-learn- 24481Murphy2025-03-23
Using Tqdm with Asyncio in Python
An efficient way to monitor concurrent tasks' progress- 24210Murphy2025-03-23
Machine Learning, Illustrated: Opening Black Box Models with SHAP
How to explain any machine learning model using SHAP- 21648Murphy2025-03-23
The Ultimate Preprocessing Pipeline for Your NLP Models
Get the most out of training NLP ML models by feeding the best possible input- 23451Murphy2025-03-23
The Matrix Algebra of Linear Regression
Looking under the hood at the matrix operations behind linear regression- 25694Murphy2025-03-23
Time Series Data Analysis with sARIMA and Dash
Introducing a Dash web app that guides the analysis of time series datasets, using sARIMA models | Live app | Git Hub- 25840Murphy2025-03-23
The Infinite Babel Library of LLMs
Open-source, data, and attention: How the future of LLMs will change- 20586Murphy2025-03-23
Don't Be Afraid to Go Beyond Online Coding Courses
Interactive coding courses are great, but (by design) they'll only get you so far. Here's how to maximise their potential while overcoming...- 24641Murphy2025-03-23
CRUD with Pinecone
A simple guide for getting started with vector database from the traditional database perspective- 23976Murphy2025-03-23
Integrating Neural Net: Deriving the Normal Distribution CDF
Integrating a function using a neural network (with code)- 21499Murphy2025-03-23
May Edition: The Data of Urban Spaces
How data helps us make sense of cities- 26604Murphy2025-03-23
See what you SAM
Generate and visualize Segment Anything Model predictions- 24735Murphy2025-03-23
Mastering ExternalTaskSensor in Apache Airflow: How to Calculate Execution Delta
External Task Sensors stop bad data from trickling downstream in a data pipeline. Leverage them to create a reliable data infrastructure.- 24090Murphy2025-03-23
Summarize Podcast Transcripts and Long Texts Better with NLP and AI
Why the existing summarization approach is flawed, and a walkthrough of how to do better- 23252Murphy2025-03-23
How Quantum Physics Broke the Laws of Statistics
Demystifying the Data Science Behind 2022's Physics Nobel Prize- 26363Murphy2025-03-23
Getting Started with Databricks
A Beginners Guide to Databricks- 26425Murphy2025-03-23
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.