Exploratory Data Analysis: Lost Property Items on the Transport of London
Getting statistical insights with Python, Pandas, and Plotly- 26648Murphy2025-03-22
Building a RAG chain using LangChain Expression Language (LCEL)
Learning the building blocks of LCEL to develop increasingly complex RAG chains
Prompt Engineering for Coding Tasks
Enhancing Code Generation with LLMs via Prompt Engineering
In Defense of LLMs in Data Science: What ChatGPT Can and Can't Do for Your Data Science Career
ChatGPT can take your data science game to the next level - if you know how to use it.
Feature Engineering with Microsoft Fabric and Dataflow Gen2
Fabric Madness part 3
Quantifying the Complexity and Learnability of Strategic Classification Problems
How generalizing the notion of VC dimension to a strategic setting can help us understand whether or not a problem is learnable
How to Train a Decision Tree Classifier… In SQL
SQL can now replace Python for most supervised ML tasks. Should you make the switch?
How I Self-Study Data Science
My techniques and methods for learning data science and technical fields
How to Generate Videos with Open-Sora-Plan Video Generation Model
Learn how to generate images with one of the latest models within image generation.
Multimodal Large Language Models & Apple's MM1
This blog post will go into the architecture and findings behind Apple's "MM1: Methods, Analysis & Insights from Multimodal LLM...
Enhancing Readability of Python Code via Annotations
Annotations are a powerful development tool. Read this article to learn how and where to use them.
What if ChatGPT is Actually a Tour Guide From Another World? (Part 2)
Part I of this post hypothesized that ChatGPT is a tour guide leading us through a high-dimensional version of the computer game Minecraft. Outrageous? Absolutely, but I tested the hypothesis anyway and stumbled on something beautiful and mysterious insid...
Image Contouring with OpenCV
First steps for beginners
Write-Audit-Publish for Data Lakes in Pure Python (no JVM)
An open source implementation of WAP using Apache Iceberg, Lambdas, and Project Nessie, all running entirely in Python
Learning Generalist Models for Anomaly Detection
Generalist Anomaly Detection (GAD) aims to train one single detection model that can generalize to detect anomalies in diverse datasets...
One Mindset Shift That Will Make You a Better Data Scientist
Actually, any good employee should adopt this mindset
Coverage vs. Accuracy: Striking a Balance in Data Science
The art of getting quick gains with agile model production
A New Manager's Guide to High Performing Data Science Teams
Lessons from someone who manages a team of 8
Monitor Data Pipelines Using Snowflake's Data Metric Functions
Build Trusted Data Platforms with Google SRE Principles
Exploring the Power of Natural Language Data Manipulation with PandasAI
How generative AI supercharges Pandas capabilities
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?
Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.