Understanding Concept Drift: A Simple Guide
How changes in the distribution arise, and the impact of verification delay. - 27444 Murphy 2025-03-22
The German Tank Problem
Estimating your chances of winning the lottery with sampling - 27022 Murphy 2025-03-22
Set up a PyPI mirror in an AWS private environment with Terraform
Use AWS CodeArtifact as a fully secured solution for setting up a PyPI mirror in a private AWS environment. - 26990 Murphy 2025-03-22
How to Create an RFM Model in BigQuery
Learn what an RFM model is, how to create one, and how to segment on the results - 30010 Murphy 2025-03-22
How to Use SQLAlchemy to Make Database Requests Asynchronously
Learn to use SQLAlchemy asynchronously in different scenarios - 25975 Murphy 2025-03-22
How to Generate Instruction Datasets from Any Documents for LLM Fine-Tuning
Generate high-quality synthetic datasets economically using lightweight libraries - 28305 Murphy 2025-03-22
Visualizing household income from government sources – A guided walkthrough
Using Python code that you can apply to your own dataset to generate your own charts and GIFs - 20489 Murphy 2025-03-22
Setting Up PyTorch with GPU Support on EC2 without Preconfigured AMIs
A cost-effective approach - 25306 Murphy 2025-03-22
How I Became a Data Scientist Before I Joined LinkedIn
These hands-on projects were crucial for career transformations - 20999 Murphy 2025-03-22
Time Series Forecasting: Beyond the Basics
Tips and tricks to unlock the true potential of time series forecasting in the real world - 22066 Murphy 2025-03-22
How I Learned To Code (No CS Degree, No Bootcamp)
A rundown of my coding journey and advice I wish I had - 22485 Murphy 2025-03-22
Robust Statistics for Data Scientists Part 2: Resilient Measures of Relationships Between Variables
From basic to advanced techniques for outlier-rich data analysis. - 22047 Murphy 2025-03-22
How to Improve LLMs with RAG
A beginner-friendly introduction w/ Python code - 27544 Murphy 2025-03-22
A Sharp and Solid Outline of 3D Grid Neighborhoods
How 2D grid-based algorithms can be brought into the 3D world - 27305 Murphy 2025-03-22
PyTorch and MLX for Apple Silicon
A side-by-side CNN implementation and comparison - 27956 Murphy 2025-03-22
Data Science for Sustainability - Green Inventory Management
Simulate the impact of store delivery frequency on the CO2 emissions of a fashion retailer - 24063 Murphy 2025-03-22
A Priority Based Scheduler for Amazon SageMaker Training Jobs
Optimizing the use of limited AI training accelerators - Part 2 - 28512 Murphy 2025-03-22
5 Redshift SQL Functions You Need to Know
With code examples on how to use them - 27473 Murphy 2025-03-22
Revolutionize Web Browsing with AI
Unlock Interactive Online Experiences Using GPT-4V and Puppeteer - 29072 Murphy 2025-03-22
Experiment Tracking & Hyperparameter Tuning: Organize Your Trials with DVC
Learn how to avoid getting lost with all the experiments while tuning your model's hyperparameters - 23635 Murphy 2025-03-22
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?
Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.