Graph Machine Learning @ ICML 2023
Recent advancements and hot trends, August 2023 edition - 24374 Murphy 2025-03-23
Building PCA from the Ground Up
Supercharge your understanding of Principal Component Analysis with a step-by-step derivation
Data Analyst Guide to Data Cleaning
Although there are many resources to learn technical skills, there are few that go in-depth on how to clean data – an essential skill needed for data analysts. You may think you can apply the same set of rules to clean data, but that’s not alwa...
Software Engineering Best Practices for Writing Maintainable ML Code
Advanced coding tips for data scientists
Regulating Generative AI
How Well Do LLMs Comply with the EU AI Act?
Data Leakage: What It Is and Why It Causes Our Predictive Systems to Fail
Data leakage represents, together with over/underfitting, the main cause of failure of machine learning projects that go into production
College Football Conference Realignment - Regression
Welcome to part 2 of my series on conference realignment! Last summer, when conference realignment was in full swing, Tony Altimore published a study on Twitter that inspired me to do my own conference realignment analysis. This series is organized into fo...
College Football Conference Realignment – node2vec
Using unsupervised learning to maximize media rights value
Latest in CNN Kernels for Large Image Models
A high-level overview of the latest convolutional kernel structures in Deformable Convolutional Networks, DCNv2, DCNv3
Unsupervised Learning Series - Exploring Self-Organizing Maps
Learn how Self-Organizing Maps work and why they are a useful unsupervised learning algorithm
ChatGPT's energy use per query
How much electricity does ChatGPT use to answer one question?
Data-Driven Dispatch
Using supervised learning to predict service callouts to Chicago car collisions
Cracking Open the Hugging Face Transformers Library
A quick-start guide to using open-source LLMs
Initialize Rust Constants at Runtime with lazy_static
Define non-constant static variables with lazy initialization
Spoken language recognition on Mozilla Common Voice – Part II: Models.
This is the second article on spoken language recognition based on the Mozilla Common Voice dataset. In the first part, we discussed data...
How to Chat With Any File from PDFs to Images Using Large Language Models – With Code
Complete guide to building an AI assistant that can answer questions about any file
Decoding the Symphony of Sound: Audio Signal Processing for Musical Engineering
The Ultimate Guide to Time and Frequency Domain Audio Feature Extraction using Python
Building a Comment Toxicity Ranker Using Hugging Face's Transformer Models
Catching up on NLP and LLM (Part I)
College Football Conference Realignment – Exploratory Data Analysis in Python
Exploring the changing CFB landscape as a data scientist
Multilevel Regression Models and Simpson's paradox
Avoiding false conclusions with the proper tooling
Genius Cliques: Mapping out the Nobel Network
Combining Network Science, Data Visualization, and Wikipedia to uncover hidden connections between all the Nobel laureates.
Data Science Expertise Comes in Many Shapes and Forms
Our weekly selection of must-read Editors' Picks and original features
