Graph Machine Learning @ ICML 2023
Recent advancements and hot trends, August 2023 edition- 24333Murphy2025-03-23
Building PCA from the Ground Up
Supercharge your understanding of Principal Component Analysis with a step-by-step derivation- 30081Murphy2025-03-23
Data Analyst Guide to Data Cleaning
Although there are many resources to learn technical skills, there are few that go in-depth on how to clean data – an essential skill needed for data analysts. You may think you can apply the same set of rules to clean data but that’s not alwa- 26017Murphy2025-03-23
Software Engineering Best Practices for Writing Maintainable ML Code
Advanced coding tips for data scientists- 26757Murphy2025-03-23
Regulating Generative AI
How Well Do LLMs Comply with the EU AI Act?- 28552Murphy2025-03-23
Data Leakage: What It Is and Why It Causes Our Predictive Systems to Fail
Data leakage represents, together with over/underfitting, the main cause of failure of machine learning projects that go into production- 26172Murphy2025-03-23
College Football Conference Realignment - Regression
Welcome to part 2 of my series on conference realignment! Last summer when conference realignment was in full swing, Tony Altimore published a study on Twitter that inspired me to do my own conference realignment analysis. This series is organized into fo- 21094Murphy2025-03-23
College Football Conference Realignment – node2vec
Using unsupervised learning to maximize media rights value- 24852Murphy2025-03-23
Latest in CNN Kernels for Large Image Models
A high-level overview of the latest convolutional kernel structures in Deformable Convolutional Networks, DCNv2, DCNv3- 26879Murphy2025-03-23
Unsupervised Learning Series -Exploring Self-Organizing Maps
Learn how Self-Organizing Maps work and why they are a useful unsupervised learning algorithm- 25355Murphy2025-03-23
ChatGPT's energy use per query
How much electricity does ChatGPT use to answer one question?- 26395Murphy2025-03-23
Data-Driven Dispatch
Using supervised learning to predict service callouts to Chicago car collisions- 23694Murphy2025-03-23
Cracking Open the Hugging Face Transformers Library
A quick-start guide to using open-source LLMs- 20544Murphy2025-03-23
Initialize Rust Constants at Runtime with lazy_static
Define non-constant static variables with lazy initialization- 27430Murphy2025-03-23
Spoken language recognition on Mozilla Common Voice – Part II: Models.
This is the second article on spoken language recognition based on Mozilla Common Voice dataset. In the first part we discussed data...- 25413Murphy2025-03-23
How to Chat With Any File from PDFs to Images Using Large Language Models – With Code
Complete guide to building an AI assistant that can answer questions about any file- 23246Murphy2025-03-23
Decoding the Symphony of Sound: Audio Signal Processing for Musical Engineering
The Ultimate Guide to Time and Frequency Domain Audio Feature Extraction using Python- 23624Murphy2025-03-23
Building a Comment Toxicity Ranker Using Hugging Face's Transformer Models
Catching up on NLP and LLM (Part I)- 27685Murphy2025-03-23
College Football Conference Realignment – Exploratory Data Analysis in Python
Exploring the changing CFB landscape as a data scientist- 23372Murphy2025-03-23
Multilevel Regression Models and Simpson's paradox
Avoiding false conclusions with the proper tooling- 24604Murphy2025-03-23
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.