Working in People Analytics, you are often asked to tell the story of your company’s headcount and how the company evolved to what it is today. I often see this presented as a waterfall chart, which can be great, but it gets murky when trying to sha- 21414Murphy2025-03-23
Background – Simple on the Surface The metrics used for gauging performance of classification models are fairly straightforward, at least from a mathematical standpoint. Nevertheless, I have observed that many modellers and data scientists encounter- 20478Murphy2025-03-23
Teaching yourself data science can sure seem out of reach when all you’re inundated with on social media these days is stories of how people taught themselves data science in three months and were hired by a FAANG company quicker than you could say- 25724Murphy2025-03-23
As a data enthusiast, exploring a new dataset is an exciting endeavour. It allows us to gain a deeper understanding of the data and lays the foundation for successful analysis. Getting a good feeling for a new dataset is not always easy, and takes time. H- 23511Murphy2025-03-23
With Wessel Radstok Travelers on the Dutch Railways can use the app from the Dutch railway agency to plan their trip. While planning the trip, the app shows a prediction for the crowdedness of the train in question. This is shown as three categories: low- 29432Murphy2025-03-23
Photo by John McArthur on Unsplash Table of Contents ∘ Introduction ∘ Problem Statement ∘ Data ∘ AWS Architecture ∘ Data Storage with AWS S3 ∘ Designing the Schema ∘ ETL with AWS Glue ∘ Data Warehousing with AWS Redshift ∘ Extracting Insights with AWS Red- 24459Murphy2025-03-23
Reducing the size of Large Language Models with 8-bit quantization- 28789Murphy2025-03-23
Your data has a lot of things to say about which customers turned out to be money in the bank and which ones didn’t. Regardless of whether you work as a Lifecycle Marketing Manager in a B2B company where you optimize for driving free trials to paid- 24031Murphy2025-03-23
Once exclusively used in academia, in particular medical research, randomized control trials are now a popular method for businesses to make data-driven decisions. In particular, online A/B testing is easy to implement and potentially incredibly powerful- 29975Murphy2025-03-23
When evaluating and comparing machine learning solutions, your first go-to evaluation metric will probably be predictive power. It’s easy to compare different models with one single metric, and this is perfectly fine in Kaggle competitions. In real- 26358Murphy2025-03-23
Our weekly selection of must-read Editors' Picks and original features- 24937Murphy2025-03-23
Knowing how to interpret your model is essential to understand if it is not doing weird stuff. The more you know your model, the less likely you are to be surprised by its behavior when it goes to production. Also, the more domain you have over your model- 23018Murphy2025-03-23
Running Falcon-7B in the cloud as a microservice- 23106Murphy2025-03-23
Python is one of the most used programming languages in the world and provides developers with a wide range of libraries. Anyway, when it comes to data manipulation and scientific computation, we generally think of libraries such as Numpy, Pandas, or SciP- 25847Murphy2025-03-23
Overview of Your Journey Introduction – No Production, No Problems! Notebooks can be Used for Everything! Why Automate When You have the Time? Testing? Just Never Make Mistakes! Dependency Management in my Head! Wrapping Up 1 – Introduction - 23059Murphy2025-03-23
Planning to integrate some LLM service into your code? Here are some of the common challenges you should expect when doing so- 22901Murphy2025-03-23
The First-Years Chronicles of a Data Scientist in Tech A Data Scientist’s Guide to Turning Your Insights into Impactful Actions The killer skill combo to turn your work into meaningful results Imagine pouring your heart and soul into a project for m- 22758Murphy2025-03-23
Understanding data analysis, visualization techniques- 24762Murphy2025-03-23
Let me introduce to you our latest work, which has been accepted by ICML 2023: Change is Hard: A Closer Look at Subpopulation Shift. Machine learning models have shown great potential in many applications, but they often perform poorly on subgroups that a- 26908Murphy2025-03-23
Turn your LLM into a field expert- 22748Murphy2025-03-23
Why is ChatGPT only trained up until 2021?
Learn how to rearrange your code to achieve significant speed improvements.