There is no use avoiding reality. Data science, and more broadly, data-driven structures, are at the center of the society we are currently building. When the computer science craze first hit in the early 2000s, many noted that computer science would beco- 20438Murphy2025-03-23
Part one of a comprehensive, practical guide to CLV techniques and real-world use-cases- 25501Murphy2025-03-23
Boost performance with unlabeled data.- 27844Murphy2025-03-23
Delve into PackedInts, VInt, FixedBitSet, and RoaringDocIdSet (Roaring Bitmaps)- 25226Murphy2025-03-23
I published an article last year on Data Architecture trends. This was before Large Language Models (LLMs) became all the rage and influenced most industries. Gartner reports, "Venture capital firms have invested over $1.7 billion in generative AI so- 21657Murphy2025-03-23
An exploration of Microsoft Research's paper 'Textbooks Are All You Need'- 20376Murphy2025-03-23
How can we use deep learning to convert between strings without getting "boggled"? (Image by Andrew Malone, CC BY 2.0) This article describes TaatikNet and how to easily implement seq2seq models. For code and documentation, see the TaatikNet Git- 20293Murphy2025-03-23
Geological Lithology Variations Within The Zechstein Group of the Norwegian Continental Shelf- 27941Murphy2025-03-23
Let's understand the big picture behind generative AI- 24853Murphy2025-03-23
Understanding time series seasonality- 26683Murphy2025-03-23
Many great developments in data science have been made in the last decade but despite these achievements, many projects never see the light of day. As data scientists we must not only show strong technical skills but also understand the business context,- 20252Murphy2025-03-23
As the sun began to dim and the city lights came to life, the inevitability of a late night in the office settled in. I found myself in a race against time. A crucial sales presentation was looming less than a day away, and success hinged on an unfulfille- 23410Murphy2025-03-23
Using data aggregation techniques can help us transform an overwhelming and almost incomprehensible numeric dataset into something that is easily digestible and much more reader-friendly. The process of data aggregation involves summarising multiple data- 25949Murphy2025-03-23
Our weekly selection of must-read Editors' Picks and original features- 26738Murphy2025-03-23
Leverage Kedro to build production-ready machine learning pipelines- 26786Murphy2025-03-23
The First-Years Chronicles of a Data Scientist in Tech Congratulations and welcome to the adventure! You’re a Data Scientist in the making and your journey is only getting started! You’ve graduated from college and now you’re diving into- 24012Murphy2025-03-23
The First-Years Chronicles of a Data Scientist in Tech This article is part two of the "5 Essential Lessons for Junior Data Scientists I Learned at Spotify" series. Make sure to check out Part 1 first! 5 Essential Lessons for Junior Data Scienti- 26745Murphy2025-03-23
Learn about Common Table Expression (CTE) and Window Functions- 26305Murphy2025-03-23
Guide This story will be somewhat different from what I usually post. It won’t be an intro to specific tools and techniques, nor a tutorial or a practical case. This time I want to answer a question I’ve been receiving through LinkedIn since I- 20674Murphy2025-03-23
Implementation from scratch vs numpy The Fourier transform algorithm is considered one of the greatest discoveries in all of mathematics. French mathematician Jean-Baptiste Joseph Fourier laid the foundation for harmonic analysis in his book "Théorie- 21143Murphy2025-03-23
Why is ChatGPT only trained up until 2021?
Learn how to rearrange your code to achieve significant speed improvements.