DeepGuide for DeepSeek

R Toolkit for People Analytics: Telling Your Headcount Story
Working in People Analytics, you are often asked to tell the story of your company’s headcount and how the company evolved to what it is today. I often see this presented as a waterfall chart, which can be great, but it gets murky when trying to sha
21414Murphy2025-03-23
How to Avoid Being Fooled by Model Accuracy
Background – Simple on the Surface The metrics used for gauging performance of classification models are fairly straightforward, at least from a mathematical standpoint. Nevertheless, I have observed that many modellers and data scientists encounter
20478Murphy2025-03-23
How to Create a 1-Year Data Science Self-Study Plan Using the Seasonality of Your Brain
Teaching yourself data science can sure seem out of reach when all you’re inundated with on social media these days is stories of how people taught themselves data science in three months and were hired by a FAANG company quicker than you could say
25724Murphy2025-03-23
Exploratory Data Analysis: Unraveling the Story Within Your Dataset
As a data enthusiast, exploring a new dataset is an exciting endeavour. It allows us to gain a deeper understanding of the data and lays the foundation for successful analysis. Getting a good feeling for a new dataset is not always easy, and takes time. H
23511Murphy2025-03-23
Real-Time Crowdedness Predictions for Train Travelers
With Wessel Radstok Travelers on the Dutch Railways can use the app from the Dutch railway agency to plan their trip. While planning the trip, the app shows a prediction for the crowdedness of the train in question. This is shown as three categories: low
29432Murphy2025-03-23
Examining Flights in the U.S. with AWS and Power BI
Photo by John McArthur on Unsplash Table of Contents ∘ Introduction ∘ Problem Statement ∘ Data ∘ AWS Architecture ∘ Data Storage with AWS S3 ∘ Designing the Schema ∘ ETL with AWS Glue ∘ Data Warehousing with AWS Redshift ∘ Extracting Insights with AWS Red
24459Murphy2025-03-23
Introduction to Weight Quantization
Reducing the size of Large Language Models with 8-bit quantization
28789Murphy2025-03-23
The hidden cost of data quality issues on the return of ad spend
Your data has a lot of things to say about which customers turned out to be money in the bank and which ones didn’t. Regardless of whether you work as a Lifecycle Marketing Manager in a B2B company where you optimize for driving free trials to paid
24031Murphy2025-03-23
Save your A/B testing by avoiding those 3 costly mistakes
Once exclusively used in academia, in particular medical research, randomized control trials are now a popular method for businesses to make data-driven decisions. In particular, online A/B testing is easy to implement and potentially incredibly powerful
29975Murphy2025-03-23
How to Compare ML Solutions Effectively?
When evaluating and comparing machine learning solutions, your first go-to evaluation metric will probably be predictive power. It’s easy to compare different models with one single metric, and this is perfectly fine in Kaggle competitions. In real
26358Murphy2025-03-23
Learning the Ropes for Your Next LangChain Project
Our weekly selection of must-read Editors' Picks and original features
24937Murphy2025-03-23
Mastering Model Interpretability: A Comprehensive Look at Partial Dependence Plots
Knowing how to interpret your model is essential to understand if it is not doing weird stuff. The more you know your model, the less likely you are to be surprised by its behavior when it goes to production. Also, the more domain you have over your model
23018Murphy2025-03-23
Deploying Falcon-7B Into Production
Running Falcon-7B in the cloud as a microservice
23106Murphy2025-03-23
Beyond Numpy and Pandas: Unlocking the Potential of Lesser-Known Python Libraries
Python is one of the most used programming languages in the world and provides developers with a wide range of libraries. Anyway, when it comes to data manipulation and scientific computation, we generally think of libraries such as Numpy, Pandas, or SciP
25847Murphy2025-03-23
How to *Not* Get Machine Learning Models in Production
Overview of Your Journey Introduction – No Production, No Problems! Notebooks can be Used for Everything! Why Automate When You have the Time? Testing? Just Never Make Mistakes! Dependency Management in my Head! Wrapping Up 1 – Introduction &#
23059Murphy2025-03-23
The Complexities and Challenges of Integrating LLMs into Applications
Planning to integrate some LLM service into your code? Here are some of the common challenges you should expect when doing so
22901Murphy2025-03-23
Step-by-Step Guide: The Art of Winning Stakeholders as a Data Scientist to Drive Impact
The First-Years Chronicles of a Data Scientist in Tech A Data Scientist’s Guide to Turning Your Insights into Impactful Actions The killer skill combo to turn your work into meaningful results Imagine pouring your heart and soul into a project for m
22758Murphy2025-03-23
Python water quality EDA and Potability analysis
Understanding data analysis, visualization techniques
24762Murphy2025-03-23
On Learning in the Presence of Underrepresented Groups
Let me introduce to you our latest work, which has been accepted by ICML 2023: Change is Hard: A Closer Look at Subpopulation Shift. Machine learning models have shown great potential in many applications, but they often perform poorly on subgroups that a
26908Murphy2025-03-23
How to add Domain-Specific Knowledge to an LLM Based on Your Data
Turn your LLM into a field expert
22748Murphy2025-03-23