Real-Time Analytics Solution for Usage-Based API Billing and Metering
Design a real-time analytics pipeline for tracking API invocation usage with Apache APISIX, Redpanda, and Apache Pinot.- 20410Murphy2025-03-22
Behind The Scenes: Explaining My Work As A Data Scientist
A breakdown of what my data science role truly entails- 28265Murphy2025-03-22
Language as a Universal Learning Machine
Saying is believing. Seeing is hallucinating.- 28067Murphy2025-03-22
From Data to Dashboard: Visualizing the Ancient Maritime Silk Road with Dash Leaflet and SeaRoute
Build an interactive map visualization displaying the shortest path between two points at sea within the Python Dash Framework- 25236Murphy2025-03-22
Roll Up Your Sleeves: 9 Data and Machine Learning Project Walkthroughs You Should Explore
Our weekly selection of must-read Editors' Picks and original features- 25958Murphy2025-03-22
Langchain's built-in eval metrics for AI output: how are they different?
Quick Dive into the Built-in Language Model Evaluation Metrics in LangChain for AI Development- 28349Murphy2025-03-22
Building a marketing data science team from scratch
From scratch to a 6-member team: How I built Skyscanner's marketing data science team, proving value by being focused and strong...- 27365Murphy2025-03-22
The Art of Stress Management as a Data Scientist
What you do when you're not a data scientist could help you become a better data scientist- 27426Murphy2025-03-22
From Assumptions to Accuracy: The Role of Conditional Probability in Real-World Predictions
Conditional probability is better than probability ; IF you have the relevant information- 26286Murphy2025-03-22
Exploring RAG Applications Across Languages: Conversing with the Mishnah
Building a Cross-Lingual RAG System for Rabbinic Texts- 27007Murphy2025-03-22
AI-based organizational network analysis
The new generation of AI tools provide out-of-the-box solutions for complex problems that were not possible (or scalable) before - or only...- 23275Murphy2025-03-22
How to apply RLS based on multiple columns in Power BI
Usually, Data Access with RLS is based on Data from one column. But what when Data access must be controlled by multiple columns?- 28971Murphy2025-03-22
How to: Handle Missing Data for Time Series
Should you drop, interpolate, or impute?- 22251Murphy2025-03-22
Interpretable Outlier Detection: Frequent Patterns Outlier Factor (FPOF)
An outlier detector method that supports categorical data and provides explanations for the outliers flagged- 21945Murphy2025-03-22
Difference-in-Difference 101
What is Difference-in-difference (DiD or DD or diff-in-diff)? Why do we care about DiD?- 29433Murphy2025-03-22
Why Representation Finetuning is the Most Efficient Approach Today?
A Step-by-Step Guide to Representation Finetuning LLAMA3- 22831Murphy2025-03-22
Embedding Markdown Files in a Streamlit Dashboard
Simplify Streamlit app code by moving longer static contents to Markdown files.- 28632Murphy2025-03-22
Your End-to-End Product Analytics Strategy
What gets measured gets managed- 21634Murphy2025-03-22
Evaluating ChatGPT in Data Science: Churn Prediction Analysis As An Example
Can ChatGPT assist or even replace a data scientist?- 22315Murphy2025-03-22
Streamlining E-commerce: Leveraging Entity Resolution for Product Matching
How Google figures out the price of a product across websites- 26885Murphy2025-03-22
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.