Similarity Search, Part 3: Blending Inverted File Index and Product Quantization
In the first two parts of this series we have discussed two fundamental algorithms in information retrieval: inverted file index and...- 25779Murphy2025-03-23
3 Essential PostgreSQL Functions You Deserve To Know
Three little-known functions that will elevate your data game- 25772Murphy2025-03-23
5 Data Science Portfolio Mistakes
How to make a portfolio that will get you hired- 23062Murphy2025-03-23
Unlocking the Power of Interaction Terms in Linear Regression
Learn how to make your linear models more flexible by including interaction terms- 22230Murphy2025-03-23
The Dreaded Antagonist: Data Leakage in Machine Learning
Probably one of the most underappreciated concepts in Machine Learning- 26809Murphy2025-03-23
Data Entropy – More Data, More Problems?
How to navigate and embrace complexity in a modern data organisation.- 28882Murphy2025-03-23
Creating Your Personalized Voice Assistant with GPT and Whisper
A Step-by-Step Guide- 21521Murphy2025-03-23
Change your focus over time in DAX Measures
How to determine if your investment has paid off when you advertise products over time- 29849Murphy2025-03-23
How to Evaluate the Performance of Your ML/ AI Models
An accurate evaluation is the only way to performance improvement- 30019Murphy2025-03-23
How to identify bird species by their songs?
A Kick-off for Applying ML on Sounds- 22807Murphy2025-03-23
Tabyl – a frequency table for the modern R user
Out with the old, in with the new!- 24241Murphy2025-03-23
Unraveling the Design Pattern of Physics-Informed Neural Networks: Part 02
Enhancing PINN training stability through ensemble learning and dynamic solution interval expansion- 28313Murphy2025-03-23
The Notorious XGBoost
Revisiting one of the most awarded machine learning algorithms- 21185Murphy2025-03-23
Mastering ChatGPT: Effective Summarization with LLMs
How to Prompt ChatGPT to get High-Quality Summaries- 22065Murphy2025-03-23
Visualizing Social Networks for Better Insights: Analyzing and Mapping Social Relationships withR
A beginner's guide to using Python's NetworkX library to conduct social network analysis- 26598Murphy2025-03-23
From Data Engineering to Prompt Engineering
Solving data preparation tasks with ChatGPT- 25078Murphy2025-03-23
Need for Speed: Comparing Pandas 2.0 with Four Python Speed-Up Libs (with Code)
Polars, Dask, RAPIDS.ai cuDF, and Numba are compared against Pandas 2.0 with pyarrow in the backend, vectorization, and itertuples()...- 20365Murphy2025-03-23
Unveiling the Dropout Layer: An Essential Tool for Enhancing Neural Networks
Understanding the Dropout Layer: Improving Neural Network Training and Reducing Overfitting with Dropout Regularization- 26216Murphy2025-03-23
Cyberpunking Your Matplotlib Figures
Taking Your Matplotlib Figures from Boring to Interesting with a Few Lines of Code- 25222Murphy2025-03-23
The Art of Spending: Optimizing Your Marketing Budget with Nonlinear Optimization
Introduction to CVXPY to maximize marketing ROI- 21283Murphy2025-03-23
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.