Bite Size Data Science: Heteroscedastic Robust Errors
How to adjust standard errors for heteroscedasticity and why it works - 20327 Murphy 2025-03-22
Data Scientists Work in the Cloud. Here's How to Practice This as a Student (Part 2: Python)
Because data scientists don't write production code in the Udemy code editor
Constructive Heuristics in Discrete Optimization
Obtain initial solutions for combinatorial optimization problems with Python examples
Orchestrating a Dynamic Time-series Pipeline in Azure
Explore how to build, trigger, and parameterize a time-series data pipeline with ADF and Databricks, accompanied by a step-by-step tutorial
The History of Bodybuilding Through Network Visualization
Constructing the Shared Podium Graph of Mr. Olympia Winners (1965-2023) using Python and Gephi.
Data Science Portfolios, Speeding Up Python, KANs, and Other May Must-Reads
The stories that resonated the most with our community in the past month
Flash attention (Fast and Memory-Efficient Exact Attention with IO-Awareness): A deep dive
Flash attention is a power-optimized transformer attention mechanism that provides a 15% efficiency gain
Long Short-Term Memory (LSTM): Improving RNNs
How state-of-the-art RNNs work
Writing Powerful Programming Articles: A Guide for Success
Reflections on 4+ Years of Publishing Programming Articles
How to Better Communicate as a Data Scientist
Most valuable lessons from my time at McKinsey
Introduction to spatial analysis of cells for neuroscientists (part 1)
An approach using point pattern analysis (PPA) with spatstat
AI Model Training with JAX
Hit the road to super-fast AI/ML development
Build Your Own ChatGPT-like Chatbot with Java and Python
Creating a custom LLM inference infrastructure from scratch
Scalable OCR Pipelines using AWS
A survey of 3 different OCR pipeline patterns and their pros and cons
N-HiTS – Making Deep Learning for Time Series Forecasting More Efficient
A deep dive into how N-HiTS works and how you can use it
Implementing Generative and Analytical Models to Create and Enrich Knowledge Graphs for RAGs
Evaluate generative and analytical models for building Knowledge Graphs that power high-performing RAGs
Deep Dive into Anthropic's Sparse Autoencoders by Hand ✍️
Explore the concepts behind the interpretability quest for LLMs
Automating Data Pipelines with Python & GitHub Actions
A simple (and free) way to run data workflows
What 10 Years at Uber, Meta and Startups Taught Me About Data Analytics
Advice for Data Scientists and Managers
How to Evaluate Search Relevance and Ranking
This article explores the key metrics used for evaluating Search Relevance and Ranking, empowering you to optimize your Search Engine.
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?
Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.