Transforming Next-Token Prediction into Classification with LLMs
From tokens to labels: Performing classification with large language models- 28317Murphy2025-03-22
A Step-By-Step Guide to Building a Programming Language
Building a programming language from scratch in a few hours- 24597Murphy2025-03-22
Foundation Models in Graph & Geometric Deep Learning
In this post, we argue that the era of Graph FMs has already begun and provide a few examples of how one can use them already today.- 26353Murphy2025-03-22
How I Dockerized Apache Flink, Kafka, and PostgreSQL for Real-Time Data Streaming
Integrating pyFlink, Kafka, and PostgreSQL using Docker- 25655Murphy2025-03-22
Exploratory Data Analysis in 11 Steps
Starting an exploratory data analysis can be daunting. How do you know what to look at? How do you know when you're done? What if you miss...- 20247Murphy2025-03-22
Voyage Multilingual 2 Embedding Evaluation
Compared to OpenAI, Cohere, Google, and E5- 26743Murphy2025-03-22
Optimizing Sigma Rules in Spark with the Aho-Corasick Algorithm
Extending Spark for improved performance in handling multiple search terms- 26580Murphy2025-03-22
3 Painful Mistakes I Made as a Junior Data Scientist
Learn from them to fast-track your career today- 27861Murphy2025-03-22
Managing Pivot Table and Excel Charts with VBA
Save precious hours by automating working with pivot tables and charts using VBA- 27916Murphy2025-03-22
Using Decision Trees for Exploratory Data Analysis
Add decision trees to your EDA and get great insights from the start- 23439Murphy2025-03-22
AI Agent Capabilities Engineering
Introducing a high-level capabilities engineering framework for AI Agents- 29988Murphy2025-03-22
Enhancing Marketing Mix Modelling with Causal AI
Causal AI, exploring the integration of causal reasoning into machine learning- 24482Murphy2025-03-22
Guiding an LLM's Response to Create Structured Output
Learn how to structure a language model's response to ensure that the response format is respected, such as JSON- 29071Murphy2025-03-22
Simplifying Support Vector Machines – A Concise Introduction into Binary Classification
MLBasics #4: The Binary Classification King - A Journey Through Support Vector Machines- 23772Murphy2025-03-22
Creating a Streamlit App for Satellite Imagery Visualization: A Step-by-Step Guide
Explore any point on Earth at any time using satellite data with Streamlit- 22922Murphy2025-03-22
Entity-Resolved Knowledge Graphs
New words. Old concepts. In the end, it's about data fusion.- 21550Murphy2025-03-22
Deep Learning Illustrated, Part 5: Long Short-Term Memory (LSTM)
An illustrated and intuitive guide on the inner workings of an LSTM- 20394Murphy2025-03-22
Beyond Kleinberg's Impossibility Theorem of Clustering: A Pragmatic Clustering Evaluation Fram
This article explores a pragmatic evaluation framework for clustering under the constraint of Kleinberg's Impossibility Theorem- 23458Murphy2025-03-22
Challenges and Solutions in Data Mesh – Part 3
A practical approach to achieving interoperability in the data mesh through federated enterprise data modeling- 22961Murphy2025-03-22
Model Drift Introduction and Concepts
Learn some of the concepts behind machine learning models drift and understand why MLOps is so important in today's world- 27256Murphy2025-03-22
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.