Data Science Unicorns, RAG Pipelines, a New Coefficient of Correlation, and Other April Must-Reads
The stories that resonated the most with our community in the past month- 25552Murphy2025-03-22
Chatbot Morality?
Exposing the promise and perils of ChatGPT's convincing moral rhetoric- 28664Murphy2025-03-22
From Social Science to Data Science
8 years ago I started my bachelor's degree in Geography. Now I'm a Data Scientist; this is the story of how (and why) I've got here- 23757Murphy2025-03-22
Modeling Slowly Changing Dimensions
A deep dive into the various SCD types and how they can be implemented in Data Warehouses- 25548Murphy2025-03-22
Why Data Science May Not Be For You
Tell-tale signs why a career in data science may not suit you- 20561Murphy2025-03-22
Getting Started with the Dev Containers Extension
This is a step-by-step tutorial for getting started with the Dev Containers extension. This tutorial is the first of a sequence of...- 26130Murphy2025-03-22
How to Use and Test WizardLM2: Microsoft's New LLM
Learn how to run and test Microsoft's new LLM, WizardLM2, and use it to perform tasks like question-answering and information extraction- 23117Murphy2025-03-22
FanFabler: Fine-Tuning Llama 3 to Be a Multilingual Fanfic Writing Assistant
How I used a custom training dataset and information retrieval for global storytelling. 好样的! Bravo! वाह! ¡Guau! 브라보!- 27815Murphy2025-03-22
5 Things to do When Evaluating ELT/ETL Tools
A list to make evaluating ELT/ETL tools a bit less daunting- 21970Murphy2025-03-22
Are Data Scientists Fortune Tellers?
Should we aim to be one?- 25818Murphy2025-03-22
Text to Knowledge Graph Made Easy with Graph Maker
An open-source library for building knowledge graphs from text corpus using open-source LLMs like Llama 3 and Mixtral.- 21844Murphy2025-03-22
How to Transition from Physics to Data Science: A Comprehensive Guide
Advice from a Physics Master's Graduate turned Data Scientist- 24952Murphy2025-03-22
Demo AI Products Like a Pro
An intro to expert guide on using Gradio to demonstrate product value to expert and non-technical audiences.- 22010Murphy2025-03-22
Sparsifying Knowledge-Graph using Target Information
Sparsifying knowledge graphs for supervised tasks, using PMI to remove irrelevant edges; With concrete example using medical data- 20920Murphy2025-03-22
Prompt Like a Data Scientist: Auto Prompt Optimization and Testing with DSPy
Applying machine learning methodology to prompt building- 23933Murphy2025-03-22
A 10-Minute Template to Build an AI Assistant on HuggingFace
And why you should build yours today- 20454Murphy2025-03-22
Transformers: From NLP to Computer Vision
How Transformer architecture has been adapted to computer vision tasks- 23398Murphy2025-03-22
Plotting Golf Courses in R with Google Earth
A user guide to mapping golf courses in Google Earth and bringing them to life in R.- 20284Murphy2025-03-22
Lunar Crater Detection: Computer Vision in Space
One small step towards autonomous crater-based navigation- 29680Murphy2025-03-22
Understanding Kolmogorov-Arnold Networks (KAN)
Why KANs are a potential alternative to MPLs and the current landscape of Machine Learning. Let's go through the paper to find out.- 29072Murphy2025-03-22
The current state of continual learning in AI
Why is ChatGPT only trained up until 2021?Optimizing Pandas Code: The Impact of Operation Sequence
Learn how to rearrange your code to achieve significant speed improvements.