- Beyond Accuracy: Exploring Exotic Metrics for Holistic Evaluation of Machine Learning ModelsMachine learning has undoubtedly become a powerful tool in today's data-driven world, but are we truly tapping into its full potential...
- 21554Murphy ≡ DeepGuide
- RecList 2.0: Open-Source Systematic Testing of ML ModelsA new RecList to provide more flexibility and better support for evaluation
- 20231Murphy ≡ DeepGuide
- Topic Modelling in productionLeveraging LangChain to move from ad-hoc Jupyter Notebooks to production modular service
- 25813Murphy ≡ DeepGuide
- We Need to Raise the Bar for AI Product ManagersHow to Stop Blaming the 'Model' and Start Building Successful AI Products
- 28912Murphy ≡ DeepGuide
- The DIY Path to AI Product Management: Picking a Starter ProjectBuilding real-world skills through hands-on trial and error.
- 25680Murphy ≡ DeepGuide
- Evaluations with Chat FormatsApplying chat templates to generative LM evaluation tests
- 23601Murphy ≡ DeepGuide
- What Exactly Is an "Eval" and Why Should Product Managers Care?How to stop worrying and love the data
- 26757Murphy ≡ DeepGuide
- Evaluating synthetic dataAssessing plausibility and usefulness of data we generated from real data
- 24519Murphy ≡ DeepGuide
We look at an implementation of the HyperLogLog cardinality estimati
Using clustering algorithms such as K-means is one of the most popul
Level up Your Data Game by Mastering These 4 Skills
Learn how to create an object-oriented approach to compare and evalu
When I was a beginner using Kubernetes, my main concern was getting
Tutorial and theory on how to carry out forecasts with moving averag
