Beyond the Basics: Reinforcement Learning with Jax – Part I: Introduction and Core Concepts
An overview of the Reinforcement Learning fundamentals: Markov Decision Processes, Policies, and Value Functions- 23998Murphy ≡ DeepGuide
Beyond the Basics: Reinforcement Learning with Jax - Part II: Developing an exploitative…
In our last blog post, we explored the Reinforcement Learning paradigm, delving into its core concepts of finite Markov Decision Processes, Policies, and Value Functions. Now, we are ready to apply our newfound knowledge and discover an alternative approa- 29532Murphy ≡ DeepGuide
Vectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light⚡
Learn to vectorize a GridWorld environment and train 30 Q-learning agents in parallel on a CPU, at 1.8 million step per seconds!- 28376Murphy ≡ DeepGuide
A Gentle Introduction to Deep Reinforcement Learning in JAX
Solving the CartPole environment with DQN in under a second- 24821Murphy ≡ DeepGuide
A Practical Guide to Proximal Policy Optimization in JAX
All the tricks and details you wish you knew about PPO- 26943Murphy ≡ DeepGuide
AI Model Training with JAX
Hit the road to super-fast AI/ML development- 29512Murphy ≡ DeepGuide
Multi-Framework AI/ML Development with Keras 3
All hail the return of Keras- 30058Murphy ≡ DeepGuide
Rainbow: The Colorful Evolution of Deep Q-Networks
Everything you need to assemble the DQN Megazord in JAX.- 23371Murphy ≡ DeepGuide
Let's reproduce NanoGPT with JAX!(Part 1)
Part 1: Build 124M GPT2 with JAX. Part 2: Optimize the training speed in Single GPU. Part 3: Multi-GPU Training in Jax.- 24616Murphy ≡ DeepGuide
Implementing Sequential Algorithms on TPU
Accelerating AI/ML Model Training with Custom Operators - Part 3.A- 22263Murphy ≡ DeepGuide
Understanding Automatic Differentiation in JAX: A Deep Dive
Unleashing the Gradient: How JAX Makes Automatic Differentiation Feel Like Magic- 21274Murphy ≡ DeepGuide
Automatic Vectorization in JAX
Let the loops fly away!- 23499Murphy ≡ DeepGuide
We look at an implementation of the HyperLogLog cardinality estimati
Using clustering algorithms such as K-means is one of the most popul
Level up Your Data Game by Mastering These 4 Skills
Learn how to create an object-oriented approach to compare and evalu
When I was a beginner using Kubernetes, my main concern was getting
Tutorial and theory on how to carry out forecasts with moving averag