# Actor-Critic
2 posts
Teaching a Machine to Stock ATMs: Deep Reinforcement Learning for Cash Demand Forecasting
12 min read · Quant Research
A PhD course project applying the Deep Deterministic Policy Gradient (DDPG) algorithm to ATM cash demand forecasting. The problem is framed as a continuous Markov Decision Process, and the approach is evaluated against industry benchmarks on the 111-ATM NN5 dataset.
Reinforcement Learning, Deep Learning, Time Series Forecasting, DDPG, Actor-Critic
Learning Optimal Pricing with Reinforcement Learning
6 min read · Quant Research
A technical implementation of the Actor-Critic algorithm for dynamic pricing. The project benchmarks RL against theoretical optima to test its effectiveness in combinatorial optimization.
Reinforcement Learning, Machine Learning, Actor-Critic, Dynamic Pricing, Combinatorial Optimization