← All Tags

# Actor-Critic

2 posts

Teaching a Machine to Stock ATMs: Deep Reinforcement Learning for Cash Demand Forecasting

12 min read Quant Research

A PhD course project applying the Deep Deterministic Policy Gradient (DDPG) algorithm to ATM cash demand forecasting. The problem is framed as a continuous Markov Decision Process, evaluated against industry benchmarks on the 111-ATM NN5 dataset.

Reinforcement LearningDeep LearningTime Series ForecastingDDPGActor-Critic

Learning Optimal Pricing with Reinforcement Learning

6 min read Quant Research

A technical implementation of the Actor-Critic algorithm to solve dynamic pricing. This project benchmarks RL against theoretical optima to test its effectiveness in combinatorial optimization.

Reinforcement LearningMachine LearningActor-CriticDynamic PricingCombinatorial Optimization
© 2026 Elvin Zeynalli. All rights reserved.