Midokura Technology RadarMidokura Technology Radar

Reinforcement Learning

aikeep trackteam:mido/aiad
Hold

Why?

Reinforcement learning is a powerful technique which quickly converges to a good solution.

What?

Evaluate RL for optimizing a model in a simulated environment.