Midokura Technology RadarMidokura Technology Radar

Reinforcement Learning

aikeep trackteam:mido/aiad
This item was not updated in last three versions of the Radar. Should it have appeared in one of the more recent editions, there is a good chance it remains pertinent. However, if the item dates back further, its relevance may have diminished and our current evaluation could vary. Regrettably, our capacity to consistently revisit items from past Radar editions is limited.
Hold

Why?

Reinforcement learning is a powerful technique which quickly converges to a good solution.

What?

Evaluate RL for optimizing a model in a simulated environment.