ray.rllib.algorithms.algorithm.Algorithm.evaluate

Algorithm.evaluate(duration_fn: Callable[[int], int] | None = None) → dict

Evaluates the current policy under the evaluation_config settings.

Parameters:

duration_fn – An optional callable that takes the number of episodes already run as its only argument and returns the number of episodes left to run. It is used to decide whether evaluation should continue.
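
As a rough illustration of the duration_fn contract described above, the following sketch builds a callable that maps "episodes already run" to "episodes left to run". The names make_duration_fn, target_episodes, and simulate_evaluation are hypothetical and introduced only for this example. The loop is a stand-in that shows how such a callable could be consumed; it is not RLlib's actual evaluation loop.

```python
from typing import Callable


def make_duration_fn(target_episodes: int) -> Callable[[int], int]:
    """Build a duration_fn that stops once `target_episodes` have been run.

    `target_episodes` is a hypothetical parameter chosen for this sketch.
    """

    def duration_fn(num_episodes_done: int) -> int:
        # Return the number of episodes still left to run (never negative).
        return max(0, target_episodes - num_episodes_done)

    return duration_fn


def simulate_evaluation(
    duration_fn: Callable[[int], int], episodes_per_round: int = 3
) -> int:
    """Hypothetical stand-in loop, NOT RLlib's implementation.

    It asks duration_fn how many episodes remain and stops at zero.
    """
    episodes_done = 0
    while True:
        episodes_left = duration_fn(episodes_done)
        if episodes_left <= 0:
            break
        # Pretend we ran a batch of episodes, capped at what is still needed.
        episodes_done += min(episodes_per_round, episodes_left)
    return episodes_done


total_run = simulate_evaluation(make_duration_fn(10))

# With an already built Algorithm instance (assumed here to be named `algo`),
# the same callable could be passed according to the signature above:
# results = algo.evaluate(duration_fn=make_duration_fn(10))
```

Returning the remaining count rather than a boolean lets the caller size its next batch of episodes, which is why the sketch caps each round at episodes_left.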