Benchmark for "Offline Policy Comparison with Confidence"
reinforcement-learning policy-evaluation uncertainty-estimation confidence-estimation offline-reinforcement-learning offline-policy-comparison
Updated Oct 25, 2023 - Python