STRATEGIES FOR MITIGATING OVERFITTING AND ASYMPTOTIC BIAS IN BATCH REINFORCEMENT LEARNING WITH PARTIAL OBSERVABILITY. International journal of artificial intelligence, [S. l.], v. 3, n. 04, p. 01–05, 2023. Disponível em: https://www.academicpublishers.org/journals/index.php/ijai/article/view/60.. Acesso em: 29 mar. 2026.