publications
2026
2025
2024
- arXivA Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC GuaranteesReinforcement Learning Conference Workshop, 2024
- RLCA Batch Sequential Halving Algorithm without Performance DegradationReinforcement Learning Conference, 2024
- github
2023
- arXivEnd-to-End Policy Gradient Method for POMDPs and Explainable AgentsarXiv preprint, 2023
2022
- IEEE