publications

2024

  1. arXiv
    A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
    Toshinori Kitamura , Tadashi Kozuno , Masahiro Kato , and 6 more authors
    RLC Workshop, 2024
  2. Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains
    Soichiro Nishimori, Xin-Qiang Cai , Johannes Ackermann , and 1 more author
    arXiv preprint, 2024
  3. A Batch Sequential Halving Algorithm without Performance Degradation
    Sotetsu Koyamada , Soichiro Nishimori, and Shin Ishii
    RLC, 2024

2023

  1. NIPS
    Pgx: Hardware-accelerated parallel game simulators for reinforcement learning
    Sotetsu Koyamada , Shinri Okano , Soichiro Nishimori, and 4 more authors
    NeurIPS, 2023
  2. arXiv
    End-to-End Policy Gradient Method for POMDPs and Explainable Agents
    Soichiro Nishimori, Sotetsu Koyamada , and Shin Ishii
    arXiv preprint, 2023

2022

  1. IEEE
    Mjx: A framework for Mahjong AI research
    Sotetsu Koyamada , Keigo Habara , Nao Goto , and 3 more authors
    In , 2022