Research output

  1. Generalized Optimistic Q-Learning with Provable Efficiency

    Research output: Chapter in Book/Conference proceedings/Edited volume; Conference contribution; Scientific; peer-review

  2. Interval Q-Learning: Balancing Deep and Wide Exploration

    Research output: Chapter in Book/Conference proceedings/Edited volume; Conference contribution; Scientific; peer-review

  3. Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards

    Research output: Chapter in Book/Conference proceedings/Edited volume; Conference contribution; Scientific; peer-review

Activities

  1. Generalized Optimistic Q-Learning with Provable Efficiency

    Activity: Talk or presentation; Talk or presentation at a conference

ID: 19407918