Robust Q-Learning against State Perturbations: a Belief-Enriched Pessimistic Approach

Publication
Multi-Agent Security Workshop @ NeurIPS'23