Support estimation under optimal policy

Edited by Johannes Pfeifer