Python Temporal-Difference Learning Resources