When training with DQN (or really any other algorithm; I also tried PPO) on the latest nightly build of Ray, I get a strange error ...
This module takes an input ndarray and either appends a singleton dimension (a dimension of length one) or inserts one before a specified dimension. input: The input ndarray to be unsqueezed. axes ...
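To make the described behavior concrete, here is a minimal sketch using NumPy. The function name `unsqueeze` and its `axis` parameter are hypothetical stand-ins for the module above, not its actual interface; the only library call used is `np.expand_dims`.

```python
import numpy as np


def unsqueeze(x, axis=None):
    """Hypothetical helper: add a dimension of length one to `x`.

    With axis=None the new dimension is appended at the end;
    otherwise it is inserted before position `axis`.
    """
    x = np.asarray(x)
    if axis is None:
        # Append a trailing singleton dimension.
        return np.expand_dims(x, axis=-1)
    # Insert a singleton dimension before the given position.
    return np.expand_dims(x, axis=axis)


a = np.zeros((3, 4))
appended = unsqueeze(a)          # shape (3, 4, 1)
inserted = unsqueeze(a, axis=1)  # shape (3, 1, 4)
```

Shape mismatches of this kind (a missing or extra length-one dimension) are a common source of errors when observations are fed to a model, so checking `.shape` before and after the call may help when debugging.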