Hello,
I'm investigating a potential use of Juice framework for deep reinforcement learning (I'm also learning the RL and deep learning as I go, so apologies for potentially newbie questions). RL requires simultaneous learning and using the net for predictions. I've found several issues which I'm not sure are design decisions or implementation shortcuts:
- It looks like if I configure the net for minibatch training, I can't then use it to make predictions on just one input. I get this exception when I try to.
Solver has a network() method, comment for which says that "This is the recommended method to get a usable trained network." However, you can't call forward() on it, since it requires a mut ref.
I can probably work around 1 (like artificially creating a batch by replicating a single input vector) and 2 (by using mut_network()), but it doesn't look right.
Is this something that can (should?) be fixed in the implementation? I'm happy to provide PRs (but will likely require technical guidance).
Thank you!
Hello,
I'm investigating a potential use of Juice framework for deep reinforcement learning (I'm also learning the RL and deep learning as I go, so apologies for potentially newbie questions). RL requires simultaneous learning and using the net for predictions. I've found several issues which I'm not sure are design decisions or implementation shortcuts:
Solverhas anetwork()method, comment for which says that "This is the recommended method to get a usable trained network." However, you can't callforward()on it, since it requires amutref.I can probably work around 1 (like artificially creating a batch by replicating a single input vector) and 2 (by using
mut_network()), but it doesn't look right.Is this something that can (should?) be fixed in the implementation? I'm happy to provide PRs (but will likely require technical guidance).
Thank you!