Juice for Deep Reinforcement Learning

Hello,

I'm investigating a potential use of Juice framework for deep reinforcement learning (I'm also learning the RL and deep learning as I go, so apologies for potentially newbie questions). RL requires simultaneous learning and using the net for predictions. I've found several issues which I'm not sure are design decisions or implementation shortcuts:

1. It looks like if I configure the net for minibatch training, I can't then use it to make predictions on just one input. I get this [exception](https://github.com/spearow/juice/blob/b80fb54a977722b308c04ad5301eb6895d08d612/juice/src/layer.rs#L541) when I try to.
2. `Solver` has a `network()` [method](https://docs.rs/juice/latest/juice/solver/struct.Solver.html#method.network), comment for which says that "This is the recommended method to get a usable trained network." However, you can't call `forward()` on it, since it requires a `mut` ref.

I can probably work around 1 (like artificially creating a batch by replicating a single input vector) and 2 (by using `mut_network()`), but it doesn't look right.

Is this something that can (should?) be fixed in the implementation? I'm happy to provide PRs (but will likely require technical guidance).

Thank you!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Juice for Deep Reinforcement Learning #155

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Juice for Deep Reinforcement Learning #155

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions