Skip to content

[BUG] GPU training causes invalid OpenCL state #30

Description

@xEcho1337

Bug Description

Using the GPU for medium sized training tasks causes invalid OpenCL state (CL_OUT_OF_RESOURCES).

Steps to Reproduce

  1. Create a model
  2. Use the MNIST dataset
  3. Start the training using the GPU

Expected Behavior

Loss should decrease over time and the GPU should speed up training.

Actual Behavior

Brain4J crashes after few computed batches.
What actually happened (including any error messages, stack traces, or console output).

Environment

  • brain4j version: 3.0.0
  • Java version: 21
  • Operating system: Windows

Additional Context

MLP with 784 -> 128 (ReLU) -> 64 (ReLU) -> 10 (Softmax), MNIST dataset, 128 batch size.

Metadata

Metadata

Assignees

Labels

bugIssues that identify broken or incorrect behaviorgpu-accelerationIssues related to GPU support or accelerationhelp-wantedIssues where maintainers are specifically seeking assistancehigh-priorityIssues requiring urgent attentionnative-integrationIssues related to C/native code integrationtrainingIssues related to model training functionality

Type

No fields configured for Bug.

Projects

Status
In Progress

Relationships

None yet

Development

No branches or pull requests

Issue actions