Hello
dfast satndalone version may present a potential overcommit when used via slurm
dfc/dev/gff/gff_config.py and dfc/default_config.py uses multiprocessing.cpu_count() to ge the value of avilalble cpus.
multiprocessing.cpu_count() returns the number of available cpu, but this is not the same as the number of cpu available to the process. For example, you can run in a taskset context or a batch scheduler like slurm.
see:
$ nproc
96
$ taskset -c 1 nproc
1
$ taskset -c 1 python3 -c "import multiprocessing; print(multiprocessing.cpu_count())"
96
I would suggest to use len(os.sched_getaffinity(0)) instead of multiprocessing.cpu_count()
$ python3 -c "import os; print(len(os.sched_getaffinity(0)))"
96
$ taskset -c 1 python3 -c "import os; print(len(os.sched_getaffinity(0)))"
1
regards
Eric
Hello
dfast satndalone version may present a potential overcommit when used via slurm
dfc/dev/gff/gff_config.pyanddfc/default_config.pyusesmultiprocessing.cpu_count()to ge the value of avilalble cpus.multiprocessing.cpu_count()returns the number of available cpu, but this is not the same as the number of cpu available to the process. For example, you can run in a taskset context or a batch scheduler like slurm.see:
I would suggest to use len(os.sched_getaffinity(0)) instead of multiprocessing.cpu_count()
regards
Eric