Multi-GPUS support by MlWoo · Pull Request #152 · Rayhane-mamah/Tacotron-2

MlWoo · 2018-08-14T02:58:16Z

Many friends seem very to be interested in multi-gpus support when training the model. Maybe it is necessary to merge the branch into the master one.

MlWoo · 2018-08-14T03:06:28Z

@begeekmyfriend I have not modified the relative code in terms of the pattern.

begeekmyfriend · 2018-08-14T05:57:15Z

@Rayhane-mamah Yes I agree. In multi-gpu mode we can set r=1 and expand the batch size to obtain smooth gradient. So please consider it as another branch.

Rayhane-mamah · 2018-08-14T08:20:36Z

Yes it seems like people are requesting that. :) well, your multi-gpu attempt @MlWoo is sure much helpful. Since the model content has been changed since you made this implementation, I will need to make few updates here and there, but yeah, I will probably make a new branch for both Wavenet and Tacotron multi-gpu or add those directly on master with optional use or something. (I don't like 4 spaces though hahaha..).

In the meantime, I am leaving this PR open in here so that people can quickly refer to a good multi-gpu implementation :)

Thanks for all your contributions @MlWoo and @begeekmyfriend ;)

ghost · 2018-09-17T08:09:56Z

When I try to use this Fork as it is, I run into the following:

ValueError: Cannot feed value of shape (48, 408, 1025) for Tensor 'datafeeder/linear_targets:0', which has shape '(?, ?, 513)'

What could be the cause of this? I preprocessed LJSpeech with the given hyperparameters btw.

MlWoo · 2018-09-18T02:45:54Z

@tomse-h I have not modified the relative code in terms of the linear pattern. You can complete it with the solution of mel features

shaktikshri · 2020-08-14T14:32:25Z

I might be a bit late into this conversation, but did you guys also see a proportional increase in sec/step when using multiple GPUs? Here are my stats on V100 GPUs with outputs_per_step = 16
#GPU----batchsize----sec/step
1.................32......................~4
2.................64.....................~10
3.................96 ....................~15
4.................128....................~19

MlWoo · 2020-08-17T09:04:17Z

@shaktikshri No, it increases but does scale linearly. You would better check the time of loading data and the unbalance of length of data of each device.

MlWoo and others added 5 commits June 29, 2018 13:30

replace tab with 4 spaces

6a9f9b4

multi gpus support for tacotron

1f5b288

fix bug of wavenet when infering

f991607

Update README.md

56b37f4

Update README.md

0cb1dc8

begeekmyfriend mentioned this pull request Aug 14, 2018

How to use multi-gpu to train #153

Open

MlWoo added 2 commits April 22, 2019 09:22

Update README.md

4aa80a7

Update README.md

4e1f9b5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-GPUS support#152

Multi-GPUS support#152
MlWoo wants to merge 7 commits into
Rayhane-mamah:masterfrom
MlWoo:master

MlWoo commented Aug 14, 2018

Uh oh!

MlWoo commented Aug 14, 2018

Uh oh!

begeekmyfriend commented Aug 14, 2018 •

edited

Loading

Uh oh!

Rayhane-mamah commented Aug 14, 2018

Uh oh!

ghost commented Sep 17, 2018

Uh oh!

MlWoo commented Sep 18, 2018

Uh oh!

shaktikshri commented Aug 14, 2020 •

edited

Loading

Uh oh!

MlWoo commented Aug 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

MlWoo commented Aug 14, 2018

Uh oh!

MlWoo commented Aug 14, 2018

Uh oh!

begeekmyfriend commented Aug 14, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Rayhane-mamah commented Aug 14, 2018

Uh oh!

ghost commented Sep 17, 2018

Uh oh!

MlWoo commented Sep 18, 2018

Uh oh!

shaktikshri commented Aug 14, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MlWoo commented Aug 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

begeekmyfriend commented Aug 14, 2018 •

edited

Loading

shaktikshri commented Aug 14, 2020 •

edited

Loading