Hi, when I use vit-huge to train the semi stage, after 12 epochs, loss_u became 0, and the classification accuracy of ema model suddenly became low at 11 epochs, and it is expected that the training will definitely fail. I would like to ask the authors whether you have encountered this problem
Hi, when I use vit-huge to train the semi stage, after 12 epochs, loss_u became 0, and the classification accuracy of ema model suddenly became low at 11 epochs, and it is expected that the training will definitely fail. I would like to ask the authors whether you have encountered this problem