Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OOM error #4

Open
laybebe opened this issue Apr 22, 2023 · 5 comments
Open

OOM error #4

laybebe opened this issue Apr 22, 2023 · 5 comments

Comments

@laybebe
Copy link

laybebe commented Apr 22, 2023

Thanks for your excellent work! What's the GPU memory of your Titan RTX, 24G? I have tried to train MaskFreeVIS with backbone ResNet50 and batch size 16 on 8x3090(24G), but it will result in an OOM error.

@lkeab
Copy link
Collaborator

lkeab commented Apr 22, 2023

which training script are you using?

@laybebe
Copy link
Author

laybebe commented Apr 22, 2023

which training script are you using?

I used "MaskFreeVIS/mfvis_nococo/configs/youtubevis_2019/video_maskformer2_R50_bs16_8ep.yaml" and did not modify any parameter.

@lkeab
Copy link
Collaborator

lkeab commented Apr 22, 2023

In this case, you can reduce the SAMPLING_FRAME_NUM from 5 to 3, and modify the codes here accordingly (to make TK loss work in 3-frame tube). It will reduce memory a lot but the same time training will be slightly less stable.

@laybebe
Copy link
Author

laybebe commented Apr 22, 2023

Thanks for your reply. In fact, I ran the code by reducing the batch size. The result (AP=41.7) was slightly lower than the result in the paper.

@lkeab
Copy link
Collaborator

lkeab commented Apr 22, 2023

Yeah, that's actually normal due to the weak supervision (no any mask usage) and the randomness in sampling frames of the dataloader.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants