Megatron style TFLOPs Calculation #537

abhinavgoel95 · 2024-03-20T03:13:47Z

@rwitten this is a draft.

This type of change would be specific to a few transformer models (e.g., Gemma, LLama, GPT, etc.). It wouldn't work with MoE, or some new architectures.

I was thinking that walking through the train-step and calculating the FLOPs layer-by-layer would be a very intrusive change.

What do you think?

MaxText/maxtext_utils.py

added config adding support for megatron style tflops calculation adding support for megatron style tflops calculation adding support for megatron style tflops calculations

abhinavgoel95 · 2024-04-01T16:41:33Z

Made the changes as requested in the meeting @rwitten

abhinavgoel95 · 2024-04-17T22:25:22Z

cc @rwitten following up on this

rwitten requested changes Mar 20, 2024

View reviewed changes

MaxText/maxtext_utils.py Outdated Show resolved Hide resolved

MaxText/maxtext_utils.py Outdated Show resolved Hide resolved

MaxText/maxtext_utils.py Outdated Show resolved Hide resolved

abhinavgoel95 force-pushed the megatron_tflops branch 2 times, most recently from 3980d41 to 01c78bb Compare March 27, 2024 19:35

added support to use megatron tflops calculation

43f3451

added config adding support for megatron style tflops calculation adding support for megatron style tflops calculation adding support for megatron style tflops calculations

abhinavgoel95 force-pushed the megatron_tflops branch from 01c78bb to 43f3451 Compare April 1, 2024 16:40

abhinavgoel95 requested a review from rwitten April 1, 2024 16:41

abhinavgoel95 marked this pull request as ready for review April 1, 2024 20:00

Merge branch 'main' into megatron_tflops

c79e00d

abhinavgoel95 requested a review from gobbleturk as a code owner April 3, 2024 20:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Megatron style TFLOPs Calculation #537

Megatron style TFLOPs Calculation #537

abhinavgoel95 commented Mar 20, 2024

abhinavgoel95 commented Apr 1, 2024

abhinavgoel95 commented Apr 17, 2024

Megatron style TFLOPs Calculation #537

Are you sure you want to change the base?

Megatron style TFLOPs Calculation #537

Conversation

abhinavgoel95 commented Mar 20, 2024

abhinavgoel95 commented Apr 1, 2024

abhinavgoel95 commented Apr 17, 2024