#8563: sweep split_query_key_value_and_split_heads, split and concat #8610

sjameelTT · 2024-05-17T16:04:26Z

Add new sweep tests for ttnn.transformers.split_query_key_value_and_split_heads, split and concat

split_query_key_value_and_split_heads:

batch_size and cores_h are grouped in a single tuple to minimize permutations that are expected to fail
(num_q_heads, num_kv_heads, cores_w) and (seq_len_q, seq_len_kv) are also grouped in tuples for the same reason
If every combination were tested with all data types and memory configurations, there would be 24192 test cases instead of the current 2918
PCC will be low for the interleaved version because the sharded and interleaved versions expect the logical QKV tensor to be concatenated along different dimensions
TODO: also add expected-failure cases for configurations that shouldn't be supported
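
To illustrate why grouping parameters into tuples shrinks the sweep, here is a minimal sketch in plain Python. The parameter values and the helper names (`grouped_cases`, `ungrouped_count`) are hypothetical and chosen only to show the counting; they are not the values used in the actual sweep file and will not reproduce the 24192 and 2918 figures.

```python
from itertools import product

# Hypothetical values, for illustration only (not the real sweep parameters).
batch_and_cores_h = [(1, 1), (2, 2), (4, 4)]   # (batch_size, cores_h)
heads_and_cores_w = [(8, 8, 2), (16, 4, 4)]    # (num_q_heads, num_kv_heads, cores_w)
seq_lens = [(64, 64), (128, 256)]              # (seq_len_q, seq_len_kv)
dtypes = ["bfloat16", "bfloat8_b"]


def grouped_cases():
    """Yield one flat dict per test case, unpacking the grouped tuples."""
    for (batch_size, cores_h), (num_q_heads, num_kv_heads, cores_w), (
        seq_len_q,
        seq_len_kv,
    ), dtype in product(batch_and_cores_h, heads_and_cores_w, seq_lens, dtypes):
        yield {
            "batch_size": batch_size,
            "cores_h": cores_h,
            "num_q_heads": num_q_heads,
            "num_kv_heads": num_kv_heads,
            "cores_w": cores_w,
            "seq_len_q": seq_len_q,
            "seq_len_kv": seq_len_kv,
            "dtype": dtype,
        }


def ungrouped_count():
    """Count the cases if every field were swept independently."""
    axes = [
        {b for b, _ in batch_and_cores_h},
        {c for _, c in batch_and_cores_h},
        {q for q, _, _ in heads_and_cores_w},
        {kv for _, kv, _ in heads_and_cores_w},
        {w for _, _, w in heads_and_cores_w},
        {q for q, _ in seq_lens},
        {kv for _, kv in seq_lens},
        set(dtypes),
    ]
    count = 1
    for axis in axes:
        count *= len(axis)
    return count


if __name__ == "__main__":
    print("grouped:", len(list(grouped_cases())))
    print("ungrouped:", ungrouped_count())
```

Only combinations that appear together in a tuple are generated, so pairings that are known to be invalid (for example a batch size that does not match the core grid height) never enter the sweep.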

split:

use ttnn.experimental.tensor.split_dim_two_chunks_tiled instead of the ttnn split for now, since split is not implemented and is just a wrapper
add some known working configs, since the current split is hardcoded to split in half
TODO: ttnn.split should be implemented, potentially by calling the ttnn.experimental version
TODO: remove the known working config cases when ttnn.split is implemented
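
As a rough sketch of what "hardcoded to split in half" implies for choosing test shapes, the plain-Python helpers below compute the output shapes of a two-way split and filter candidate configs. The helper names are hypothetical, and the tile-alignment rule (last two dimensions must be multiples of 32 before and after the split) is an assumption made for illustration, not a statement of what the op actually requires.

```python
TILE = 32  # tile edge length; the alignment rule below is an assumption


def split_in_half_shapes(shape, dim):
    """Return the two output shapes of an equal two-way split along `dim`."""
    if shape[dim] % 2 != 0:
        raise ValueError(f"dimension {dim} of {shape} is not divisible by 2")
    half = list(shape)
    half[dim] //= 2
    return tuple(half), tuple(half)


def is_candidate_config(shape, dim):
    """Assumed filter: inputs and outputs keep tile-aligned last two dims."""
    if shape[dim] % 2 != 0:
        return False
    half, _ = split_in_half_shapes(shape, dim)
    aligned_in = all(size % TILE == 0 for size in shape[-2:])
    aligned_out = all(size % TILE == 0 for size in half[-2:])
    return aligned_in and aligned_out


if __name__ == "__main__":
    for shape in [(1, 1, 64, 128), (1, 1, 64, 32)]:
        print(shape, is_candidate_config(shape, dim=3))
```

A filter like this could be dropped once ttnn.split supports arbitrary split sizes, in line with the TODO above.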

concat:

concat tests were reworked to track the dimensions of each test case, which makes debugging easier
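
One way to make the dimensions of each concat case visible is to carry the input shapes explicitly and derive both the expected output shape and a readable case id from them. The sketch below uses plain Python; `concat_output_shape` and `case_id` are hypothetical helpers, not functions from the sweep framework.

```python
def concat_output_shape(shapes, dim):
    """Return the shape produced by concatenating `shapes` along `dim`."""
    rank = len(shapes[0])
    dim = dim % rank  # normalize negative dims
    for shape in shapes:
        if len(shape) != rank:
            raise ValueError(f"rank mismatch: {shape} vs {shapes[0]}")
        for axis in range(rank):
            if axis != dim and shape[axis] != shapes[0][axis]:
                raise ValueError(f"axis {axis} differs: {shape} vs {shapes[0]}")
    out = list(shapes[0])
    out[dim] = sum(shape[dim] for shape in shapes)
    return tuple(out)


def case_id(shapes, dim):
    """Build a readable id that spells out every input shape."""
    joined = "+".join("x".join(str(size) for size in shape) for shape in shapes)
    return f"concat_dim{dim}_{joined}"


if __name__ == "__main__":
    shapes = [(1, 1, 32, 32), (1, 1, 32, 64)]
    print(case_id(shapes, 3), "->", concat_output_shape(shapes, 3))
```

With ids like this, a failing case names its input shapes directly in the test report instead of requiring a lookup by index.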

sjameelTT · 2024-06-05T18:45:38Z

https://github.com/tenstorrent/tt-metal/actions/runs/9388929767

- Batch_size, and cores_h are together in a tuple to minimize permutations that are expected to fail
- (Num_q_heads, num_kv_heads, cores_w) and (seq_len_q, seq_len_kv) are also in a tuple for the same reason
- If all required combinations were tested with all data types and memory configurations then we would have 24192 as opposed to the current 2918
- PCC will be low for the interleaved version since the sharded and interleaved versions both expect the logical QKV tensor to be concatenated along different dimensions
- TODO: add expected failure cases too for configurations that shouldn't be supported

- concat tests reworked to be able to track the dimensions for each test case (makes for easier debugging)

- use ttnn.experimental.tensor.split_dim_two_chunks_tiled instead of the ttnn split for now since split is not implemented and just a wrapper
- add some known working configs since the current split is hardcoded to split in half
- TODO: ttnn.split should be implemented, potentially call the ttnn.experimental version
- TODO: remove known working config cases when ttnn.split is implemented

sjameelTT requested review from eyonland, arakhmati, cfjchu and xanderchin as code owners May 17, 2024 16:04

sjameelTT requested review from yan-zaretskiy and tarafdarTT May 17, 2024 16:06

sjameelTT temporarily deployed to dev May 21, 2024 17:20 — with GitHub Actions Inactive

sjameelTT temporarily deployed to dev May 21, 2024 17:24 — with GitHub Actions Inactive

sjameelTT temporarily deployed to production May 21, 2024 17:43 — with GitHub Actions Inactive

sjameelTT force-pushed the sjameel/sweeps branch 7 times, most recently from ef8ed30 to 11570d3 Compare May 24, 2024 18:45

sjameelTT changed the title ~~#8563: sweep split_query_key_value_and_split_heads~~ #8563: sweep split_query_key_value_and_split_heads, split and concat May 27, 2024

sjameelTT force-pushed the sjameel/sweeps branch from 5809249 to abfc6a9 Compare May 28, 2024 22:56

sjameelTT requested a review from TT-BrianLiu as a code owner May 28, 2024 22:56

sjameelTT force-pushed the sjameel/sweeps branch 2 times, most recently from bfe9a36 to 0a1cdac Compare May 30, 2024 17:21

sjameelTT temporarily deployed to dev June 5, 2024 17:55 — with GitHub Actions Inactive

sjameelTT temporarily deployed to dev June 5, 2024 17:59 — with GitHub Actions Inactive

sjameelTT had a problem deploying to dev June 5, 2024 17:59 — with GitHub Actions Failure

arakhmati approved these changes Jun 5, 2024

sjameelTT temporarily deployed to production June 5, 2024 18:14 — with GitHub Actions Inactive

sjameelTT force-pushed the sjameel/sweeps branch from 0a1cdac to c2ebaef Compare June 5, 2024 18:15

sjameelTT added 3 commits June 5, 2024 14:50

#8757: change concat sweep tests to be more thorough

a46c53d

- concat tests reworked to be able to track the dimensions for each test case (makes for easier debugging)

sjameelTT force-pushed the sjameel/sweeps branch from c2ebaef to a1336a5 Compare June 5, 2024 18:50

sjameelTT merged commit 6220520 into main Jun 5, 2024
5 checks passed
