Add torchao to PT2 Benchmark Runner #2268

xuzhao9 · 2024-05-16T22:38:22Z

Summary: Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273

facebook-github-bot · 2024-05-16T22:38:34Z

This pull request was exported from Phabricator. Differential Revision: D57463273

Summary: Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

facebook-github-bot · 2024-05-16T22:40:22Z

This pull request was exported from Phabricator. Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

facebook-github-bot · 2024-05-16T22:49:33Z

This pull request was exported from Phabricator. Differential Revision: D57463273

Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

facebook-github-bot · 2024-05-16T22:59:10Z

This pull request was exported from Phabricator. Differential Revision: D57463273

Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

facebook-github-bot · 2024-05-17T21:08:58Z

This pull request was exported from Phabricator. Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

facebook-github-bot · 2024-05-18T00:18:08Z

This pull request was exported from Phabricator. Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

facebook-github-bot · 2024-05-18T01:25:36Z

This pull request was exported from Phabricator. Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Differential Revision: D57463273 Pulled By: xuzhao9

Summary: X-link: #2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. X-link: pytorch/pytorch#126469 Reviewed By: jerryzh168 Differential Revision: D57463273 Pulled By: xuzhao9 fbshipit-source-id: 64520f18b63107ce5f07447ef7f4a8c841d9ff1f

Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273 Pull Request resolved: #126469 Approved by: https://github.com/huydhn

facebook-github-bot added the cla signed label May 16, 2024

facebook-github-bot added the fb-exported label May 16, 2024

xuzhao9 had a problem deploying to docker-s3-upload May 16, 2024 22:38 — with GitHub Actions Error

xuzhao9 had a problem deploying to docker-s3-upload May 16, 2024 22:39 — with GitHub Actions Error

xuzhao9 force-pushed the export-D57463273 branch from c9e1bed to 7b8e668 Compare May 16, 2024 22:40

xuzhao9 had a problem deploying to docker-s3-upload May 16, 2024 22:40 — with GitHub Actions Error

xuzhao9 mentioned this pull request May 16, 2024

[torchbench] Add torchao to PT2 Benchmark Runner pytorch/pytorch#126469

Closed

xuzhao9 force-pushed the export-D57463273 branch from 7b8e668 to 83e955c Compare May 16, 2024 22:49

xuzhao9 had a problem deploying to docker-s3-upload May 16, 2024 22:49 — with GitHub Actions Error

xuzhao9 had a problem deploying to docker-s3-upload May 16, 2024 22:51 — with GitHub Actions Error

xuzhao9 force-pushed the export-D57463273 branch from 83e955c to 58a6323 Compare May 16, 2024 22:58

xuzhao9 temporarily deployed to docker-s3-upload May 16, 2024 22:59 — with GitHub Actions Inactive

xuzhao9 temporarily deployed to docker-s3-upload May 16, 2024 23:00 — with GitHub Actions Inactive

huydhn approved these changes May 16, 2024

View reviewed changes

xuzhao9 force-pushed the export-D57463273 branch from 58a6323 to f440504 Compare May 17, 2024 02:13

xuzhao9 temporarily deployed to docker-s3-upload May 17, 2024 02:14 — with GitHub Actions Inactive

xuzhao9 had a problem deploying to docker-s3-upload May 17, 2024 19:55 — with GitHub Actions Error

xuzhao9 had a problem deploying to docker-s3-upload May 17, 2024 19:56 — with GitHub Actions Error

xuzhao9 force-pushed the export-D57463273 branch from d46b1d6 to 7d439ee Compare May 17, 2024 21:08

xuzhao9 temporarily deployed to docker-s3-upload May 17, 2024 21:09 — with GitHub Actions Inactive

xuzhao9 had a problem deploying to docker-s3-upload May 17, 2024 21:09 — with GitHub Actions Failure

xuzhao9 had a problem deploying to docker-s3-upload May 17, 2024 22:35 — with GitHub Actions Error

xuzhao9 force-pushed the export-D57463273 branch from 7d439ee to bb573bf Compare May 18, 2024 00:17

xuzhao9 had a problem deploying to docker-s3-upload May 18, 2024 00:18 — with GitHub Actions Error

xuzhao9 had a problem deploying to docker-s3-upload May 18, 2024 00:19 — with GitHub Actions Error

Add torchao to PT2 Benchmark Runner (pytorch#2268)

b6fd770

Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273

xuzhao9 force-pushed the export-D57463273 branch from bb573bf to b6fd770 Compare May 18, 2024 01:25

xuzhao9 temporarily deployed to docker-s3-upload May 18, 2024 01:25 — with GitHub Actions Inactive

xuzhao9 temporarily deployed to docker-s3-upload May 18, 2024 01:26 — with GitHub Actions Inactive

xuzhao9 closed this May 20, 2024

xuzhao9 deleted the export-D57463273 branch May 20, 2024 18:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add torchao to PT2 Benchmark Runner #2268

Add torchao to PT2 Benchmark Runner #2268

xuzhao9 commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 17, 2024

facebook-github-bot commented May 18, 2024

facebook-github-bot commented May 18, 2024

Add torchao to PT2 Benchmark Runner #2268

Add torchao to PT2 Benchmark Runner #2268

Conversation

xuzhao9 commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 16, 2024

facebook-github-bot commented May 17, 2024

facebook-github-bot commented May 18, 2024

facebook-github-bot commented May 18, 2024