Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add torchao to PT2 Benchmark Runner #2268

Closed
wants to merge 1 commit into from

Conversation

xuzhao9
Copy link
Contributor

@xuzhao9 xuzhao9 commented May 16, 2024

Summary: Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D57463273

xuzhao9 added a commit to xuzhao9/benchmark that referenced this pull request May 16, 2024
Summary:

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D57463273

pytorch-bot bot pushed a commit to pytorch/pytorch that referenced this pull request May 16, 2024
Summary:
X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/benchmark that referenced this pull request May 16, 2024
Summary:
X-link: pytorch/pytorch#126469


Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 16, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D57463273

xuzhao9 added a commit to xuzhao9/benchmark that referenced this pull request May 16, 2024
Summary:
X-link: pytorch/pytorch#126469


Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 16, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D57463273

xuzhao9 added a commit to xuzhao9/benchmark that referenced this pull request May 17, 2024
Summary:
X-link: pytorch/pytorch#126469


Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 17, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload May 17, 2024 02:14 — with GitHub Actions Inactive
@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload May 17, 2024 02:14 — with GitHub Actions Inactive
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 17, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 17, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/benchmark that referenced this pull request May 17, 2024
Summary:
X-link: pytorch/pytorch#126469


Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D57463273

@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload May 17, 2024 21:09 — with GitHub Actions Inactive
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 18, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/benchmark that referenced this pull request May 18, 2024
Summary:
X-link: pytorch/pytorch#126469


Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 18, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D57463273

xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 18, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
Summary:
X-link: pytorch/pytorch#126469


Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 18, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D57463273

@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload May 18, 2024 01:25 — with GitHub Actions Inactive
@xuzhao9 xuzhao9 temporarily deployed to docker-s3-upload May 18, 2024 01:26 — with GitHub Actions Inactive
pytorchmergebot pushed a commit to xuzhao9/pytorch that referenced this pull request May 18, 2024
Summary:

X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch                       
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273
xuzhao9 added a commit to xuzhao9/pytorch that referenced this pull request May 19, 2024
Summary:
X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.


Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Differential Revision: D57463273

Pulled By: xuzhao9
facebook-github-bot pushed a commit that referenced this pull request May 20, 2024
Summary:
X-link: #2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

X-link: pytorch/pytorch#126469

Reviewed By: jerryzh168

Differential Revision: D57463273

Pulled By: xuzhao9

fbshipit-source-id: 64520f18b63107ce5f07447ef7f4a8c841d9ff1f
pytorchmergebot pushed a commit to pytorch/pytorch that referenced this pull request May 20, 2024
Summary:
X-link: pytorch/benchmark#2268

Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.

Test Plan:
```
$ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory

loading model: 0it [00:50, ?it/s]
cuda eval  BERT_pytorch
memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00
running benchmark: 100%
1.003x
```

Reviewed By: jerryzh168

Differential Revision: D57463273

Pull Request resolved: #126469
Approved by: https://github.com/huydhn
@xuzhao9 xuzhao9 closed this May 20, 2024
@xuzhao9 xuzhao9 deleted the export-D57463273 branch May 20, 2024 18:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants