Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FP8 PTQ With Physical Dependency 0515 #8443

Open
wants to merge 1 commit into
base: develop
Choose a base branch
from

Conversation

YZW-explorer
Copy link

@YZW-explorer YZW-explorer commented May 15, 2024

PR types

New features

PR changes

APIs

Description

PaddleNLP设计逻辑:
llm/fp8quant.py
定义FP8的量化逻辑,将FP8UniformObserver写入QuantConfig中
llm/fp8finetune_generation.py
调用llm/fp8quant.py中的量化逻辑,完成全部量化过程

Copy link

paddle-bot bot commented May 15, 2024

Thanks for your contribution!

Copy link

codecov bot commented May 15, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 55.42%. Comparing base (5da340e) to head (8ff1f64).

Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #8443   +/-   ##
========================================
  Coverage    55.42%   55.42%           
========================================
  Files          617      617           
  Lines        96281    96281           
========================================
  Hits         53366    53366           
  Misses       42915    42915           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants