CUDA: generalize FP16 fattn vec kernel #7061

Merged 7 commits on May 9, 2024

Commits on May 9, 2024

  1. 48463c0
  2. 86636bd
  3. 617f129: try AMD fix (JohannesGaessler committed May 9, 2024)
  4. d9bcb92: fix batch size 2-8 (JohannesGaessler committed May 9, 2024)
  5. fa81c3a
  6. 2272765
  7. fece1fe