-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Computation swp #1
base: swp-base-may-28
Are you sure you want to change the base?
Commits on Jun 14, 2024
-
[Tutorial] fix autotune for flash attention (triton-lang#4046)
Tuning configs should depend on HEAD_DIM. Also fix the input layout of v for fp8.
Configuration menu - View commit details
-
Copy full SHA for 0620175 - Browse repository at this point
Copy the full SHA 0620175View commit details -
do not segfault for reduce on forOp arguments (a PR in draft mode)
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for c6f8a81 - Browse repository at this point
Copy the full SHA c6f8a81View commit details -
fp8 FA: head dim 128, seq len 16384, causal is false
Summary: fixed tuning config, fp8 only Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for bf89787 - Browse repository at this point
Copy the full SHA bf89787View commit details -
fp8 FA: support TMA with fixed block size
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for bee081c - Browse repository at this point
Copy the full SHA bee081cView commit details -
Summary: SWP_FIRST_DOT + no PEEL_EPILOGUE SWP_FIRST_DOT + PEEL_EPILOGUE SWP_FIRST_DOT + PEEL_EPILOGUE + MERGE_FIRST_PEEL Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for 3790057 - Browse repository at this point
Copy the full SHA 3790057View commit details
Commits on Jun 20, 2024
-
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for 93f1f61 - Browse repository at this point
Copy the full SHA 93f1f61View commit details -
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for 0bc5294 - Browse repository at this point
Copy the full SHA 0bc5294View commit details -
fix FIRST_DOT by adding dependency from tmaCopy to wait
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for 14feee3 - Browse repository at this point
Copy the full SHA 14feee3View commit details -
[fp8 FA] base will be TMA, the other one is without TMA
Summary: We compare the implementations Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for c9fd91b - Browse repository at this point
Copy the full SHA c9fd91bView commit details
Commits on Jul 11, 2024
-
LOAD_DIFFERENT_STAGE, remove comparison of results
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Configuration menu - View commit details
-
Copy full SHA for 01419b1 - Browse repository at this point
Copy the full SHA 01419b1View commit details