Optimise OpenCL kernels #7
Labels: enhancement (New feature or request)
Comments
OpenCL particle induced velocity: see commit ae683f3. Approximately 7.5% faster.
Not used:
Verified good:
To do:
NB:
commit d1a77dc: ind_dvort is approximately 27% faster.
As above, there is some overhead in the Julia script used for benchmarking, 3% in the worst case.
commit 6d962b7: ind_dvort is 1.5% faster.
commit 4db32f2: Changing to the transpose method for the dvort calculations allows a cross product to be reused. 7% faster.
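The kernel source is not shown in this thread, so the following is only a sketch of one way a cross product can be reused, not the code from commit 4db32f2. It assumes the calculation needs both `a x b` and the scalar triple product `a . (r x b)`; the identity `a . (r x b) = -r . (a x b)` then lets the second quantity be obtained from the first with a single dot product. The `vec3` type and all function names are hypothetical (an OpenCL kernel would more likely use `float3` and the built-in `cross` and `dot`).

```c
/* Hypothetical 3-vector type, standing in for OpenCL's float3. */
typedef struct { double x, y, z; } vec3;

/* Standard right-handed cross product a x b. */
static vec3 cross(vec3 a, vec3 b) {
    vec3 c = { a.y * b.z - a.z * b.y,
               a.z * b.x - a.x * b.z,
               a.x * b.y - a.y * b.x };
    return c;
}

static double dot(vec3 a, vec3 b) {
    return a.x * b.x + a.y * b.y + a.z * b.z;
}

/* Direct form: a . (r x b). Costs a second cross product
   if a x b has already been computed elsewhere. */
static double triple_direct(vec3 a, vec3 r, vec3 b) {
    return dot(a, cross(r, b));
}

/* Reuse form: given axb = a x b (already computed), the same
   value follows from the identity a . (r x b) = -r . (a x b). */
static double triple_reused(vec3 r, vec3 axb) {
    return -dot(r, axb);
}
```

Whether this particular identity is the saving behind the 7% figure is an assumption; the point is only that a formulation sharing one cross product between two terms does less arithmetic per particle pair.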
Surely the code can be faster?