-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cuda time profiles for DY+3j have high non-ME component #994
Comments
specifically, fortran and cpp have 668s, cuda has 826
specifically, fortran has 1945-1910 i.e 35s, cuda has 969-853 i.e. 116s |
valassi
added a commit
to valassi/madgraph4gpu
that referenced
this issue
Sep 11, 2024
… events Note the large overhead in cuda results madgraph5#994 ./parseGridpackLogs.sh pp_dy3j.mad | tee pp_dy3j.mad/summary.txt
valassi
added a commit
to valassi/madgraph4gpu
that referenced
this issue
Sep 14, 2024
Note that there is still a large overhead in cuda results madgraph5#994, but on dy+4j this is background noise... ./parseGridpackLogs.sh pp_dy4j.mad | tee pp_dy4j.mad/summary.txt
valassi
changed the title
Cuda time profiles for DY+3j have high non-ME component and high 'python/bash' component
Cuda time profiles for DY+3j have high non-ME component
Sep 16, 2024
I have stripped off the python/bash component to #1000 (for cuda but not only!). Instead here I keep only the non-ME madevent component (in cuda). |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Yesterday I ran some very first tests of cuda DY+3j with (OLD) timers in PR #948.
The cuda profiles are clearly weird
This is for 500 events
The text was updated successfully, but these errors were encountered: