Merge pull request #716 from LuxDL/ap/fix_amdtests_takeii
Try fixing AMDGPU test stalling
Showing 9 changed files with 38 additions and 40 deletions.
f1b8c12
@JuliaRegistrator register
Registration pull request created: JuliaRegistries/General/109318
Tip: Release Notes
Did you know you can add release notes too? Just add markdown formatted text underneath the comment after the text
"Release notes:" and it will be added to the registry PR, and if TagBot is installed it will also be added to the
release that TagBot creates. i.e.
To add them here just re-invoke and the PR will be updated.
Tagging
After the above pull request is merged, it is recommended that a tag is created on this repository for the registered package version.
This will be done automatically if the Julia TagBot GitHub Action is installed, or can be done manually through the GitHub interface, or via:
Benchmark Results
| Benchmark suite | Current (ns) | Previous (ns) | Ratio |
|---|---|---|---|
| Dense(2 => 2)/cpu/reverse/ReverseDiff (compiled)/(2, 128) | 3683.125 | 3694.375 | 1.00 |
| Dense(2 => 2)/cpu/reverse/Zygote/(2, 128) | 7288.666666666667 | 7175.4 | 1.02 |
| Dense(2 => 2)/cpu/reverse/Tracker/(2, 128) | 20909 | 21109 | 0.99 |
| Dense(2 => 2)/cpu/reverse/ReverseDiff/(2, 128) | 9847.3 | 9923.5 | 0.99 |
| Dense(2 => 2)/cpu/reverse/Flux/(2, 128) | 9238.375 | 8936.8 | 1.03 |
| Dense(2 => 2)/cpu/reverse/SimpleChains/(2, 128) | 4527.125 | 4492.25 | 1.01 |
| Dense(2 => 2)/cpu/reverse/Enzyme/(2, 128) | 1168.5407407407408 | 1164.4202898550725 | 1.00 |
| Dense(2 => 2)/cpu/forward/NamedTuple/(2, 128) | 1176.1526717557251 | 1112.5704225352113 | 1.06 |
| Dense(2 => 2)/cpu/forward/ComponentArray/(2, 128) | 1186.4857142857143 | 1178 | 1.01 |
| Dense(2 => 2)/cpu/forward/Flux/(2, 128) | 1782.859375 | 1797.4705882352941 | 0.99 |
| Dense(2 => 2)/cpu/forward/SimpleChains/(2, 128) | 179.37413073713492 | 180.1279554937413 | 1.00 |
| Dense(20 => 20)/cpu/reverse/ReverseDiff (compiled)/(20, 128) | 17342 | 17353 | 1.00 |
| Dense(20 => 20)/cpu/reverse/Zygote/(20, 128) | 17022 | 17052 | 1.00 |
| Dense(20 => 20)/cpu/reverse/Tracker/(20, 128) | 37380 | 37640 | 0.99 |
| Dense(20 => 20)/cpu/reverse/ReverseDiff/(20, 128) | 29484.5 | 29785 | 0.99 |
| Dense(20 => 20)/cpu/reverse/Flux/(20, 128) | 21770 | 21450 | 1.01 |
| Dense(20 => 20)/cpu/reverse/SimpleChains/(20, 128) | 17477.5 | 17402 | 1.00 |
| Dense(20 => 20)/cpu/reverse/Enzyme/(20, 128) | 4316.571428571428 | 4325.142857142857 | 1.00 |
| Dense(20 => 20)/cpu/forward/NamedTuple/(20, 128) | 3864.625 | 3876 | 1.00 |
| Dense(20 => 20)/cpu/forward/ComponentArray/(20, 128) | 3923.5 | 3953.625 | 0.99 |
| Dense(20 => 20)/cpu/forward/Flux/(20, 128) | 4809 | 4953.428571428572 | 0.97 |
| Dense(20 => 20)/cpu/forward/SimpleChains/(20, 128) | 1660.1 | 1652.1 | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 3, 128) | 39311146 | 47320777 | 0.83 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Zygote/(64, 64, 3, 128) | 57818439 | 58305356 | 0.99 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Tracker/(64, 64, 3, 128) | 70725143 | 102789420 | 0.69 |
| Conv((3, 3), 3 => 3)/cpu/reverse/ReverseDiff/(64, 64, 3, 128) | 89020101 | 95601238 | 0.93 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Flux/(64, 64, 3, 128) | 72846612 | 78618619 | 0.93 |
| Conv((3, 3), 3 => 3)/cpu/reverse/SimpleChains/(64, 64, 3, 128) | 12056878.5 | 11718436 | 1.03 |
| Conv((3, 3), 3 => 3)/cpu/reverse/Enzyme/(64, 64, 3, 128) | 17802524.5 | 17850771.5 | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/forward/NamedTuple/(64, 64, 3, 128) | 7028063 | 7036938 | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/forward/ComponentArray/(64, 64, 3, 128) | 7000092.5 | 7001847 | 1.00 |
| Conv((3, 3), 3 => 3)/cpu/forward/Flux/(64, 64, 3, 128) | 9924699 | 11538399 | 0.86 |
| Conv((3, 3), 3 => 3)/cpu/forward/SimpleChains/(64, 64, 3, 128) | 6389608 | 6393151.5 | 1.00 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 16) | 737562829 | 751448443 | 0.98 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 64) | 2545549640 | 2573472812 | 0.99 |
| vgg16/cpu/reverse/Zygote/(32, 32, 3, 2) | 146821325 | 144720323 | 1.01 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 16) | 868615027 | 968831713.5 | 0.90 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 64) | 3064060217 | 3278943882 | 0.93 |
| vgg16/cpu/reverse/Tracker/(32, 32, 3, 2) | 219512795 | 234124583 | 0.94 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 16) | 685678726 | 746070446 | 0.92 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 64) | 2574375943 | 3009137266 | 0.86 |
| vgg16/cpu/reverse/Flux/(32, 32, 3, 2) | 127147427 | 132644713.5 | 0.96 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 16) | 171884482 | 174325259 | 0.99 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 64) | 650293250.5 | 647357046.5 | 1.00 |
| vgg16/cpu/forward/NamedTuple/(32, 32, 3, 2) | 34511836 | 34732801 | 0.99 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 16) | 164391167.5 | 164171075.5 | 1.00 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 64) | 634653416 | 641446171 | 0.99 |
| vgg16/cpu/forward/ComponentArray/(32, 32, 3, 2) | 29977086.5 | 30107004 | 1.00 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 16) | 185946798 | 189802799.5 | 0.98 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 64) | 765662897.5 | 799823428 | 0.96 |
| vgg16/cpu/forward/Flux/(32, 32, 3, 2) | 35241726.5 | 38276609 | 0.92 |
| Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 64, 128) | 1245538918.5 | 1306917435 | 0.95 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Zygote/(64, 64, 64, 128) | 1864879281 | 1880303414 | 0.99 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Tracker/(64, 64, 64, 128) | 2293551179 | 2465824739 | 0.93 |
| Conv((3, 3), 64 => 64)/cpu/reverse/ReverseDiff/(64, 64, 64, 128) | 2516850614 | 2587857217 | 0.97 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Flux/(64, 64, 64, 128) | 1882887952.5 | 1920389453.5 | 0.98 |
| Conv((3, 3), 64 => 64)/cpu/reverse/Enzyme/(64, 64, 64, 128) | 561045265 | 561226426 | 1.00 |
| Conv((3, 3), 64 => 64)/cpu/forward/NamedTuple/(64, 64, 64, 128) | 326179109 | 325726548 | 1.00 |
| Conv((3, 3), 64 => 64)/cpu/forward/ComponentArray/(64, 64, 64, 128) | 323271956 | 323189696 | 1.00 |
| Conv((3, 3), 64 => 64)/cpu/forward/Flux/(64, 64, 64, 128) | 349888101 | 472300185.5 | 0.74 |
| Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 1, 128) | 11973548 | 11879578 | 1.01 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Zygote/(64, 64, 1, 128) | 17858872 | 18066903 | 0.99 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Tracker/(64, 64, 1, 128) | 19168560 | 19358439.5 | 0.99 |
| Conv((3, 3), 1 => 1)/cpu/reverse/ReverseDiff/(64, 64, 1, 128) | 23865197 | 24037285 | 0.99 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Flux/(64, 64, 1, 128) | 17866720 | 18030067 | 0.99 |
| Conv((3, 3), 1 => 1)/cpu/reverse/SimpleChains/(64, 64, 1, 128) | 1158234 | 1161439 | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/reverse/Enzyme/(64, 64, 1, 128) | 5814007 | 5877613 | 0.99 |
| Conv((3, 3), 1 => 1)/cpu/forward/NamedTuple/(64, 64, 1, 128) | 2054540.5 | 2061078 | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/forward/ComponentArray/(64, 64, 1, 128) | 2037248 | 2052642 | 0.99 |
| Conv((3, 3), 1 => 1)/cpu/forward/Flux/(64, 64, 1, 128) | 2078324 | 2085073 | 1.00 |
| Conv((3, 3), 1 => 1)/cpu/forward/SimpleChains/(64, 64, 1, 128) | 202510.5 | 207838 | 0.97 |
| Dense(200 => 200)/cpu/reverse/ReverseDiff (compiled)/(200, 128) | 293437.5 | 297415 | 0.99 |
| Dense(200 => 200)/cpu/reverse/Zygote/(200, 128) | 266057.5 | 267444.5 | 0.99 |
| Dense(200 => 200)/cpu/reverse/Tracker/(200, 128) | 365572 | 369540 | 0.99 |
| Dense(200 => 200)/cpu/reverse/ReverseDiff/(200, 128) | 407804 | 411308 | 0.99 |
| Dense(200 => 200)/cpu/reverse/Flux/(200, 128) | 275034 | 277337.5 | 0.99 |
| Dense(200 => 200)/cpu/reverse/SimpleChains/(200, 128) | 411080 | 409664.5 | 1.00 |
| Dense(200 => 200)/cpu/reverse/Enzyme/(200, 128) | 83504 | 83486 | 1.00 |
| Dense(200 => 200)/cpu/forward/NamedTuple/(200, 128) | 81180.5 | 81302 | 1.00 |
| Dense(200 => 200)/cpu/forward/ComponentArray/(200, 128) | 81631 | 85018 | 0.96 |
| Dense(200 => 200)/cpu/forward/Flux/(200, 128) | 86775.5 | 87734 | 0.99 |
| Dense(200 => 200)/cpu/forward/SimpleChains/(200, 128) | 104563 | 104626 | 1.00 |
| Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff (compiled)/(64, 64, 16, 128) | 203633792 | 208418135 | 0.98 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Zygote/(64, 64, 16, 128) | 328082047.5 | 329863332.5 | 0.99 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Tracker/(64, 64, 16, 128) | 399733123 | 437868758 | 0.91 |
| Conv((3, 3), 16 => 16)/cpu/reverse/ReverseDiff/(64, 64, 16, 128) | 429567326 | 473245652.5 | 0.91 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Flux/(64, 64, 16, 128) | 375921768 | 409579507.5 | 0.92 |
| Conv((3, 3), 16 => 16)/cpu/reverse/SimpleChains/(64, 64, 16, 128) | 328704380 | 338434555 | 0.97 |
| Conv((3, 3), 16 => 16)/cpu/reverse/Enzyme/(64, 64, 16, 128) | 101203246 | 101758684 | 0.99 |
| Conv((3, 3), 16 => 16)/cpu/forward/NamedTuple/(64, 64, 16, 128) | 43990642 | 43942909 | 1.00 |
| Conv((3, 3), 16 => 16)/cpu/forward/ComponentArray/(64, 64, 16, 128) | 43821294.5 | 43793713 | 1.00 |
| Conv((3, 3), 16 => 16)/cpu/forward/Flux/(64, 64, 16, 128) | 53275150 | 57038485 | 0.93 |
| Conv((3, 3), 16 => 16)/cpu/forward/SimpleChains/(64, 64, 16, 128) | 28607335 | 28142581.5 | 1.02 |
| Dense(2000 => 2000)/cpu/reverse/ReverseDiff (compiled)/(2000, 128) | 19166105 | 19007086 | 1.01 |
| Dense(2000 => 2000)/cpu/reverse/Zygote/(2000, 128) | 19549447.5 | 19599865 | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Tracker/(2000, 128) | 23387251 | 23608296 | 0.99 |
| Dense(2000 => 2000)/cpu/reverse/ReverseDiff/(2000, 128) | 24155491 | 24199216.5 | 1.00 |
| Dense(2000 => 2000)/cpu/reverse/Flux/(2000, 128) | 19735654 | 19621295 | 1.01 |
| Dense(2000 => 2000)/cpu/reverse/Enzyme/(2000, 128) | 6562123 | 6523963 | 1.01 |
| Dense(2000 => 2000)/cpu/forward/NamedTuple/(2000, 128) | 6547446.5 | 6565571 | 1.00 |
| Dense(2000 => 2000)/cpu/forward/ComponentArray/(2000, 128) | 6511687 | 6584434 | 0.99 |
| Dense(2000 => 2000)/cpu/forward/Flux/(2000, 128) | 6536680 | 6525087.5 | 1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
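For readers checking the numbers: the last value in each benchmark entry appears to be the first timing divided by the second, rounded to two decimals, so values below 1 would mean the first (presumably newer) run was faster. The sketch below is a minimal illustration of that arithmetic, not the workflow's actual code. The helper names `ratio` and `flag_regressions` and the 1.05 threshold are hypothetical; only the three timing pairs are taken from the results above.

```python
# Minimal sketch: reproduce the ratio column from a pair of timings.
# Assumption: ratio = first timing / second timing, both in nanoseconds.

def ratio(first_ns: float, second_ns: float) -> float:
    """Return first/second rounded to two decimals, as shown in the results."""
    return round(first_ns / second_ns, 2)


def flag_regressions(rows, threshold=1.05):
    """Return (name, ratio, is_regression) for each (name, first, second) row.

    `threshold` is a hypothetical cutoff chosen for illustration only.
    """
    out = []
    for name, first_ns, second_ns in rows:
        r = ratio(first_ns, second_ns)
        out.append((name, r, r > threshold))
    return out


# Three entries copied from the benchmark results above.
rows = [
    ("Conv((3, 3), 3 => 3)/cpu/reverse/Tracker/(64, 64, 3, 128)",
     70725143, 102789420),
    ("Dense(2 => 2)/cpu/forward/NamedTuple/(2, 128)",
     1176.1526717557251, 1112.5704225352113),
    ("Conv((3, 3), 64 => 64)/cpu/forward/Flux/(64, 64, 64, 128)",
     349888101, 472300185.5),
]

for name, r, regressed in flag_regressions(rows):
    print(f"{name}: ratio={r:.2f} regression={regressed}")
```

With these inputs the computed ratios match the reported 0.69, 1.06, and 0.74. Differences of a few percent on nanosecond-scale entries may well be measurement noise rather than real changes.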