You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
f570fcc
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reactant.jl Benchmarks
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme)
1340567545
ns1331366172
ns1.01
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant
1354795677
ns1333147663
ns1.02
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme)
1296358153
ns1530087148
ns0.85
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme)
2617478292
ns3054505913
ns0.86
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Lux
207121854
ns230499086
ns0.90
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme)
5245986343
ns5310260095
ns0.99
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant
5473784946
ns5117023688
ns1.07
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme)
5562801011
ns5696115262
ns0.98
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme)
6785865699
ns6877852849
ns0.99
ViT base (256 x 256 x 3 x 32)/forward/CPU/Lux
28788392011
ns31332890298
ns0.92
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme)
1329681507
ns1377593493
ns0.97
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant
1310470804
ns1419824350
ns0.92
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme)
1322248869
ns1370284838
ns0.96
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme)
2593706297
ns2678832914
ns0.97
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Lux
8538279.5
ns8674244
ns0.98
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme)
1569392248
ns1711793645
ns0.92
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant
1563566948
ns1593142398
ns0.98
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme)
1601308923.5
ns1553322361
ns1.03
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme)
2743850639
ns2788658689
ns0.98
ViT small (256 x 256 x 3 x 4)/forward/CPU/Lux
2498208075
ns3829077050
ns0.65
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme)
1314553261
ns1292126503
ns1.02
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant
1520974050.5
ns1255035565.5
ns1.21
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme)
1289846630
ns1241708897.5
ns1.04
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme)
2616640648
ns2627736445
ns1.00
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Lux
21421171
ns21072763
ns1.02
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme)
2256971186
ns2141201304
ns1.05
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant
2259019481
ns2150005669
ns1.05
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme)
2253385981
ns2153988118
ns1.05
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme)
3571597383
ns3403989575
ns1.05
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Lux
6399412347.5
ns6112741293.5
ns1.05
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme)
1315428709.5
ns1319271524
ns1.00
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant
1285752949.5
ns1321146725.5
ns0.97
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme)
1352009040.5
ns1336696942
ns1.01
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme)
2483558090
ns2956277164
ns0.84
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Lux
7375914.5
ns7206908
ns1.02
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme)
1471123545
ns1446752820
ns1.02
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant
1460958972
ns1420199736
ns1.03
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme)
1467352849
ns1419361542
ns1.03
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme)
2771232207
ns2620658225
ns1.06
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Lux
1067840903.5
ns1343552629
ns0.79
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme)
1265463942
ns1339740837
ns0.94
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant
1336662627
ns1297025776.5
ns1.03
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme)
1343776367
ns1307783663.5
ns1.03
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme)
2625246543
ns2422904622
ns1.08
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Lux
15448376
ns13705203.5
ns1.13
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme)
1762453178
ns1689997595
ns1.04
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant
1744480870
ns1709524846
ns1.02
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme)
1747667945
ns1699909787
ns1.03
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme)
3050720896
ns2910898852
ns1.05
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Lux
2931138896
ns3172820959
ns0.92
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme)
1334489477
ns1272957214
ns1.05
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant
1354072030
ns1288787604
ns1.05
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme)
1326138010.5
ns1292898756
ns1.03
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme)
2585240240
ns2528780375
ns1.02
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Lux
25592502.5
ns25554565
ns1.00
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme)
2380176040
ns2158163346
ns1.10
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant
2263350697
ns2164587691
ns1.05
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme)
2217766749
ns2157132364
ns1.03
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme)
3470178449
ns3395183827
ns1.02
ViT small (256 x 256 x 3 x 16)/forward/CPU/Lux
7784948885.5
ns6334070118
ns1.23
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme)
1267787779
ns1228186625
ns1.03
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant
1334201675
ns1202678487.5
ns1.11
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme)
1233375595
ns1189947014.5
ns1.04
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme)
2426265268
ns2346429480
ns1.03
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Lux
50717982
ns50144272.5
ns1.01
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme)
3180815831
ns2973474372
ns1.07
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant
3006942334
ns3005386081
ns1.00
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme)
3048786017
ns2959359145
ns1.03
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme)
4420144603
ns4361243109
ns1.01
ViT small (256 x 256 x 3 x 32)/forward/CPU/Lux
8266182250
ns9378921820
ns0.88
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme)
1289431106
ns1247107066
ns1.03
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant
1307821758
ns1226562970
ns1.07
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme)
1319155148
ns1232975234.5
ns1.07
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme)
2570243985
ns2344405060
ns1.10
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Lux
68121207.5
ns67888329.5
ns1.00
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme)
3184641683
ns3168087012
ns1.01
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant
3196688266
ns3147204975
ns1.02
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme)
3219193931
ns3238059309
ns0.99
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme)
4600542400
ns4525581424
ns1.02
ViT base (256 x 256 x 3 x 16)/forward/CPU/Lux
14366625373
ns13793978482
ns1.04
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme)
1248144802
ns1191983955
ns1.05
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant
1280560424
ns1202390873
ns1.07
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme)
1260333857
ns1204541731
ns1.05
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme)
2541156555
ns2466522108
ns1.03
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Lux
19634575
ns19336923
ns1.02
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme)
1915886353
ns2207876303
ns0.87
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant
1903714614
ns1998857639
ns0.95
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme)
1892177049
ns1904292223
ns0.99
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme)
3107063618
ns3061997168
ns1.01
ViT base (256 x 256 x 3 x 4)/forward/CPU/Lux
3075524538.5
ns3696727697
ns0.83
This comment was automatically generated by workflow using github-action-benchmark.