forked from icl-utk-edu/papi
-
Notifications
You must be signed in to change notification settings - Fork 0
/
ChangeLogP700.txt
4978 lines (4269 loc) · 237 KB
/
ChangeLogP700.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
2022-11-10 Giuseppe Congiu <gcongiu@icl.utk.edu>
* ChangeLogP700.txt, RELEASENOTES.txt, doc/Doxyfile-common,
man/man1/PAPI_derived_event_files.1, man/man1/papi_avail.1,
man/man1/papi_clockres.1, man/man1/papi_command_line.1,
man/man1/papi_component_avail.1, man/man1/papi_cost.1,
man/man1/papi_decode.1, man/man1/papi_error_codes.1,
man/man1/papi_event_chooser.1, man/man1/papi_hardware_avail.1,
man/man1/papi_hybrid_native_avail.1, man/man1/papi_mem_info.1,
man/man1/papi_multiplex_cost.1, man/man1/papi_native_avail.1,
man/man1/papi_version.1, man/man1/papi_xml_event_info.1,
man/man3/PAPIF_accum.3, man/man3/PAPIF_add_event.3,
man/man3/PAPIF_add_events.3, man/man3/PAPIF_add_named_event.3,
man/man3/PAPIF_assign_eventset_component.3,
man/man3/PAPIF_cleanup_eventset.3,
man/man3/PAPIF_create_eventset.3,
man/man3/PAPIF_destroy_eventset.3, man/man3/PAPIF_enum_dev_type.3,
man/man3/PAPIF_enum_event.3, man/man3/PAPIF_epc.3,
man/man3/PAPIF_event_code_to_name.3,
man/man3/PAPIF_event_name_to_code.3, man/man3/PAPIF_flips_rate.3,
man/man3/PAPIF_flops_rate.3, man/man3/PAPIF_get_clockrate.3,
man/man3/PAPIF_get_dev_attr.3, man/man3/PAPIF_get_dev_type_attr.3,
man/man3/PAPIF_get_dmem_info.3, man/man3/PAPIF_get_domain.3,
man/man3/PAPIF_get_event_info.3, man/man3/PAPIF_get_exe_info.3,
man/man3/PAPIF_get_granularity.3,
man/man3/PAPIF_get_hardware_info.3, man/man3/PAPIF_get_multiplex.3,
man/man3/PAPIF_get_preload.3, man/man3/PAPIF_get_real_cyc.3,
man/man3/PAPIF_get_real_nsec.3, man/man3/PAPIF_get_real_usec.3,
man/man3/PAPIF_get_virt_cyc.3, man/man3/PAPIF_get_virt_usec.3,
man/man3/PAPIF_ipc.3, man/man3/PAPIF_is_initialized.3,
man/man3/PAPIF_library_init.3, man/man3/PAPIF_lock.3,
man/man3/PAPIF_multiplex_init.3, man/man3/PAPIF_num_cmp_hwctrs.3,
man/man3/PAPIF_num_events.3, man/man3/PAPIF_num_hwctrs.3,
man/man3/PAPIF_perror.3, man/man3/PAPIF_query_event.3,
man/man3/PAPIF_query_named_event.3, man/man3/PAPIF_rate_stop.3,
man/man3/PAPIF_read.3, man/man3/PAPIF_read_ts.3,
man/man3/PAPIF_register_thread.3, man/man3/PAPIF_remove_event.3,
man/man3/PAPIF_remove_events.3,
man/man3/PAPIF_remove_named_event.3, man/man3/PAPIF_reset.3,
man/man3/PAPIF_set_cmp_domain.3,
man/man3/PAPIF_set_cmp_granularity.3, man/man3/PAPIF_set_debug.3,
man/man3/PAPIF_set_domain.3, man/man3/PAPIF_set_event_domain.3,
man/man3/PAPIF_set_granularity.3, man/man3/PAPIF_set_inherit.3,
man/man3/PAPIF_set_multiplex.3, man/man3/PAPIF_shutdown.3,
man/man3/PAPIF_start.3, man/man3/PAPIF_state.3,
man/man3/PAPIF_stop.3, man/man3/PAPIF_thread_id.3,
man/man3/PAPIF_thread_init.3, man/man3/PAPIF_unlock.3,
man/man3/PAPIF_unregister_thread.3, man/man3/PAPIF_write.3,
man/man3/PAPI_accum.3, man/man3/PAPI_add_event.3,
man/man3/PAPI_add_events.3, man/man3/PAPI_add_named_event.3,
man/man3/PAPI_addr_range_option_t.3, man/man3/PAPI_address_map_t.3,
man/man3/PAPI_all_thr_spec_t.3,
man/man3/PAPI_assign_eventset_component.3, man/man3/PAPI_attach.3,
man/man3/PAPI_attach_option_t.3, man/man3/PAPI_cleanup_eventset.3,
man/man3/PAPI_component_info_t.3, man/man3/PAPI_cpu_option_t.3,
man/man3/PAPI_create_eventset.3, man/man3/PAPI_debug_option_t.3,
man/man3/PAPI_destroy_eventset.3, man/man3/PAPI_detach.3,
man/man3/PAPI_disable_component.3,
man/man3/PAPI_disable_component_by_name.3,
man/man3/PAPI_dmem_info_t.3, man/man3/PAPI_domain_option_t.3,
man/man3/PAPI_enum_cmp_event.3, man/man3/PAPI_enum_dev_type.3,
man/man3/PAPI_enum_event.3, man/man3/PAPI_epc.3,
man/man3/PAPI_event_code_to_name.3, man/man3/PAPI_event_info_t.3,
man/man3/PAPI_event_name_to_code.3, man/man3/PAPI_exe_info_t.3,
man/man3/PAPI_flips_rate.3, man/man3/PAPI_flops_rate.3,
man/man3/PAPI_get_cmp_opt.3, man/man3/PAPI_get_component_index.3,
man/man3/PAPI_get_component_info.3, man/man3/PAPI_get_dev_attr.3,
man/man3/PAPI_get_dev_type_attr.3, man/man3/PAPI_get_dmem_info.3,
man/man3/PAPI_get_event_component.3,
man/man3/PAPI_get_event_info.3,
man/man3/PAPI_get_eventset_component.3,
man/man3/PAPI_get_executable_info.3,
man/man3/PAPI_get_hardware_info.3, man/man3/PAPI_get_multiplex.3,
man/man3/PAPI_get_opt.3, man/man3/PAPI_get_overflow_event_index.3,
man/man3/PAPI_get_real_cyc.3, man/man3/PAPI_get_real_nsec.3,
man/man3/PAPI_get_real_usec.3, man/man3/PAPI_get_shared_lib_info.3,
man/man3/PAPI_get_thr_specific.3, man/man3/PAPI_get_virt_cyc.3,
man/man3/PAPI_get_virt_nsec.3, man/man3/PAPI_get_virt_usec.3,
man/man3/PAPI_granularity_option_t.3, man/man3/PAPI_hl_read.3,
man/man3/PAPI_hl_region_begin.3, man/man3/PAPI_hl_region_end.3,
man/man3/PAPI_hl_stop.3, man/man3/PAPI_hw_info_t.3,
man/man3/PAPI_inherit_option_t.3, man/man3/PAPI_ipc.3,
man/man3/PAPI_is_initialized.3, man/man3/PAPI_itimer_option_t.3,
man/man3/PAPI_library_init.3, man/man3/PAPI_list_events.3,
man/man3/PAPI_list_threads.3, man/man3/PAPI_lock.3,
man/man3/PAPI_mh_cache_info_t.3, man/man3/PAPI_mh_info_t.3,
man/man3/PAPI_mh_level_t.3, man/man3/PAPI_mh_tlb_info_t.3,
man/man3/PAPI_mpx_info_t.3, man/man3/PAPI_multiplex_init.3,
man/man3/PAPI_multiplex_option_t.3, man/man3/PAPI_num_cmp_hwctrs.3,
man/man3/PAPI_num_components.3, man/man3/PAPI_num_events.3,
man/man3/PAPI_num_hwctrs.3, man/man3/PAPI_option_t.3,
man/man3/PAPI_overflow.3, man/man3/PAPI_perror.3,
man/man3/PAPI_preload_info_t.3, man/man3/PAPI_profil.3,
man/man3/PAPI_query_event.3, man/man3/PAPI_query_named_event.3,
man/man3/PAPI_rate_stop.3, man/man3/PAPI_read.3,
man/man3/PAPI_read_ts.3, man/man3/PAPI_register_thread.3,
man/man3/PAPI_remove_event.3, man/man3/PAPI_remove_events.3,
man/man3/PAPI_remove_named_event.3, man/man3/PAPI_reset.3,
man/man3/PAPI_set_cmp_domain.3,
man/man3/PAPI_set_cmp_granularity.3, man/man3/PAPI_set_debug.3,
man/man3/PAPI_set_domain.3, man/man3/PAPI_set_granularity.3,
man/man3/PAPI_set_multiplex.3, man/man3/PAPI_set_opt.3,
man/man3/PAPI_set_thr_specific.3, man/man3/PAPI_shlib_info_t.3,
man/man3/PAPI_shutdown.3, man/man3/PAPI_sprofil.3,
man/man3/PAPI_sprofil_t.3, man/man3/PAPI_start.3,
man/man3/PAPI_state.3, man/man3/PAPI_stop.3,
man/man3/PAPI_strerror.3, man/man3/PAPI_thread_id.3,
man/man3/PAPI_thread_init.3, man/man3/PAPI_unlock.3,
man/man3/PAPI_unregister_thread.3, man/man3/PAPI_write.3,
man/man3/PAPIf_hl_read.3, man/man3/PAPIf_hl_region_begin.3,
man/man3/PAPIf_hl_region_end.3, man/man3/PAPIf_hl_stop.3,
man/man3/RateInfo.3, man/man3/binary_tree_t.3,
man/man3/components_t.3, man/man3/local_components_t.3,
man/man3/reads_t.3, man/man3/regions_t.3, man/man3/threads_t.3,
man/man3/value_t.3, papi.spec, src/Makefile.in, src/configure,
src/configure.in, src/papi.h: release: preparation for release
commit - Update documentation - Update version
* src/validation_tests/papi_br_tkn.c: papi_br_tkn: add not taken
branch event to the right eventset The branch not taken event is
added to the eventset for branch taken. Add the not taken event to
the right eventset.
* man/man3/PAPIF_enum_dev_type.3, man/man3/PAPIF_get_dev_attr.3,
man/man3/PAPIF_get_dev_type_attr.3: sysdetect: add missing fortran
man pages Man pages for PAPIF_enum_dev_type,
PAPIF_get_dev_type_attr and PAPIF_get_dev_attr were missing.
2022-11-08 Giuseppe Congiu <gcongiu@icl.utk.edu>
* .../sysdetect/tests/query_device_simple_f.F: sysdetect: update test
to reflect 'list' argument removal Commit 482e8c5f1 removed the
'list' argument from papif_get_dev_attr fortran wrapper. However,
the test still passed 'dummy_list' to every call of the function.
This cause the len of the string to be read from the wrong argument
and the following 'strncpy' to segfault.
2022-11-02 Giuseppe Congiu <gcongiu@icl.utk.edu>
* man/man1/PAPI_derived_event_files.1, man/man1/papi_avail.1,
man/man1/papi_clockres.1, man/man1/papi_command_line.1,
man/man1/papi_component_avail.1, man/man1/papi_cost.1,
man/man1/papi_decode.1, man/man1/papi_error_codes.1,
man/man1/papi_event_chooser.1, man/man1/papi_hardware_avail.1,
man/man1/papi_hybrid_native_avail.1, man/man1/papi_mem_info.1,
man/man1/papi_multiplex_cost.1, man/man1/papi_native_avail.1,
man/man1/papi_version.1, man/man1/papi_xml_event_info.1,
man/man3/PAPIF_accum.3, man/man3/PAPIF_add_event.3,
man/man3/PAPIF_add_events.3, man/man3/PAPIF_add_named_event.3,
man/man3/PAPIF_assign_eventset_component.3,
man/man3/PAPIF_cleanup_eventset.3,
man/man3/PAPIF_create_eventset.3,
man/man3/PAPIF_destroy_eventset.3, man/man3/PAPIF_enum_event.3,
man/man3/PAPIF_epc.3, man/man3/PAPIF_event_code_to_name.3,
man/man3/PAPIF_event_name_to_code.3, man/man3/PAPIF_flips_rate.3,
man/man3/PAPIF_flops_rate.3, man/man3/PAPIF_get_clockrate.3,
man/man3/PAPIF_get_dmem_info.3, man/man3/PAPIF_get_domain.3,
man/man3/PAPIF_get_event_info.3, man/man3/PAPIF_get_exe_info.3,
man/man3/PAPIF_get_granularity.3,
man/man3/PAPIF_get_hardware_info.3, man/man3/PAPIF_get_multiplex.3,
man/man3/PAPIF_get_preload.3, man/man3/PAPIF_get_real_cyc.3,
man/man3/PAPIF_get_real_nsec.3, man/man3/PAPIF_get_real_usec.3,
man/man3/PAPIF_get_virt_cyc.3, man/man3/PAPIF_get_virt_usec.3,
man/man3/PAPIF_ipc.3, man/man3/PAPIF_is_initialized.3,
man/man3/PAPIF_library_init.3, man/man3/PAPIF_lock.3,
man/man3/PAPIF_multiplex_init.3, man/man3/PAPIF_num_cmp_hwctrs.3,
man/man3/PAPIF_num_events.3, man/man3/PAPIF_num_hwctrs.3,
man/man3/PAPIF_perror.3, man/man3/PAPIF_query_event.3,
man/man3/PAPIF_query_named_event.3, man/man3/PAPIF_rate_stop.3,
man/man3/PAPIF_read.3, man/man3/PAPIF_read_ts.3,
man/man3/PAPIF_register_thread.3, man/man3/PAPIF_remove_event.3,
man/man3/PAPIF_remove_events.3,
man/man3/PAPIF_remove_named_event.3, man/man3/PAPIF_reset.3,
man/man3/PAPIF_set_cmp_domain.3,
man/man3/PAPIF_set_cmp_granularity.3, man/man3/PAPIF_set_debug.3,
man/man3/PAPIF_set_domain.3, man/man3/PAPIF_set_event_domain.3,
man/man3/PAPIF_set_granularity.3, man/man3/PAPIF_set_inherit.3,
man/man3/PAPIF_set_multiplex.3, man/man3/PAPIF_shutdown.3,
man/man3/PAPIF_start.3, man/man3/PAPIF_state.3,
man/man3/PAPIF_stop.3, man/man3/PAPIF_thread_id.3,
man/man3/PAPIF_thread_init.3, man/man3/PAPIF_unlock.3,
man/man3/PAPIF_unregister_thread.3, man/man3/PAPIF_write.3,
man/man3/PAPI_accum.3, man/man3/PAPI_add_event.3,
man/man3/PAPI_add_events.3, man/man3/PAPI_add_named_event.3,
man/man3/PAPI_addr_range_option_t.3, man/man3/PAPI_address_map_t.3,
man/man3/PAPI_all_thr_spec_t.3,
man/man3/PAPI_assign_eventset_component.3, man/man3/PAPI_attach.3,
man/man3/PAPI_attach_option_t.3, man/man3/PAPI_cleanup_eventset.3,
man/man3/PAPI_component_info_t.3, man/man3/PAPI_cpu_option_t.3,
man/man3/PAPI_create_eventset.3, man/man3/PAPI_debug_option_t.3,
man/man3/PAPI_destroy_eventset.3, man/man3/PAPI_detach.3,
man/man3/PAPI_disable_component.3,
man/man3/PAPI_disable_component_by_name.3,
man/man3/PAPI_dmem_info_t.3, man/man3/PAPI_domain_option_t.3,
man/man3/PAPI_enum_cmp_event.3, man/man3/PAPI_enum_dev_type.3,
man/man3/PAPI_enum_event.3, man/man3/PAPI_epc.3,
man/man3/PAPI_event_code_to_name.3, man/man3/PAPI_event_info_t.3,
man/man3/PAPI_event_name_to_code.3, man/man3/PAPI_exe_info_t.3,
man/man3/PAPI_flips_rate.3, man/man3/PAPI_flops_rate.3,
man/man3/PAPI_get_cmp_opt.3, man/man3/PAPI_get_component_index.3,
man/man3/PAPI_get_component_info.3, man/man3/PAPI_get_dev_attr.3,
man/man3/PAPI_get_dev_type_attr.3, man/man3/PAPI_get_dmem_info.3,
man/man3/PAPI_get_event_component.3,
man/man3/PAPI_get_event_info.3,
man/man3/PAPI_get_eventset_component.3,
man/man3/PAPI_get_executable_info.3,
man/man3/PAPI_get_hardware_info.3, man/man3/PAPI_get_multiplex.3,
man/man3/PAPI_get_opt.3, man/man3/PAPI_get_overflow_event_index.3,
man/man3/PAPI_get_real_cyc.3, man/man3/PAPI_get_real_nsec.3,
man/man3/PAPI_get_real_usec.3, man/man3/PAPI_get_shared_lib_info.3,
man/man3/PAPI_get_thr_specific.3, man/man3/PAPI_get_virt_cyc.3,
man/man3/PAPI_get_virt_nsec.3, man/man3/PAPI_get_virt_usec.3,
man/man3/PAPI_granularity_option_t.3, man/man3/PAPI_hl_read.3,
man/man3/PAPI_hl_region_begin.3, man/man3/PAPI_hl_region_end.3,
man/man3/PAPI_hl_stop.3, man/man3/PAPI_hw_info_t.3,
man/man3/PAPI_inherit_option_t.3, man/man3/PAPI_ipc.3,
man/man3/PAPI_is_initialized.3, man/man3/PAPI_itimer_option_t.3,
man/man3/PAPI_library_init.3, man/man3/PAPI_list_events.3,
man/man3/PAPI_list_threads.3, man/man3/PAPI_lock.3,
man/man3/PAPI_mh_cache_info_t.3, man/man3/PAPI_mh_info_t.3,
man/man3/PAPI_mh_level_t.3, man/man3/PAPI_mh_tlb_info_t.3,
man/man3/PAPI_mpx_info_t.3, man/man3/PAPI_multiplex_init.3,
man/man3/PAPI_multiplex_option_t.3, man/man3/PAPI_num_cmp_hwctrs.3,
man/man3/PAPI_num_components.3, man/man3/PAPI_num_events.3,
man/man3/PAPI_num_hwctrs.3, man/man3/PAPI_option_t.3,
man/man3/PAPI_overflow.3, man/man3/PAPI_perror.3,
man/man3/PAPI_preload_info_t.3, man/man3/PAPI_profil.3,
man/man3/PAPI_query_event.3, man/man3/PAPI_query_named_event.3,
man/man3/PAPI_rate_stop.3, man/man3/PAPI_read.3,
man/man3/PAPI_read_ts.3, man/man3/PAPI_register_thread.3,
man/man3/PAPI_remove_event.3, man/man3/PAPI_remove_events.3,
man/man3/PAPI_remove_named_event.3, man/man3/PAPI_reset.3,
man/man3/PAPI_set_cmp_domain.3,
man/man3/PAPI_set_cmp_granularity.3, man/man3/PAPI_set_debug.3,
man/man3/PAPI_set_domain.3, man/man3/PAPI_set_granularity.3,
man/man3/PAPI_set_multiplex.3, man/man3/PAPI_set_opt.3,
man/man3/PAPI_set_thr_specific.3, man/man3/PAPI_shlib_info_t.3,
man/man3/PAPI_shutdown.3, man/man3/PAPI_sprofil.3,
man/man3/PAPI_sprofil_t.3, man/man3/PAPI_start.3,
man/man3/PAPI_state.3, man/man3/PAPI_stop.3,
man/man3/PAPI_strerror.3, man/man3/PAPI_thread_id.3,
man/man3/PAPI_thread_init.3, man/man3/PAPI_unlock.3,
man/man3/PAPI_unregister_thread.3, man/man3/PAPI_write.3,
man/man3/PAPIf_hl_read.3, man/man3/PAPIf_hl_region_begin.3,
man/man3/PAPIf_hl_region_end.3, man/man3/PAPIf_hl_stop.3,
man/man3/RateInfo.3, man/man3/binary_tree_t.3,
man/man3/components_t.3, man/man3/local_components_t.3,
man/man3/reads_t.3, man/man3/regions_t.3, man/man3/threads_t.3,
man/man3/value_t.3: sysdetect: regenerate man pages for updated
attributes
* src/papi.c: sysdetect: remove unused attributes from doc
* src/components/sysdetect/tests/query_device_mpi.c: sysdetect: white
space cleanup
2022-11-02 John Rodgers <john.rodgers@hpe.com>
* src/components/cuda/linux-cuda.c: CUDA: Align memory zero with pad
Update logic in `cuda11_makeRoomAllEvents` to ensure the memory
zero'ing operation covers the amount expanded by the `realloc`
operation.
* src/components/cuda/linux-cuda.c: CUDA: CUPTI11 Sporadic Memory
Failures The CUPTI11 portion of the cuda component has exhibited
sporadic memory failures for applications compiled against
MVAPICH's libmpi.so. Specifically, the realloc operation in
`cuda11_makeRoomAllEvents`, called in `_cuda11_add_native_events`,
would fail even when there was sufficient memory to complete the
requested allocation. As a workaround, this patch prevents the
failure by allocating the expected memory up front prior to the
device loop in `_cuda11_add_native_events`.
* src/components/cuda/linux-cuda.c: CUDA: Prevent memory leak
Prevent memory leak by freeing `firstLast` buffer in
`_cuda11_add_native_events`.
* src/components/cuda/linux-cuda.c: CUDA: Remove unnecessary code
Remove logic only necessary when trying to resolve counters without
an active profiling session. Given that a profiling session is
created and active (see: _cuda11_add_native_events ->
_cuda11_init_profiler) creation and usage of
`cuda11_CounterAvailabilityImage` is unnecessary.
* src/components/cuda/linux-cuda.c: CUDA: Prevent component deadlock
Add missing component unlock to `_cuda_update_control_state` to
prevent deadlocks encountered when adding multiple events
sequentially. Patch resolves issue #121
* src/components/cuda/linux-cuda.c, src/components/nvml/linux-nvml.c,
src/components/rocm/rocm.c, src/components/rocm_smi/linux-rocm-
smi.c: DELAY_INIT: Set disabled for delay init comps Ensure
components that leverage the delayed initialization scheme, namely
cuda, nvml, rocm, and rocm_smi, set thier respective <papi-
vector>.cmp_info.disabled flag with `PAPI_EDELAY_INIT` when
completing the standard component initialization. Update necessary
to conform with PR: 328
2022-10-24 Daniel Barry <dbarry@vols.utk.edu>
* src/components/pcp/linux-pcp.c, src/papi.h: pcp: warning instead of
error when 'reason' string truncated When the hostname is too
long, there is not enough memory allocated for the error 'reason'
string. This caused the component to prematurely exit
initialization when the PM daemon is not active. Instead, a warning
is now issued, and the initialization exits appropriately.
Additionally, the size of the 'reason' string has been increased to
accommodate longer host names. These changes have been tested on
the IBM POWER9 architecture.
2022-10-28 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/main.c: cat: support to comment-out
lines in input file These changes add support for users to
comment-out lines in the input file. This allows users to more
flexibly take measurements without having to remove lines or use
multiple input files. These changes have been tested on the AMD
Zen3 architecture.
2022-10-27 Anthony Danalis <adanalis@icl.utk.edu>
* src/validation_tests/branches_testcode.c,
src/validation_tests/papi_br_msp.c: Improved the branch
misprediction validation test. The previous version of the branch
misprediction validation test relied on the libc function random()
to generate entropy. However, this function introduced 15x more
branches than the number of branches in the code of the validation
test, polluting the results. The new code uses an inline Xorshift
pseudo-random number generator which is more than sufficient to
confuse the branch predictor, and does not contain any branch
instructions so it does not pollute the event measurement. Also,
the logic of the test has been simplified.
2022-10-27 Anthony <adanalis@icl.utk.edu>
* src/sde_lib/sde_lib_datastructures.c: Removed unneeded NULL pointer
checks in libsde.
2022-09-14 Giuseppe Congiu <gcongiu@icl.utk.edu>
* man/man1/PAPI_derived_event_files.1, man/man1/papi_avail.1,
man/man1/papi_clockres.1, man/man1/papi_command_line.1,
man/man1/papi_component_avail.1, man/man1/papi_cost.1,
man/man1/papi_decode.1, man/man1/papi_error_codes.1,
man/man1/papi_event_chooser.1, man/man1/papi_hardware_avail.1,
man/man1/papi_hybrid_native_avail.1, man/man1/papi_mem_info.1,
man/man1/papi_multiplex_cost.1, man/man1/papi_native_avail.1,
man/man1/papi_version.1, man/man1/papi_xml_event_info.1,
man/man3/PAPIF_accum.3, man/man3/PAPIF_add_event.3,
man/man3/PAPIF_add_events.3, man/man3/PAPIF_add_named_event.3,
man/man3/PAPIF_assign_eventset_component.3,
man/man3/PAPIF_cleanup_eventset.3,
man/man3/PAPIF_create_eventset.3,
man/man3/PAPIF_destroy_eventset.3, man/man3/PAPIF_enum_event.3,
man/man3/PAPIF_epc.3, man/man3/PAPIF_event_code_to_name.3,
man/man3/PAPIF_event_name_to_code.3, man/man3/PAPIF_flips_rate.3,
man/man3/PAPIF_flops_rate.3, man/man3/PAPIF_get_clockrate.3,
man/man3/PAPIF_get_dmem_info.3, man/man3/PAPIF_get_domain.3,
man/man3/PAPIF_get_event_info.3, man/man3/PAPIF_get_exe_info.3,
man/man3/PAPIF_get_granularity.3,
man/man3/PAPIF_get_hardware_info.3, man/man3/PAPIF_get_multiplex.3,
man/man3/PAPIF_get_preload.3, man/man3/PAPIF_get_real_cyc.3,
man/man3/PAPIF_get_real_nsec.3, man/man3/PAPIF_get_real_usec.3,
man/man3/PAPIF_get_virt_cyc.3, man/man3/PAPIF_get_virt_usec.3,
man/man3/PAPIF_ipc.3, man/man3/PAPIF_is_initialized.3,
man/man3/PAPIF_library_init.3, man/man3/PAPIF_lock.3,
man/man3/PAPIF_multiplex_init.3, man/man3/PAPIF_num_cmp_hwctrs.3,
man/man3/PAPIF_num_events.3, man/man3/PAPIF_num_hwctrs.3,
man/man3/PAPIF_perror.3, man/man3/PAPIF_query_event.3,
man/man3/PAPIF_query_named_event.3, man/man3/PAPIF_rate_stop.3,
man/man3/PAPIF_read.3, man/man3/PAPIF_read_ts.3,
man/man3/PAPIF_register_thread.3, man/man3/PAPIF_remove_event.3,
man/man3/PAPIF_remove_events.3,
man/man3/PAPIF_remove_named_event.3, man/man3/PAPIF_reset.3,
man/man3/PAPIF_set_cmp_domain.3,
man/man3/PAPIF_set_cmp_granularity.3, man/man3/PAPIF_set_debug.3,
man/man3/PAPIF_set_domain.3, man/man3/PAPIF_set_event_domain.3,
man/man3/PAPIF_set_granularity.3, man/man3/PAPIF_set_inherit.3,
man/man3/PAPIF_set_multiplex.3, man/man3/PAPIF_shutdown.3,
man/man3/PAPIF_start.3, man/man3/PAPIF_state.3,
man/man3/PAPIF_stop.3, man/man3/PAPIF_thread_id.3,
man/man3/PAPIF_thread_init.3, man/man3/PAPIF_unlock.3,
man/man3/PAPIF_unregister_thread.3, man/man3/PAPIF_write.3,
man/man3/PAPI_accum.3, man/man3/PAPI_add_event.3,
man/man3/PAPI_add_events.3, man/man3/PAPI_add_named_event.3,
man/man3/PAPI_addr_range_option_t.3, man/man3/PAPI_address_map_t.3,
man/man3/PAPI_all_thr_spec_t.3,
man/man3/PAPI_assign_eventset_component.3, man/man3/PAPI_attach.3,
man/man3/PAPI_attach_option_t.3, man/man3/PAPI_cleanup_eventset.3,
man/man3/PAPI_component_info_t.3, man/man3/PAPI_cpu_option_t.3,
man/man3/PAPI_create_eventset.3, man/man3/PAPI_debug_option_t.3,
man/man3/PAPI_destroy_eventset.3, man/man3/PAPI_detach.3,
man/man3/PAPI_disable_component.3,
man/man3/PAPI_disable_component_by_name.3,
man/man3/PAPI_dmem_info_t.3, man/man3/PAPI_domain_option_t.3,
man/man3/PAPI_enum_cmp_event.3, man/man3/PAPI_enum_dev_type.3,
man/man3/PAPI_enum_event.3, man/man3/PAPI_epc.3,
man/man3/PAPI_event_code_to_name.3, man/man3/PAPI_event_info_t.3,
man/man3/PAPI_event_name_to_code.3, man/man3/PAPI_exe_info_t.3,
man/man3/PAPI_flips_rate.3, man/man3/PAPI_flops_rate.3,
man/man3/PAPI_get_cmp_opt.3, man/man3/PAPI_get_component_index.3,
man/man3/PAPI_get_component_info.3, man/man3/PAPI_get_dev_attr.3,
man/man3/PAPI_get_dev_type_attr.3, man/man3/PAPI_get_dmem_info.3,
man/man3/PAPI_get_event_component.3,
man/man3/PAPI_get_event_info.3,
man/man3/PAPI_get_eventset_component.3,
man/man3/PAPI_get_executable_info.3,
man/man3/PAPI_get_hardware_info.3, man/man3/PAPI_get_multiplex.3,
man/man3/PAPI_get_opt.3, man/man3/PAPI_get_overflow_event_index.3,
man/man3/PAPI_get_real_cyc.3, man/man3/PAPI_get_real_nsec.3,
man/man3/PAPI_get_real_usec.3, man/man3/PAPI_get_shared_lib_info.3,
man/man3/PAPI_get_thr_specific.3, man/man3/PAPI_get_virt_cyc.3,
man/man3/PAPI_get_virt_nsec.3, man/man3/PAPI_get_virt_usec.3,
man/man3/PAPI_granularity_option_t.3, man/man3/PAPI_hl_read.3,
man/man3/PAPI_hl_region_begin.3, man/man3/PAPI_hl_region_end.3,
man/man3/PAPI_hl_stop.3, man/man3/PAPI_hw_info_t.3,
man/man3/PAPI_inherit_option_t.3, man/man3/PAPI_ipc.3,
man/man3/PAPI_is_initialized.3, man/man3/PAPI_itimer_option_t.3,
man/man3/PAPI_library_init.3, man/man3/PAPI_list_events.3,
man/man3/PAPI_list_threads.3, man/man3/PAPI_lock.3,
man/man3/PAPI_mh_cache_info_t.3, man/man3/PAPI_mh_info_t.3,
man/man3/PAPI_mh_level_t.3, man/man3/PAPI_mh_tlb_info_t.3,
man/man3/PAPI_mpx_info_t.3, man/man3/PAPI_multiplex_init.3,
man/man3/PAPI_multiplex_option_t.3, man/man3/PAPI_num_cmp_hwctrs.3,
man/man3/PAPI_num_components.3, man/man3/PAPI_num_events.3,
man/man3/PAPI_num_hwctrs.3, man/man3/PAPI_option_t.3,
man/man3/PAPI_overflow.3, man/man3/PAPI_perror.3,
man/man3/PAPI_preload_info_t.3, man/man3/PAPI_profil.3,
man/man3/PAPI_query_event.3, man/man3/PAPI_query_named_event.3,
man/man3/PAPI_rate_stop.3, man/man3/PAPI_read.3,
man/man3/PAPI_read_ts.3, man/man3/PAPI_register_thread.3,
man/man3/PAPI_remove_event.3, man/man3/PAPI_remove_events.3,
man/man3/PAPI_remove_named_event.3, man/man3/PAPI_reset.3,
man/man3/PAPI_set_cmp_domain.3,
man/man3/PAPI_set_cmp_granularity.3, man/man3/PAPI_set_debug.3,
man/man3/PAPI_set_domain.3, man/man3/PAPI_set_granularity.3,
man/man3/PAPI_set_multiplex.3, man/man3/PAPI_set_opt.3,
man/man3/PAPI_set_thr_specific.3, man/man3/PAPI_shlib_info_t.3,
man/man3/PAPI_shutdown.3, man/man3/PAPI_sprofil.3,
man/man3/PAPI_sprofil_t.3, man/man3/PAPI_start.3,
man/man3/PAPI_state.3, man/man3/PAPI_stop.3,
man/man3/PAPI_strerror.3, man/man3/PAPI_thread_id.3,
man/man3/PAPI_thread_init.3, man/man3/PAPI_unlock.3,
man/man3/PAPI_unregister_thread.3, man/man3/PAPI_write.3,
man/man3/PAPIf_hl_read.3, man/man3/PAPIf_hl_region_begin.3,
man/man3/PAPIf_hl_region_end.3, man/man3/PAPIf_hl_stop.3,
man/man3/RateInfo.3, man/man3/binary_tree_t.3,
man/man3/components_t.3, man/man3/local_components_t.3,
man/man3/reads_t.3, man/man3/regions_t.3, man/man3/threads_t.3,
man/man3/value_t.3: doc: regenerate man pages
2022-10-25 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/utils/papi_hardware_avail.c: papi_hardware_avail: print thread
affinity list for numas
2022-10-26 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/sysdetect/tests/query_device_mpi.c: sysdetect: add
GPU affinity example in tests The GPU affinity example utilizes
MPI shared memory windows to workout the GPU affinity of every MPI
rank. The first rank in every GPU rank list prints the list of rank
for the give GPU.
* src/components/Makefile_comp_tests.target.in,
src/components/sysdetect/tests/Makefile: sysdetect: hook mpi tests
to NO_MPI_TESTS The configure step in PAPI checks whether MPI
tests can be enabled or not. If not it sets NO_MPI_TESTS to yes.
This variable is then used in ctests/Makefile.recipies to enable or
disable MPI tests. The sysdetect tests were not relying on this
variable. Instead sysdetect relied on MPICC being set which is no
accurate. This patch make the MPI checks more uniform across the
code by adding NO_MPI_TESTS checks in sysdetect tests too.
2022-10-25 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/sysdetect/sysdetect.c: sysdetect: add
PAPI_DEV_ATTR__CPU_UINT_THR_NUMA_AFFINITY
PAPI_DEV_ATTR__CPU_UINT_THR_NUMA_AFFINITY was missing in sysdetect.
This attribute can be used to discover the numa affinity of every
hardware thread in the system.
* src/components/sysdetect/Rules.sysdetect,
src/components/sysdetect/amd_gpu.c,
src/components/sysdetect/nvidia_gpu.c,
src/components/sysdetect/shm.c, src/components/sysdetect/shm.h,
src/components/sysdetect/sysdetect.c,
src/components/sysdetect/sysdetect.h,
src/components/sysdetect/tests/query_device_mpi.c,
.../sysdetect/tests/query_device_simple.c,
.../sysdetect/tests/query_device_simple_f.F, src/configure,
src/configure.in, src/genpapifdef.c, src/papi.h,
src/papi_fwrappers.c, src/utils/papi_hardware_avail.c: sysdetect:
remove builtin support for numa and GPU affinity Numa and GPU
affinity of threads and MPI ranks adds an MPI dependency to PAPI
that may cause problems (link time unresolved MPI symbols) if the
application using PAPI does not link against MPI. Most of the work
that sysdetect currently does to provide affinity lists to the
users can be easily done by the users themselves. Thus, sysdetect
will no longer support them.
2022-10-12 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/Makefile,
src/counter_analysis_toolkit/vec.c: cat: ifdefs for AVX
availability Utilize ifdefs so that the build can be more flexible
between systems with different AVX vector-width availability.
* src/counter_analysis_toolkit/Makefile,
src/counter_analysis_toolkit/vec_arch.h,
src/counter_analysis_toolkit/vec_fma_dp.c,
src/counter_analysis_toolkit/vec_fma_hp.c,
src/counter_analysis_toolkit/vec_fma_sp.c,
src/counter_analysis_toolkit/vec_nonfma_dp.c,
src/counter_analysis_toolkit/vec_nonfma_hp.c,
src/counter_analysis_toolkit/vec_nonfma_sp.c: cat: specify
architecture in macros Rename VEC_WIDTH_[128|256|512] to
X86_VEC_WIDTH_[128|256|512]B to be more specific.
* src/counter_analysis_toolkit/vec_arch.h: cat: remove unused
typedef; add used typedef Typedef 'half' since this type is
actually used in the code, and remove HP_SCALAR_TYPE.
2022-10-11 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/Makefile,
src/counter_analysis_toolkit/vec.c,
src/counter_analysis_toolkit/vec_arch.h,
src/counter_analysis_toolkit/vec_fma_dp.c,
src/counter_analysis_toolkit/vec_fma_hp.c,
src/counter_analysis_toolkit/vec_fma_sp.c,
src/counter_analysis_toolkit/vec_nonfma_dp.c,
src/counter_analysis_toolkit/vec_nonfma_hp.c,
src/counter_analysis_toolkit/vec_nonfma_sp.c: cat: rename macros
for POWER architecture For the sake of consistency, use "POWER"
instead of "IBM."
* src/counter_analysis_toolkit/vec_fma_dp.c,
src/counter_analysis_toolkit/vec_fma_sp.c,
src/counter_analysis_toolkit/vec_scalar_verify.c: cat: remove
unused code Remove unused AMD Bulldozer intrinsics.
2022-09-19 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/Makefile,
src/counter_analysis_toolkit/vec.c,
src/counter_analysis_toolkit/vec_arch.h: cat: consolidate 'INTEL'
and 'AMD' flags for vector FLOPs benchmark Since the ifdefs which
check whether "INTEL" is defined also check whether "AMD" is
defined, use "X86" for both. These changes have been tested on the
Intel Ice Lake architecture.
* src/counter_analysis_toolkit/vec.c,
src/counter_analysis_toolkit/vec_arch.h,
src/counter_analysis_toolkit/vec_fma_dp.c,
src/counter_analysis_toolkit/vec_fma_hp.c,
src/counter_analysis_toolkit/vec_fma_sp.c,
src/counter_analysis_toolkit/vec_nonfma_dp.c,
src/counter_analysis_toolkit/vec_nonfma_hp.c,
src/counter_analysis_toolkit/vec_nonfma_sp.c: cat: specify
architecture vector FLOPs benchmark function names Include the
architecture names in the function names for consistency. These
changes have been tested on the IBM POWER9 architecture.
2022-09-07 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/vec.c,
src/counter_analysis_toolkit/vec_arch.h,
src/counter_analysis_toolkit/vec_fma_dp.c,
src/counter_analysis_toolkit/vec_fma_hp.c,
src/counter_analysis_toolkit/vec_fma_sp.c,
src/counter_analysis_toolkit/vec_nonfma_dp.c,
src/counter_analysis_toolkit/vec_nonfma_hp.c,
src/counter_analysis_toolkit/vec_nonfma_sp.c: cat: vector FLOPs
benchmark for non-x86 architectures bug fix The driver code for
the vector benchmark could not call the functions for the vector
FLOPs kernels because they were declared 'static'. For builds which
use either the NEON or ALTIVEC intrinsics, these static functions
are now wrapped, so they can be called by the driver. These
changes have been tested on the IBM POWER9 architecture.
2022-10-13 Daniel Barry <dbarry@vols.utk.edu>
* src/components/powercap/tests/Makefile,
.../powercap/tests/powercap_basic_read.c,
.../powercap/tests/powercap_basic_readwrite.c: powercap: add new
component tests This adds a new component test for each of the
following: (1) add one event to an event set at a time and read it
(2) add one event at a time, read it, write it, read the new value,
restore the original value These changes have been tested on the
Intel Ice Lake architecture.
2022-10-26 AnustuvICL <anustuv@icl.utk.edu>
* src/components/perf_event/pe_libpfm4_events.c: perf_event: Free
allocated string in function allocate_native_event
2022-10-18 Peinan Zhang <peinan.zhang@intel.com>
* src/components/intel_gpu/README,
src/components/intel_gpu/README.md,
.../intel_gpu/internal/inc/GPUMetricHandler.h,
.../intel_gpu/internal/inc/GPUMetricInterface.h,
.../intel_gpu/internal/src/GPUMetricHandler.cpp,
.../intel_gpu/internal/src/GPUMetricInterface.cpp,
src/components/intel_gpu/internal/src/Makefile,
src/components/intel_gpu/linux_intel_gpu_metrics.c,
src/components/intel_gpu/linux_intel_gpu_metrics.h,
src/components/intel_gpu/tests/Makefile,
src/components/intel_gpu/tests/gemm.spv,
src/components/intel_gpu/tests/gpu_common_utils.c,
src/components/intel_gpu/tests/gpu_common_utils.h,
src/components/intel_gpu/tests/gpu_metric_list.c,
src/components/intel_gpu/tests/gpu_metric_read.c,
src/components/intel_gpu/tests/gpu_query_gemm.cc,
src/components/intel_gpu/tests/gpu_thread_read.c,
src/components/intel_gpu/tests/readme.txt: Added support for
multiple Intel GPU devices and multiple-tiles per device. Allow
query performance metrics on multiple Intel GPUs and multiple tiles
per GPU. Support Intel GPU Arctic Sound and Ponte Sound. Update
test cases for taking metrics from input, so to work with different
platforms. Update conponent README.md file
2022-10-24 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/utils/papi_native_avail.c: sde: make '-sde' option always
visible in papi_native_avail The '-sde' option was not visible in
papi_native_avail unless the SDE component was configured in PAPI.
Now we always have the option visible but return an error if the
SDE component is not configured.
2022-10-25 Anthony <adanalis@icl.utk.edu>
* src/configure, src/configure.in: Make papi_native_avail support the
"-sde" flag only if *both* libsde and the SDE component are
configured in.
* src/components/sde/tests/Makefile,
src/components/sde/tests/README.txt: Added path to libpfm4 in the
SDE tests Makefile, and further instructions for users in the
README.
2022-10-24 AnustuvICL <anustuv@icl.utk.edu>
* src/papi.h: papi.h: Update bit field post removal of members from
struct _papi_component_option
2022-10-23 William Cohen <wcohen@redhat.com>
* src/components/sysdetect/linux_cpu_utils.c, src/linux-memory.c: Use
fgets in place of fscanf functions to avoid possible buffer
overflows There were several locations in the PAPI code that used
fscanf calls like the following statement to read in information:
result=fscanf(fff,"%s",allocation_policy_string); The problem with
this statement is that the fscanf could possibly write past the end
of allocation_policy_string. To limit the write to the size of the
allocation_policy_string an fgets like the following is used in its
place: str_result=fgets(allocation_policy_string, BUFSIZ, fff);
One set of fscanf were for the generic memory information code
reading the cache characteristics. Another fscanf was in the
sysdetect component reading of cache characteristics.
2022-10-10 AnustuvICL <anustuv@icl.utk.edu>
* src/genpapifdef.c, src/papi.h: Remove C++ style commented code
2022-08-31 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/perfctr/perfctr.c: perfctr: set disabled flag in cmp
* src/components/perfmon2/perfmon.c: perfmon2: set disabled flag in
cmp
* src/papi_internal.c: papi: do not set disabled flag in framework
* src/components/vmware/vmware.c: vmware: set disabled flag in cmp
* src/components/stealtime/linux-stealtime.c: stealtime: set disabled
flag in cmp
* src/components/sensors_ppc/linux-sensors-ppc.c: sensors_ppc: set
disabled flag in cmp
* src/components/rapl/linux-rapl.c: rapl: set disabled flag in cmp
* src/components/powercap_ppc/linux-powercap-ppc.c: powercap_ppc: set
disabled flag in cmp
* src/components/powercap/linux-powercap.c: powercap: set disabled
flag in cmp
* src/components/perf_event_uncore/perf_event_uncore.c: perf_event_u:
set disabled flag in cmp
* src/components/perf_event/perf_event.c: perf_event: set disabled
flag in cmp
* src/components/pcp/linux-pcp.c: pcp: set disabled flag in cmp
* src/components/net/linux-net.c: net: set disabled flag in cmp
* src/components/mx/linux-mx.c: mx: set disabled flag in cmp
* src/components/lustre/linux-lustre.c: lustre: set disabled flag in
cmp
* src/components/lmsensors/linux-lmsensors.c: lmsensors: set disabled
flag in cmp
* src/components/libmsr/linux-libmsr.c: libmsr: set disabled flag in
cmp
* src/components/io/linux-io.c: io: set disabled flag in cmp
* src/components/intel_gpu/linux_intel_gpu_metrics.c: intel_gpu: set
disabled flag in cmp
* src/components/infiniband/linux-infiniband.c: infiniband: set
disabled flag in cmp
* src/components/micpower/linux-micpower.c: micpower: set disabled
flag in cmp
* src/components/host_micpower/linux-host_micpower.c: host_micpower:
set disabled flag in cmp
* src/components/example/example.c: example: set disabled flag in cmp
* src/components/coretemp_freebsd/coretemp_freebsd.c:
coretemp_freebsd: set disabled flag in cmp
* src/components/coretemp/linux-coretemp.c: coretemp: set disabled
flag in cmp
* src/components/appio/appio.c: appio: set disabled flag in cmp
2022-10-18 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/rocm/rocm.c: rocm: return PAPI_ENOEVNT if event not
found
2022-10-17 Giuseppe Congiu <gcongiu@icl.utk.edu>
* .../rocm/tests/intercept_single_kernel_monitoring.cpp,
.../rocm/tests/intercept_single_thread_monitoring.cpp,
src/components/rocm/tests/multi_kernel_monitoring.cpp,
src/components/rocm/tests/multi_thread_monitoring.cpp,
.../rocm/tests/sample_single_kernel_monitoring.cpp,
src/components/rocm/tests/single_thread_monitoring.cpp: rocm:
SQ_WAVES does not reflect logical waves SQ_WAVES counts the number
of logical waves, plus the waves that are restored due to context
switching. This patch computes the logical number of waves as
SQ_WAVES - SQ_WAVES_RESTORED. For those architectures that do not
support SQ_WAVES_RESTORED (preceeding MI200) the tests return with
a warning and the number of waves check is ignored.
2022-10-20 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/main.c: cat: fix memory leak from
hw_desc alloc Free the dynamically allocated memory used by the
hardware description feature of CAT. These changes have been
tested on the Intel Westmere EP architecture.
2022-10-19 Anthony <adanalis@icl.utk.edu>
* src/Makefile.in, src/Makefile.inc,
src/components/Makefile_comp_tests.target.in,
src/components/sde/tests/Makefile, src/configure, src/configure.in,
src/sde_lib/Makefile: Make static libsde.a optional. We build the
static sde library 'libsde.a' only if libpapi.a is also built,
based on the configure flags provided by the user (i.e., --with-
static-lib). Also, the linking of the relevant tests and utilities
depends on the existence or not of the static sde library.
2022-10-20 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/hw_desc.h,
src/counter_analysis_toolkit/main.c: cat: define default number of
OMP threads Using the PAPI_hw_info_t structure, define the default
number of threads as the number of CPUs per socket. These changes
have been tested on the Intel Westmere EP architecture.
2022-10-18 William Cohen <wcohen@redhat.com>
* src/components/sysdetect/tests/query_device_simple_f.F: Removed
unused label and variable from query_device_simple_f.F Clean up
query_device_simple_f.F to eliminate the following warnings:
query_device_simple_f.F:142:12: 142 | 10 format(9I5) |
1 Warning: Label 10 at (1) defined but not used [-Wunused-label]
query_device_simple_f.F:7:41: 7 | integer :: i, j,
ret_val, error, handle, modifier, id, vendor_id |
1 Warning: Unused variable 'error' declared at (1) [-Wunused-
variable]
* src/papi_preset.c: Correctly size papi_preset.c array to avoid
possible overflow Uped the work array size to avoid the following
warnings: papi_preset.c: In function 'update_ops_string':
papi_preset.c:336:50: warning: '%d' directive writing between 1 and
11 bytes into a region of size 9 [-Wformat-overflow=] 336 |
sprintf (work, "N%d", cur_index-1); |
^~ papi_preset.c:336:48: note: directive argument in the range
[-2147483648, 2147483646] 336 |
sprintf (work, "N%d", cur_index-1); |
^~~~~ In file included from /usr/include/stdio.h:906, from
papi_debug.h:23, from papi_internal.h:24, from papi_preset.c:18: In
function 'sprintf', inlined from 'update_ops_string' at
papi_preset.c:336:5: /usr/include/bits/stdio2.h:30:10: note:
'__builtin___sprintf_chk' output between 3 and 13 bytes into a
destination of size 10 30 | return __builtin___sprintf_chk (__s,
__USE_FORTIFY_LEVEL - 1, |
^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 31 |
__glibc_objsize (__s), __fmt, |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 32 |
__va_arg_pack ()); |
~~~~~~~~~~~~~~~~~
2022-10-13 Daniel Barry <dbarry@vols.utk.edu>
* src/components/powercap/tests/powercap_basic.c: powercap: fix
memory leak in test The component test 'powercap_basic' now frees
the dynamically allocated memory used to store counter readings.
These changes have been tested on the Intel Cascade Lake
architecture.
Sat Oct 1 23:04:01 2022 -0700 Stephane Eranian <eranian@gmail.com>
* src/libpfm4/lib/events/intel_icl_events.h,
src/libpfm4/lib/events/intel_skl_events.h,
src/libpfm4/lib/events/intel_spr_events.h,
src/libpfm4/lib/pfmlib_amd64.c: libpfm4: update to commit 8aaaf17
Original commits: commit 8aaaf1747e96031a47ed6bd9337ff61a21f8cc64
add missing break in amd64_get_revision() Fixed bug introduced by:
commit 79031f76f8a1 ("fix amd_get_revision() to identify AMD Zen3
uniquely") Must have a break statment for AMD Zen3 (model 1) to
avoid errors later. Reported-by: Steve Kaufmann
<steve.kaufmann@hpe.com> commit
bc4233d35418788423e8442395c7920eb156589d update Intel Skylake
event table Based on download.01.or version 1.28. commit
4c0bc1c8ae06abd5f876657888b88aaf9c9530e6 Fix typos in Intel
Icelake event table Based on download.01.org version 1.16.
commit b6f86fb0d8eae38d65d4394e3ed82f528b10bebf Update Intel
SapphireRapid event table Based on download.01.org release 1.06.
Minor changes to ASSITS and DECODE events. Untested
2022-10-13 Daniel Barry <dbarry@vols.utk.edu>
* src/components/powercap/tests/powercap_basic.c: powercap: ensure
proper string format in test Ensure that the proper string is
null-terminated.
2022-10-11 Daniel Barry <dbarry@vols.utk.edu>
* src/components/powercap/linux-powercap.c,
src/components/powercap/tests/Makefile,
src/components/powercap/tests/powercap_basic.c: powercap: fix
formatting Replace tabs with appropriate amounts of spaces. These
changes have been tested on the Intel Cascade Lake architecture.
2022-10-10 Daniel Barry <dbarry@vols.utk.edu>
* src/components/powercap/tests/powercap_basic.c: powercap: fix
compiler warnings for component test powercap_basic The warnings
for the powercap component test can also be squelched by replacing
sizeof() with the actual buffer sizes. These changes have been
tested on the Intel Cascade Lake architecture.
* src/components/powercap/linux-powercap.c: powercap: fix compiler
warnings for component The warnings for the powercap component can
be squelched by replacing sizeof() with the actual size of the
destination buffer. These changes have been tested on the Intel
Cascade Lake architecture.
2022-10-10 AnustuvICL <anustuv@icl.utk.edu>
* src/aix.c, src/components/bgpm/IOunit/linux-IOunit.c,
src/components/bgpm/L2unit/linux-L2unit.c,
src/components/perf_event/perf_event.c,
src/components/perf_event/perf_helpers.h,
src/components/perfctr/perfctr.c,
src/components/perfmon2/perfmon.c, src/components/perfmon_ia64
/perfmon-ia64.c, src/components/perfnec/perfmon.c,
src/components/rocm/rocm.c, src/components/sde/sde.c,
src/ctests/attach2.c, src/ctests/attach3.c,
src/ctests/attach_validate.c, src/ctests/byte_profile.c,
src/ctests/data_range.c, src/ctests/earprofile.c,
src/ctests/prof_utils.c, src/ctests/prof_utils.h,
src/ctests/profile.c, src/ctests/profile_pthreads.c,
src/ctests/profile_twoevents.c, src/ctests/sprofile.c,
src/examples/PAPI_profil.c, src/examples/sprofile.c, src/extras.c,
src/extras.h, src/linux-bgp.c, src/linux-bgq.c, src/linux-
context.h, src/linux-memory.c, src/papi.c, src/papi.h,
src/papi_fwrappers.c, src/papi_internal.h, src/papivi.h, src
/solaris-common.c, src/solaris-common.h, src/solaris-niagara2.c,
src/solaris-ultra.c, src/solaris-ultra.h: Refactor caddr_t to void*
vptr_t
2022-10-11 Anthony <adanalis@icl.utk.edu>
* src/counter_analysis_toolkit/params.h: Missing file that should
have been included in PR 349 (commit 89c0f19).
2022-10-12 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/papi_fwrappers.c: sysdetect: fix warning in papi_fwrappers.c
papi_fwrappers.c is used to generate multiple wrapper versions for
fortran. Because of a global variable not declared static the
different versions cause a redefinition of the symbols when used
with recent versions of the gcc compiler (as the compiler does link
time optimizations). Declaring the variable static should fix the
problem.
2022-09-07 Daniel Barry <dbarry@vols.utk.edu>
* src/counter_analysis_toolkit/Makefile,
src/counter_analysis_toolkit/driver.h,
src/counter_analysis_toolkit/main.c: cat: add MPI support Add MPI
support to accelerate the collection of event data. This works by
splitting up the list of events to be monitored among the MPI
ranks. These changes have been tested on the IBM POWER9
architecture.
2022-09-28 Anthony <adanalis@icl.utk.edu>
* src/counter_analysis_toolkit/scripts/README.txt,
src/counter_analysis_toolkit/scripts/default.gnp,
.../scripts/multi_plot.gnp, .../scripts/process_dcache_output.sh,
.../L2_RQSTS:ALL_DEMAND_REFERENCES.data.reads.stat,
.../L2_RQSTS:DEMAND_DATA_RD_HIT.data.reads.stat,
.../L2_RQSTS:DEMAND_DATA_RD_MISS.data.reads.stat,
.../scripts/single_plot.gnp: Scripts and sample data for viewing
CAT's dcache output.
2022-09-21 Anthony <adanalis@icl.utk.edu>
* src/counter_analysis_toolkit/driver.h,
src/counter_analysis_toolkit/main.c: Removed redundant latency
step.
* src/counter_analysis_toolkit/main.c: Added support for "-quick"
flag which skips the latency tests.
* src/counter_analysis_toolkit/eventstock.c: Force the CPU component
to initialize itself.
* src/counter_analysis_toolkit/branch.c,
src/counter_analysis_toolkit/branch.h,
src/counter_analysis_toolkit/dcache.c,
src/counter_analysis_toolkit/dcache.h,
src/counter_analysis_toolkit/driver.h,
src/counter_analysis_toolkit/icache.c,
src/counter_analysis_toolkit/main.c: Cleaned up the way we handle
the parameters specified via the command line arguments.
2022-09-04 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/sysdetect/tests/Makefile,
.../sysdetect/tests/query_device_simple_f.F, src/genpapifdef.c,
src/papi_fwrappers.c: sysdetect: add fortran bindings and test Add
fortran bindings for PAPI sysdetect interface and tests.
2022-09-13 Daniel Barry <dbarry@vols.utk.edu>
* src/components/powercap/linux-powercap.c: powercap: fix wrap-around
arithmetic When the energy counters reach the maximum value (given
by '/sys/class/powercap/intel-rapl*/max_energy_range_uj'), they
wrap around to zero. There is arithmetic in the powercap component
to account for this case, but it previously used the maximum value
for an unsigned int, which is not necessarily the value given by
'max_energy_range_uj'. Thus, the arithmetic has been modified to
now use the values given in the appropriate 'max_energy_range_uj'
files. These changes have been tested on the Intel Cascade Lake
architecture.
2022-10-07 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/infiniband/linux-infiniband.c: infiniband: fix
warning in snprintf Instead of using FILENAME_MAX as the length of
the string to be copied over to ev_file use the sum of the
substrings and account for the extra '/'.
2022-08-31 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/perfmon2/perfmon.c: perfmon2: funnel init_component
failures init_component failures are handled locally to the
failure. Instead, funnel all error handling code paths through a
single exit point. This makes the code more robust to bugs and also
makes it easier to read.
* src/components/perfctr/perfctr.c: perfctr: funnel init_component
failures init_component failures are handled locally to the
failure. Instead, funnel all error handling code paths through a
single exit point. This makes the code more robust to bugs and also
makes it easier to read.
2022-08-28 Giuseppe Congiu <gcongiu@icl.utk.edu>
* src/components/host_micpower/linux-host_micpower.c: host_micpower:
funnel PAPI_ENOMEM through fn_fail untested due to lack of
hardware
* src/components/host_micpower/linux-host_micpower.c: host_micpower:
rework error handling in init_component
* src/components/host_micpower/linux-host_micpower.c: host_micpower:
delete empty line
* src/components/host_micpower/linux-host_micpower.c: host_micpower:
add fn_exit point
* src/components/host_micpower/linux-host_micpower.c: host_micpower:
rename disable_me to fn_fail
* src/components/vmware/vmware.c: vmware: funnel init_component
failures init_component failures are handled locally to the
failure. Instead, funnel all error handling code paths through a
single exit point. This makes the code more robust to bugs and also
makes it easier to read.
* src/components/stealtime/linux-stealtime.c: stealtime: funnel
init_component failures init_component failures are handled
locally to the failure. Instead, funnel all error handling code
paths through a single exit point. This makes the code more robust
to bugs and also makes it easier to read.
* src/components/sensors_ppc/linux-sensors-ppc.c: sensors_ppc: funnel
init_component failures init_component failures are handled
locally to the failure. Instead, funnel all error handling code
paths through a single exit point. This makes the code more robust
to bugs and also makes it easier to read.
* src/components/rapl/linux-rapl.c: rapl: funnel init_component
failures init_component failures are handled locally to the
failure. Instead, funnel all error handling code paths through a
single exit point. This makes the code more robust to bugs and also
makes it easier to read.
* src/components/powercap_ppc/linux-powercap-ppc.c: powercap_ppc:
funnel init_component failures init_component failures are handled
locally to the failure. Instead, funnel all error handling code
paths through a single exit point. This makes the code more robust
to bugs and also makes it easier to read.
* src/components/powercap/linux-powercap.c: powercap: funnel
init_component failures init_component failures are handled
locally to the failure. Instead, funnel all error handling code
paths through a single exit point. This makes the code more robust
to bugs and also makes it easier to read.
* src/components/pcp/linux-pcp.c: pcp: funnel init_component failures
init_component failures are handled locally to the failure.
Instead, funnel all error handling code paths through a single exit
point. This makes the code more robust to bugs and also makes it
easier to read.
* src/components/pcp/linux-pcp.c: pcp: return PAPI_ECMP on error
instead of ctxHandle
* src/components/net/linux-net.c: net: return PAPI_ECMP on error
instead of num_events
* src/components/net/linux-net.c: net: funnel init_component failures