Refuce f16 #261

CongMa13 · 2024-08-07T17:51:35Z

Support f16/f16, bf16/bf16, f16/f32, bf16/f32 reduction
Support independent C and D
Add another CK instance with InSrcVectorDim==1 which supports far right dim
Support reducing a tensor to a scalar
Support permutation of output of reduction
Fix bugs

Add F16 anf bf16 instances since CK supports them now.

test/03_reduction/reduction_resource.cpp

test/03_reduction/reduction_test.cpp

…ht dim

Test cases that beta != 0 cannot pass for a CK bug. Will add the test cases back when the CK fix is merged into amd_master

library/src/hiptensor.cpp

library/src/reduction/reduction_solution_impl.hpp

Tensor C and D can point to the same tensor or two distinct tensors

library/src/hiptensor.cpp

library/src/reduction/hiptensor_reduction.cpp

library/src/reduction/reduction_solution_impl.hpp

test/03_reduction/reduction_test.hpp

test/03_reduction/reduction_test.cpp

library/src/reduction/hiptensor_reduction.cpp

test/03_reduction/reduction_resource.cpp

test/03_reduction/reduction_test.cpp

library/src/reduction/reduction_cpu_reference_instances.cpp

library/src/reduction/reduction_solution_5_4_f16_f32_instance.cpp

library/src/reduction/reduction_meta_traits.hpp

test/03_reduction/configs/rank1_test_params.yaml

cgmillette

LGTM pending CI

- Support f16/f16, bf16/bf16, f16/f32, bf16/f32 reduction - Use f32 as compute type for f16/f16, bf16/bf16 - Support independent C and D - Add another CK instance with InSrcVectorDim==1 which supports far right dim - Support reducing a tensor to a scalar - Support permutation of output of reduction - Rename label [Reduced Dims] to [Output Dims] in reduction test config file - Commnet out all test cases that beta != 0. Will add the test cases back when the CK fix is merged into amd_master - Fix bugs

CongMa13 added 7 commits August 1, 2024 22:49

Add f16 and bf16 instance files

d5d09a6

Add F16 anf bf16 instances since CK supports them now.

f16 test

de56fa9

Reset input B of reduce tests

f2d5738

Fixed bug in reduction_resource

92e1cdf

fixed a bug of fillRand in reduction_resource

fb0e5b6

remove reduce all tests

58d53a8

use f32 as compute type

ea55ca1

CongMa13 force-pushed the refuce_f16 branch from 1908ead to ea55ca1 Compare August 8, 2024 21:42

CongMa13 commented Aug 8, 2024

View reviewed changes

test/03_reduction/reduction_resource.cpp Outdated Show resolved Hide resolved

CongMa13 commented Aug 8, 2024

View reviewed changes

test/03_reduction/reduction_resource.cpp Outdated Show resolved Hide resolved

CongMa13 commented Aug 8, 2024

View reviewed changes

test/03_reduction/reduction_test.cpp Outdated Show resolved Hide resolved

CongMa13 added 5 commits August 9, 2024 16:40

Add another CK instance with InSrcVectorDim==1 which supports far rig…

1e0843f

…ht dim

make the variable names readable

8019d81

Support reducing a tensor to a scalar

c6e62e7

Rename label in reduction test config file

44debee

Commnet out all test cases that beta != 0

f6dd21f

Test cases that beta != 0 cannot pass for a CK bug. Will add the test cases back when the CK fix is merged into amd_master

CongMa13 commented Aug 12, 2024

View reviewed changes

library/src/hiptensor.cpp Show resolved Hide resolved

CongMa13 commented Aug 12, 2024

View reviewed changes

library/src/reduction/reduction_solution_impl.hpp Show resolved Hide resolved

Support independent tensor C and D

a1b9782

Tensor C and D can point to the same tensor or two distinct tensors

CongMa13 commented Aug 13, 2024

View reviewed changes

library/src/hiptensor.cpp Show resolved Hide resolved

CongMa13 commented Aug 13, 2024

View reviewed changes

library/src/reduction/hiptensor_reduction.cpp Show resolved Hide resolved

CongMa13 commented Aug 13, 2024

View reviewed changes

library/src/reduction/hiptensor_reduction.cpp Show resolved Hide resolved

CongMa13 commented Aug 13, 2024

View reviewed changes

library/src/reduction/reduction_solution_impl.hpp Show resolved Hide resolved

CongMa13 commented Aug 13, 2024

View reviewed changes

test/03_reduction/reduction_test.hpp Show resolved Hide resolved

CongMa13 marked this pull request as ready for review August 13, 2024 14:40

CongMa13 requested review from cgmillette, bragadeesh, mkarunan, dlangbe and afanfa as code owners August 13, 2024 14:40

Support permutation of output of reduction

298d562

CongMa13 commented Aug 13, 2024

View reviewed changes

test/03_reduction/reduction_test.cpp Show resolved Hide resolved

CongMa13 force-pushed the refuce_f16 branch from 5c6a4bb to 298d562 Compare August 13, 2024 23:01

CongMa13 commented Aug 14, 2024

View reviewed changes

library/src/reduction/hiptensor_reduction.cpp Show resolved Hide resolved

CongMa13 commented Aug 14, 2024

View reviewed changes

test/03_reduction/reduction_resource.cpp Show resolved Hide resolved

CongMa13 added 2 commits August 14, 2024 19:57

Add comment of permutation of output of reduction

064f3c6

comment out test cases that beta is not 0

0fdb159