[Draft] D3D12Compute: HLSL 6 #8207

Open. Wants to merge 34 commits into base: main


soufianekhiat

TBD

@steven-johnson
Contributor

What's the status on this PR?

@soufianekhiat
Author

Still a work in progress. Feel free to comment if you think the design direction is not correct.
It's related to #8203.

soufianekhiat and others added 20 commits August 27, 2024 20:54
GCC 12 only supports _Float16 on x86. Support for ARM was added in
GCC 13. This causes a build failure in the manylinux_2_28 images.
The instructions for which llvm to acquire were stale
* Update pip package metadata

* Link to the CMake package docs from Doxygen

* Fix invalid Doxygen annotation in Serialization.h
PyPI rejected this because of a spacing issue.
A few quirks in the Markdown parser were worked around here.

The most notable is that the sequence `]:` causes Doxygen to interpret a
would-be link as a trailing reference even if it is not at the start of
a line. Duplicating the single bracket reference is a portable
workaround, i.e.

    [winget]  ~>  [winget][winget]

It also doesn't stop interpreting `@` directives inside inline code, so
it warns about our use of the `@` as a decorator symbol inside
Python.md.
Someone was using this as a reference expert schedule, but it was
stale and a bit simplistic for large matrices. I rescheduled it
to get a better fraction of peak. This also now demonstrates how
to use rfactor to block an sgemm over the k axis.
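For readers unfamiliar with the idea, blocking a matrix multiply over the k axis means splitting the reduction into chunks, accumulating one partial result per chunk, and then summing the partials. The plain C++ sketch below illustrates that structure only. It is not Halide code and not the schedule from this commit; the function name `sgemm_k_blocked` and the `k_block` parameter are hypothetical, chosen for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of Halide): C = A * B with the reduction
// over k split into blocks. A is M x K, B is K x N, all row-major.
std::vector<float> sgemm_k_blocked(const std::vector<float>& A,
                                   const std::vector<float>& B,
                                   std::size_t M, std::size_t N, std::size_t K,
                                   std::size_t k_block) {
    assert(k_block > 0);
    assert(A.size() == M * K && B.size() == K * N);

    const std::size_t num_blocks = (K + k_block - 1) / k_block;

    // Intermediate stage: one partial M x N product per k block.
    // Each block is independent, so this outer loop could run in parallel.
    std::vector<float> partial(num_blocks * M * N, 0.0f);
    for (std::size_t kb = 0; kb < num_blocks; ++kb) {
        const std::size_t k_begin = kb * k_block;
        const std::size_t k_end = std::min(K, k_begin + k_block);
        const std::size_t base = kb * M * N;
        for (std::size_t i = 0; i < M; ++i) {
            for (std::size_t k = k_begin; k < k_end; ++k) {
                const float a = A[i * K + k];
                for (std::size_t j = 0; j < N; ++j) {
                    partial[base + i * N + j] += a * B[k * N + j];
                }
            }
        }
    }

    // Final stage: reduce the partial products over the block index.
    std::vector<float> C(M * N, 0.0f);
    for (std::size_t kb = 0; kb < num_blocks; ++kb) {
        for (std::size_t idx = 0; idx < M * N; ++idx) {
            C[idx] += partial[kb * M * N + idx];
        }
    }
    return C;
}
```

Note that splitting the reduction changes the order of floating-point additions, so results can differ slightly from a single-pass sum for non-integer inputs.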
* Fix for top-of-tree LLVM

* Update simd_op_check_sve2.cpp
First is just a requirement that a template be declared before
use. Second is removing volatile from non-reference return types.

Co-authored-by: Zalman Stern <zalman@macbook-pro.lan>
* Fix for top-of-tree LLVM

The wasm backend should no longer specialize llvm.wasm.sub.sat* intrinsics, see llvm/llvm-project#109405

* trigger buildbots
5 participants