Fused ROPE and reshape cache kernel #229

Open: wants to merge 14 commits into main

Commits on Sep 19, 2024

  1. Initial Frame, without kernel itself

    Aleksandr Malyshev committed Sep 19, 2024
    eee3c78

Commits on Sep 20, 2024

  1. fused kernel

    Aleksandr Malyshev committed Sep 20, 2024
    2f19c01

Commits on Oct 7, 2024

  1. Merge branch 'main' into fused_re_and_reshape_cache_kernel

    Aleksandr Malyshev committed Oct 7, 2024
    beba692

Commits on Oct 9, 2024

  1. compilation passes, begging query

    Aleksandr Malyshev committed Oct 9, 2024
    d7eaac6
  2. query correctly ROPEd

    Aleksandr Malyshev committed Oct 9, 2024
    deaf6aa

Commits on Oct 11, 2024

  1. reduced test scope is working

    Aleksandr Malyshev committed Oct 11, 2024
    108fb65
  2. ordered per slot mapping test passed

    Aleksandr Malyshev committed Oct 11, 2024
    c5dca86
  3. kernel ready except fp8

    Aleksandr Malyshev committed Oct 11, 2024
    58a93c1
  4. + fp8 for KV cache support

    Aleksandr Malyshev committed Oct 11, 2024
    ef9a664

Commits on Oct 14, 2024

  1. vllm and test runs, vllm has correctness issue

    Aleksandr Malyshev committed Oct 14, 2024
    b30a7f7

Commits on Oct 15, 2024

  1. correctness is fixed

    Aleksandr Malyshev committed Oct 15, 2024
    f0648c3
  2. Merge branch 'main' into fused_re_and_reshape_cache_kernel

    Aleksandr Malyshev committed Oct 15, 2024
    afa466b
  3. honest switch between fused ROPE and original

    Aleksandr Malyshev committed Oct 15, 2024
    2548322

Commits on Oct 16, 2024

  1. timing test

    Aleksandr Malyshev committed Oct 16, 2024
    3af41ef