Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP IBM release #68

Closed

Commits on Jun 27, 2024

  1. Configuration menu
    Copy the full SHA
    0aa4139 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b4ed395 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    0f677c3 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    1133e22 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    fda6325 View commit details
    Browse the repository at this point in the history
  6. [Model] Support Qwen-VL and Qwen-VL-Chat models with text-only inputs (

    …vllm-project#5710)
    
    Co-authored-by: Roger Wang <ywang@roblox.com>
    2 people authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    094fdac View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    e51d665 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    4a0c093 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    83066f6 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    e31d19c View commit details
    Browse the repository at this point in the history
  11. [BugFix] [Kernel] Add Cutlass2x fallback kernels (vllm-project#5744)

    Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
    2 people authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    40e9542 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    a2bf2e2 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    664ebf4 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    5c50a9a View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    e6035dc View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    98bbdeb View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    6244a71 View commit details
    Browse the repository at this point in the history
  18. [ci] Remove aws template (vllm-project#5757)

    Signed-off-by: kevin <kevin@anyscale.com>
    khluu authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    68062fb View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    f4c2e68 View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    726516c View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    f36cd77 View commit details
    Browse the repository at this point in the history
  22. Configuration menu
    Copy the full SHA
    77e41ec View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    ec820f3 View commit details
    Browse the repository at this point in the history
  24. Configuration menu
    Copy the full SHA
    b6ef994 View commit details
    Browse the repository at this point in the history
  25. Configuration menu
    Copy the full SHA
    98fc761 View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    2aeab77 View commit details
    Browse the repository at this point in the history
  27. Configuration menu
    Copy the full SHA
    83a217e View commit details
    Browse the repository at this point in the history
  28. Configuration menu
    Copy the full SHA
    cb6c339 View commit details
    Browse the repository at this point in the history
  29. [CI/Build] Add E2E tests for MLPSpeculator (vllm-project#5791)

    Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
    tdoublep authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    323eb56 View commit details
    Browse the repository at this point in the history
  30. Configuration menu
    Copy the full SHA
    b57155e View commit details
    Browse the repository at this point in the history
  31. [Core] Refactor Worker and ModelRunner to consolidate control plane c…

    …ommunication (vllm-project#5408)
    
    Signed-off-by: Stephanie Wang <swang@cs.berkeley.edu>
    Signed-off-by: Stephanie <swang@anyscale.com>
    Co-authored-by: Stephanie <swang@anyscale.com>
    2 people authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    9504961 View commit details
    Browse the repository at this point in the history
  32. Configuration menu
    Copy the full SHA
    53b9418 View commit details
    Browse the repository at this point in the history
  33. Configuration menu
    Copy the full SHA
    c598c5b View commit details
    Browse the repository at this point in the history
  34. Configuration menu
    Copy the full SHA
    e5e4b11 View commit details
    Browse the repository at this point in the history
  35. Configuration menu
    Copy the full SHA
    eaf51e6 View commit details
    Browse the repository at this point in the history
  36. Configuration menu
    Copy the full SHA
    214cf9d View commit details
    Browse the repository at this point in the history
  37. [Kernel] Adding bias epilogue support for cutlass_scaled_mm (vllm-p…

    …roject#5560)
    
    Co-authored-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
    Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
    3 people authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    d530475 View commit details
    Browse the repository at this point in the history
  38. Configuration menu
    Copy the full SHA
    2b2a6f0 View commit details
    Browse the repository at this point in the history
  39. Configuration menu
    Copy the full SHA
    7b07f28 View commit details
    Browse the repository at this point in the history
  40. Configuration menu
    Copy the full SHA
    7ef738c View commit details
    Browse the repository at this point in the history
  41. Configuration menu
    Copy the full SHA
    b6c26b3 View commit details
    Browse the repository at this point in the history
  42. Configuration menu
    Copy the full SHA
    e957974 View commit details
    Browse the repository at this point in the history
  43. Configuration menu
    Copy the full SHA
    c6a2818 View commit details
    Browse the repository at this point in the history
  44. [BugFix] Fix cuda graph for MLPSpeculator (vllm-project#5875)

    Co-authored-by: Abhinav Goyal <abhinav.goyal@flipkart.com>
    2 people authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    35ebe7d View commit details
    Browse the repository at this point in the history
  45. Configuration menu
    Copy the full SHA
    8b8b470 View commit details
    Browse the repository at this point in the history
  46. [VLM][Bugfix] Make sure that multi_modal_kwargs is broadcasted prop…

    …erly (vllm-project#5880)
    
    Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
    xwjiang2010 authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    8444703 View commit details
    Browse the repository at this point in the history
  47. Configuration menu
    Copy the full SHA
    78b8c94 View commit details
    Browse the repository at this point in the history
  48. Configuration menu
    Copy the full SHA
    147bca0 View commit details
    Browse the repository at this point in the history
  49. Configuration menu
    Copy the full SHA
    9e1d61e View commit details
    Browse the repository at this point in the history
  50. Configuration menu
    Copy the full SHA
    7e0358a View commit details
    Browse the repository at this point in the history
  51. Configuration menu
    Copy the full SHA
    59f2ce5 View commit details
    Browse the repository at this point in the history
  52. Squash 4645

    prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    e3f2711 View commit details
    Browse the repository at this point in the history
  53. Squash 5930

    prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    6c13375 View commit details
    Browse the repository at this point in the history
  54. [Core] Make Ray an optional "extras" requirement

    Still included in built docker images
    njhill authored and prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    d9562cf View commit details
    Browse the repository at this point in the history
  55. 🚧 add ibm-adapter branch

    Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
    prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    cf8b27e View commit details
    Browse the repository at this point in the history
  56. 🎨 fix format

    Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
    prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    95d1306 View commit details
    Browse the repository at this point in the history
  57. 🎨 fix format

    Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
    prashantgupta24 committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    cd15111 View commit details
    Browse the repository at this point in the history

Commits on Jun 28, 2024

  1. Configuration menu
    Copy the full SHA
    cfa5530 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    88e5be1 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d38ed5d View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    bc30b64 View commit details
    Browse the repository at this point in the history
  5. [VLM][BugFix] Make sure that multi_modal_kwargs can broadcast prope…

    …rly with ring buffer. (vllm-project#5905)
    
    Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
    Co-authored-by: Roger Wang <ywang@roblox.com>
    2 people authored and prashantgupta24 committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    9bafffb View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    bba1cc6 View commit details
    Browse the repository at this point in the history
  7. [Core] Registry for processing model inputs (vllm-project#5214)

    Co-authored-by: ywang96 <ywang@roblox.com>
    2 people authored and prashantgupta24 committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    4a5916d View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    0060a6f View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    358f984 View commit details
    Browse the repository at this point in the history
  10. [Bugfix] Better error message for MLPSpeculator when `num_speculative…

    …_tokens` is set too high (vllm-project#5894)
    
    Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
    tdoublep authored and prashantgupta24 committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    c5c4a9c View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    1099339 View commit details
    Browse the repository at this point in the history
  12. [Distributed] Make it clear that % should not be in tensor dict keys. (

    …vllm-project#5927)
    
    Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
    xwjiang2010 authored and prashantgupta24 committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    3536f3c View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    1b8ffd1 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    b82befe View commit details
    Browse the repository at this point in the history
  15. [ Misc ] Remove fp8_shard_indexer from Col/Row Parallel Linear (Sim…

    …plify Weight Loading) (vllm-project#5928)
    
    Co-authored-by: Robert Shaw <rshaw@neuralmagic>
    2 people authored and prashantgupta24 committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    b3a4ff5 View commit details
    Browse the repository at this point in the history
  16. [ Bugfix ] Enabling Loading Models With Fused QKV/MLP on Disk with FP8 (

    vllm-project#5921)
    
    Co-authored-by: Robert Shaw <rshaw@neuralmagic>
    2 people authored and prashantgupta24 committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    919dc5d View commit details
    Browse the repository at this point in the history
  17. 🎨 format code

    Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
    prashantgupta24 committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    cf4072b View commit details
    Browse the repository at this point in the history