-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP IBM release #68
WIP IBM release #68
Commits on Jun 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0aa4139 - Browse repository at this point
Copy the full SHA 0aa4139View commit details -
Configuration menu - View commit details
-
Copy full SHA for b4ed395 - Browse repository at this point
Copy the full SHA b4ed395View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0f677c3 - Browse repository at this point
Copy the full SHA 0f677c3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1133e22 - Browse repository at this point
Copy the full SHA 1133e22View commit details -
Configuration menu - View commit details
-
Copy full SHA for fda6325 - Browse repository at this point
Copy the full SHA fda6325View commit details -
[Model] Support Qwen-VL and Qwen-VL-Chat models with text-only inputs (…
…vllm-project#5710) Co-authored-by: Roger Wang <ywang@roblox.com>
Configuration menu - View commit details
-
Copy full SHA for 094fdac - Browse repository at this point
Copy the full SHA 094fdacView commit details -
[Misc] Remove vllm-project#4789 workaround left in vllm/entrypoints/o…
…penai/run_batch.py (vllm-project#5756)
Configuration menu - View commit details
-
Copy full SHA for e51d665 - Browse repository at this point
Copy the full SHA e51d665View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a0c093 - Browse repository at this point
Copy the full SHA 4a0c093View commit details -
Configuration menu - View commit details
-
Copy full SHA for 83066f6 - Browse repository at this point
Copy the full SHA 83066f6View commit details -
Configuration menu - View commit details
-
Copy full SHA for e31d19c - Browse repository at this point
Copy the full SHA e31d19cView commit details -
[BugFix] [Kernel] Add Cutlass2x fallback kernels (vllm-project#5744)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for 40e9542 - Browse repository at this point
Copy the full SHA 40e9542View commit details -
Configuration menu - View commit details
-
Copy full SHA for a2bf2e2 - Browse repository at this point
Copy the full SHA a2bf2e2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 664ebf4 - Browse repository at this point
Copy the full SHA 664ebf4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5c50a9a - Browse repository at this point
Copy the full SHA 5c50a9aView commit details -
Configuration menu - View commit details
-
Copy full SHA for e6035dc - Browse repository at this point
Copy the full SHA e6035dcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 98bbdeb - Browse repository at this point
Copy the full SHA 98bbdebView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6244a71 - Browse repository at this point
Copy the full SHA 6244a71View commit details -
[ci] Remove aws template (vllm-project#5757)
Signed-off-by: kevin <kevin@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for 68062fb - Browse repository at this point
Copy the full SHA 68062fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for f4c2e68 - Browse repository at this point
Copy the full SHA f4c2e68View commit details -
[Speculative Decoding] Support draft model on different tensor-paral…
…lel size than target model (vllm-project#5414)
Configuration menu - View commit details
-
Copy full SHA for 726516c - Browse repository at this point
Copy the full SHA 726516cView commit details -
Configuration menu - View commit details
-
Copy full SHA for f36cd77 - Browse repository at this point
Copy the full SHA f36cd77View commit details -
Configuration menu - View commit details
-
Copy full SHA for 77e41ec - Browse repository at this point
Copy the full SHA 77e41ecView commit details -
Configuration menu - View commit details
-
Copy full SHA for ec820f3 - Browse repository at this point
Copy the full SHA ec820f3View commit details -
Configuration menu - View commit details
-
Copy full SHA for b6ef994 - Browse repository at this point
Copy the full SHA b6ef994View commit details -
Configuration menu - View commit details
-
Copy full SHA for 98fc761 - Browse repository at this point
Copy the full SHA 98fc761View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2aeab77 - Browse repository at this point
Copy the full SHA 2aeab77View commit details -
[Hardware][AMD][CI/Build][Doc] Upgrade to ROCm 6.1, Dockerfile improv…
…ements, test fixes (vllm-project#5422)
Configuration menu - View commit details
-
Copy full SHA for 83a217e - Browse repository at this point
Copy the full SHA 83a217eView commit details -
Configuration menu - View commit details
-
Copy full SHA for cb6c339 - Browse repository at this point
Copy the full SHA cb6c339View commit details -
[CI/Build] Add E2E tests for MLPSpeculator (vllm-project#5791)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 323eb56 - Browse repository at this point
Copy the full SHA 323eb56View commit details -
Configuration menu - View commit details
-
Copy full SHA for b57155e - Browse repository at this point
Copy the full SHA b57155eView commit details -
[Core] Refactor Worker and ModelRunner to consolidate control plane c…
…ommunication (vllm-project#5408) Signed-off-by: Stephanie Wang <swang@cs.berkeley.edu> Signed-off-by: Stephanie <swang@anyscale.com> Co-authored-by: Stephanie <swang@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for 9504961 - Browse repository at this point
Copy the full SHA 9504961View commit details -
Configuration menu - View commit details
-
Copy full SHA for 53b9418 - Browse repository at this point
Copy the full SHA 53b9418View commit details -
Configuration menu - View commit details
-
Copy full SHA for c598c5b - Browse repository at this point
Copy the full SHA c598c5bView commit details -
Configuration menu - View commit details
-
Copy full SHA for e5e4b11 - Browse repository at this point
Copy the full SHA e5e4b11View commit details -
Configuration menu - View commit details
-
Copy full SHA for eaf51e6 - Browse repository at this point
Copy the full SHA eaf51e6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 214cf9d - Browse repository at this point
Copy the full SHA 214cf9dView commit details -
[Kernel] Adding bias epilogue support for
cutlass_scaled_mm
(vllm-p……roject#5560) Co-authored-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com> Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for d530475 - Browse repository at this point
Copy the full SHA d530475View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b2a6f0 - Browse repository at this point
Copy the full SHA 2b2a6f0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7b07f28 - Browse repository at this point
Copy the full SHA 7b07f28View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ef738c - Browse repository at this point
Copy the full SHA 7ef738cView commit details -
Configuration menu - View commit details
-
Copy full SHA for b6c26b3 - Browse repository at this point
Copy the full SHA b6c26b3View commit details -
Configuration menu - View commit details
-
Copy full SHA for e957974 - Browse repository at this point
Copy the full SHA e957974View commit details -
Configuration menu - View commit details
-
Copy full SHA for c6a2818 - Browse repository at this point
Copy the full SHA c6a2818View commit details -
[BugFix] Fix cuda graph for MLPSpeculator (vllm-project#5875)
Co-authored-by: Abhinav Goyal <abhinav.goyal@flipkart.com>
Configuration menu - View commit details
-
Copy full SHA for 35ebe7d - Browse repository at this point
Copy the full SHA 35ebe7dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b8b470 - Browse repository at this point
Copy the full SHA 8b8b470View commit details -
[VLM][Bugfix] Make sure that
multi_modal_kwargs
is broadcasted prop……erly (vllm-project#5880) Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 8444703 - Browse repository at this point
Copy the full SHA 8444703View commit details -
Configuration menu - View commit details
-
Copy full SHA for 78b8c94 - Browse repository at this point
Copy the full SHA 78b8c94View commit details -
Configuration menu - View commit details
-
Copy full SHA for 147bca0 - Browse repository at this point
Copy the full SHA 147bca0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9e1d61e - Browse repository at this point
Copy the full SHA 9e1d61eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7e0358a - Browse repository at this point
Copy the full SHA 7e0358aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 59f2ce5 - Browse repository at this point
Copy the full SHA 59f2ce5View commit details -
Configuration menu - View commit details
-
Copy full SHA for e3f2711 - Browse repository at this point
Copy the full SHA e3f2711View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6c13375 - Browse repository at this point
Copy the full SHA 6c13375View commit details -
[Core] Make Ray an optional "extras" requirement
Still included in built docker images
Configuration menu - View commit details
-
Copy full SHA for d9562cf - Browse repository at this point
Copy the full SHA d9562cfView commit details -
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for cf8b27e - Browse repository at this point
Copy the full SHA cf8b27eView commit details -
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 95d1306 - Browse repository at this point
Copy the full SHA 95d1306View commit details -
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for cd15111 - Browse repository at this point
Copy the full SHA cd15111View commit details
Commits on Jun 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for cfa5530 - Browse repository at this point
Copy the full SHA cfa5530View commit details -
Configuration menu - View commit details
-
Copy full SHA for 88e5be1 - Browse repository at this point
Copy the full SHA 88e5be1View commit details -
Configuration menu - View commit details
-
Copy full SHA for d38ed5d - Browse repository at this point
Copy the full SHA d38ed5dView commit details -
Configuration menu - View commit details
-
Copy full SHA for bc30b64 - Browse repository at this point
Copy the full SHA bc30b64View commit details -
[VLM][BugFix] Make sure that
multi_modal_kwargs
can broadcast prope……rly with ring buffer. (vllm-project#5905) Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com> Co-authored-by: Roger Wang <ywang@roblox.com>
Configuration menu - View commit details
-
Copy full SHA for 9bafffb - Browse repository at this point
Copy the full SHA 9bafffbView commit details -
Configuration menu - View commit details
-
Copy full SHA for bba1cc6 - Browse repository at this point
Copy the full SHA bba1cc6View commit details -
[Core] Registry for processing model inputs (vllm-project#5214)
Co-authored-by: ywang96 <ywang@roblox.com>
Configuration menu - View commit details
-
Copy full SHA for 4a5916d - Browse repository at this point
Copy the full SHA 4a5916dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0060a6f - Browse repository at this point
Copy the full SHA 0060a6fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 358f984 - Browse repository at this point
Copy the full SHA 358f984View commit details -
[Bugfix] Better error message for MLPSpeculator when `num_speculative…
…_tokens` is set too high (vllm-project#5894) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for c5c4a9c - Browse repository at this point
Copy the full SHA c5c4a9cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1099339 - Browse repository at this point
Copy the full SHA 1099339View commit details -
[Distributed] Make it clear that % should not be in tensor dict keys. (…
…vllm-project#5927) Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 3536f3c - Browse repository at this point
Copy the full SHA 3536f3cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1b8ffd1 - Browse repository at this point
Copy the full SHA 1b8ffd1View commit details -
Configuration menu - View commit details
-
Copy full SHA for b82befe - Browse repository at this point
Copy the full SHA b82befeView commit details -
[ Misc ] Remove
fp8_shard_indexer
from Col/Row Parallel Linear (Sim……plify Weight Loading) (vllm-project#5928) Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for b3a4ff5 - Browse repository at this point
Copy the full SHA b3a4ff5View commit details -
[ Bugfix ] Enabling Loading Models With Fused QKV/MLP on Disk with FP8 (
vllm-project#5921) Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Configuration menu - View commit details
-
Copy full SHA for 919dc5d - Browse repository at this point
Copy the full SHA 919dc5dView commit details -
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for cf4072b - Browse repository at this point
Copy the full SHA cf4072bView commit details