
[RFC]: Model Deprecation Policy #9669

Open · youkaichao opened this issue Oct 24, 2024 · 6 comments

@youkaichao (Member) commented Oct 24, 2024

Motivation.

Usually, we accept model contributions from model vendors, as long as they can verify that the model output is correct.

When a new model is added to vLLM, the vLLM maintainers need to maintain the code and update it when necessary.

However, we find that model vendors are sometimes unresponsive, and the model can become obsolete or even break with newer transformers versions.

As stated in https://docs.vllm.ai/en/latest/models/supported_models.html#model-support-policy , some models are community-driven, and vLLM maintainers do not actively keep them up to date.

Here, I want to go one step further: if a model is broken (i.e., it cannot run directly with the latest transformers version) and we cannot reach the model vendor for a period of time, we will remove the model from vLLM.

An example: the xverse model added by #3610. The Hugging Face repo https://huggingface.co/xverse/XVERSE-7B-Chat/tree/main has not been updated in a year, and its tokenizer is broken with recent transformers, leading to an error similar to huggingface/transformers#31789. In fact, when I added torch.compile support for this model in #9641, I found that I had to use the tokenizer from meta-llama/Llama-2-7b-chat-hf in order to run the model.
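
For reference, a minimal sketch of that workaround using vLLM's offline inference API (the prompt and sampling settings are illustrative; the tokenizer override is the only point, and correct model behavior with the swapped tokenizer is not guaranteed):

```python
# Sketch of the workaround described above: run the xverse model while
# substituting the Llama-2 tokenizer, since the bundled tokenizer no longer
# loads with recent transformers releases.
from vllm import LLM, SamplingParams

llm = LLM(
    model="xverse/XVERSE-7B-Chat",
    tokenizer="meta-llama/Llama-2-7b-chat-hf",  # swap in a working tokenizer
    trust_remote_code=True,
)
outputs = llm.generate(["Hello, my name is"], SamplingParams(max_tokens=32))
print(outputs[0].outputs[0].text)
```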

Proposed Change.

If we find that a model is broken (i.e., it cannot run directly with the latest transformers version) and we cannot reach the model vendor for a period of time, we will remove the model from vLLM.

Please comment and vote on the deprecation period:

  1. one week
  2. two weeks
  3. four weeks

Feedback Period.

1 week (10/24 - 10/31, both inclusive)

CC List.

No response

Any Other Things.

No response

Before submitting a new issue...

  • Make sure you have already searched for relevant issues and asked the chatbot at the bottom right corner of the documentation page, which can answer many frequently asked questions.
@youkaichao added the RFC label Oct 24, 2024
@DarkLight1337 (Member)

Let's give a buffer of four weeks, as some PRs have taken considerably longer than that. Meanwhile, we can simply disable any tests related to the model so it won't impact our CI.

@robertgshaw2-neuralmagic (Collaborator)

  • I think we should also ship a point release with a deprecation warning, so we can solicit community feedback beforehand
  • We can also suggest that users leverage the plugin system if they want to keep using the model (see the sketch below)
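
As a rough sketch of the plugin route (package and module names here are hypothetical; this assumes vLLM's general plugin mechanism, where a function exposed under the vllm.general_plugins entry point is called at startup):

```python
# xverse_plugin/__init__.py: hypothetical out-of-tree package that keeps a
# removed model usable. Exposed via the "vllm.general_plugins" entry point
# so vLLM invokes register() on startup.
from vllm import ModelRegistry


def register():
    # XverseForCausalLM here stands for the modeling code copied out of
    # the vLLM tree into this plugin package.
    from xverse_plugin.modeling_xverse import XverseForCausalLM
    ModelRegistry.register_model("XverseForCausalLM", XverseForCausalLM)
```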

@simon-mo (Collaborator)

I can add another bar: the reported usage of the model. I pulled the data for XverseForCausalLM, and it seems to be actively used by the same group of users, but they are on version 0.4.0.

@tlrmchlsmth (Collaborator)

I generally agree that it makes sense to have a deprecation policy in place, so we don't need to decide ad hoc what to do whenever something like this xverse issue pops up.

4 weeks seems reasonable to me.

@ywang96 (Member) commented Oct 24, 2024

Overall this policy makes sense to me, but we might need to think about the definition of "broken" models.

It seems that we're going to rely on transformers to determine this, which I have no issue with, but it is something we should explicitly define and state in our documentation.

@youkaichao (Member, Author)

> definition of "broken" models

I would define it as: the model does not work with the latest vLLM (which usually uses the latest transformers / PyTorch) out of the box.
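
One way this definition could be operationalized (a sketch, not an official check; the model name and the pass criterion are assumptions for illustration):

```python
# Smoke test for "broken": can the model load and generate out of the box
# with the latest installed vLLM / transformers / PyTorch stack?
from vllm import LLM, SamplingParams


def runs_out_of_the_box(model: str) -> bool:
    try:
        llm = LLM(model=model, trust_remote_code=True)
        out = llm.generate(["Hello"], SamplingParams(max_tokens=8))
        return bool(out[0].outputs[0].text)
    except Exception:
        # Any load-time failure (e.g. a tokenizer that no longer loads)
        # counts as "broken" under the definition above.
        return False


print(runs_out_of_the_box("xverse/XVERSE-7B-Chat"))
```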
