-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sending multiple shapes binary input data #863
Comments
Hi @eladamittai, Under the hood, Model Analyzer uses Perf Analyzer. You can find documentation for passing in input data here: https://github.com/triton-inference-server/client/blob/main/src/c%2B%2B/perf_analyzer/docs/input_data.md
Now then, you had a number of specific asks. I'm trying to wrap my head around if they are possible. You need binary data: If you do need binary data, then there are a few possible options, although I'm not sure they are all compatible with the rest of your asks. You want different shaped requests You want a ratio of different shapes |
Hey, thank you for answering. I checked the perf analyzer documentation, and I managed to send the requests in multiple shapes for a float32 compiled version I have of the model using a json file, but as you can see from this older issue I opened about sending float16 input using json, it's not possible when using grpc. Unless something changed in the later releases of the model/perf analyzer. From your response I didn't understand if I can send multiple shapes using a binary data. I tried to combine the binary dir with the json file, as such: |
Hey, is there an answer? |
Apologies for the delay. I'm looking into this. |
I believe you can use base64 for binary data. Then you can stick to the normal input_data format and provide shapes.
Using that as a basis, you could provide 3 inputs, 2 of one shape and 1 of another, to accomplish the goal of a 2:1 ratio of input shapes. |
Hey, I'm using a model with a dynamic shape input with float16 type, and I wanted to test it using grpc, so I have to use binary input data. I was wondering if there is a way to send multiple requests in different shapes like in a json input data but with binary data. Also, if there is a way to send the requests in a certain ratio. Like, sending 16000 shaped requests twice the amount of 32000 shaped requests.
The text was updated successfully, but these errors were encountered: