Skip to content

Releases: triton-inference-server/model_navigator

Triton Model Navigator v0.7.4

06 Nov 20:34
Compare
Choose a tag to compare

Updates:

  • new: decoupled mode configuration in Triton Model Config
  • new: support for PyTorch ExportedProgram and ONNX dynamo export
  • new: added GraphSurgeon ONNX optimalization
  • fix: compatibility of generating PyTriton model config through adapter
  • fix: installation of packages that are platform dependent
  • fix: update package config with model loaded from source
  • change: in TensorRT runner, when TensorType.TORCH is the return type lazily convert tensor to Torch
  • change: move from Polygraphy CLI to Polygraphy Python API
  • change: removed Windows from support list

Version of external components used during testing:

Triton Model Navigator v0.7.3

27 Sep 07:59
Compare
Choose a tag to compare

Updates:

  • new: Data dependent dynamic control flow support in nav.Module (multiple computation graphs per module)
  • new: Added find max batch size utility
  • new: Added utilities API documentation
  • new: Add Timer class for measuring execution time of models and Inplace modules.
  • fix: Use wide range of shapes for TensorRT conversion
  • fix: Sorting of samples loaded from workspace
  • change: in Inplace, store one sample by default per module and store shape info for all samples
  • change: always execute export for all supported formats

Known issues and limitations:

  • nav.Module moves original torch.nn.Module to the CPU, in case of weight sharing that might result in unexpected behaviour
  • For data dependent dynamic control flow (multiple computation graphs) nav.Module might copy the weights for each separate graph

Version of external components used during testing:

Triton Model Navigator v0.7.2

30 Aug 10:14
Compare
Choose a tag to compare

Triton Model Navigator v0.7.1

21 Aug 12:47
Compare
Choose a tag to compare

Triton Model Navigator v0.7.0

11 Aug 13:55
Compare
Choose a tag to compare
  • new: Inplace Optimize feature - optimize models directly in the Python code

  • new: Non-tensor inputs and outputs support

  • new: Model warmup support in Triton model configuration

  • new: nav.tensorrt.optimize api added for testing and measuring performance of TensorRT models

  • new: Extended custom configs to pass arguments directly to export and conversion operations like torch.onnx.export or polygraphy convert

  • new: Collect GPU clock during model profiling

  • new: Add option to configure minimal trials and stabilization windows for performance verification and profiling

  • change: Navigator package version change to 0.2.3. Custom configurations now use trt_profiles list instead single value

  • change: Store separate reproduction scripts for runners used during correctness and profiling

  • Version of external components used during testing:

Triton Model Navigator v0.6.3

25 Jul 13:09
Compare
Choose a tag to compare

Triton Model Navigator v0.6.2

19 Jul 08:51
Compare
Choose a tag to compare

Triton Model Navigator v0.6.1

07 Jul 15:27
Compare
Choose a tag to compare

Triton Model Navigator v0.6.0

30 Jun 16:52
Compare
Choose a tag to compare
  • new: Zero-copy runners for Torch, ONNX and TensorRT - omit H2D and D2H memory copy between runners execution

  • new: nav.pacakge.profile API method to profile generated models on provided dataloader

  • change: ProfilerConfig replaced with OptimizationProfile:

    • new: OptimizationProfile impact the conversion for TensorRT
    • new: batch_sizes and max_batch_size limit the max profile in TensorRT conversion
    • new: Allow to provide separate dataloader for profiling - first sample used only
  • new: allow to run nav.package.optimize on empty package - status generation only

  • new: use torch.inference_mode for inference runner when PyTorch 2.x is available

  • fix: Missing model in config when passing package generated during nav.{framework}.optimize directly to nav.package.optimize command

  • Other minor fixes and improvements

  • Version of external components used during testing:

Triton Model Navigator v0.5.6

23 Jun 13:52
Compare
Choose a tag to compare