Skip to content

Commit

Permalink
revert model changes
Browse files Browse the repository at this point in the history
  • Loading branch information
samos123 committed Oct 24, 2024
1 parent 7115887 commit 6055e72
Showing 1 changed file with 0 additions and 2 deletions.
2 changes: 0 additions & 2 deletions manifests/models/llama-3.1-8b-instruct-fp8-l4.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -6,9 +6,7 @@ metadata:
spec:
features: [TextGeneration]
owner: neuralmagic
minReplicas: 1
url: hf://neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8
# cacheProfile: "efs-dynamic"
engine: VLLM
args:
- --max-model-len=16384
Expand Down

0 comments on commit 6055e72

Please sign in to comment.