Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Py-TXI (TGI and TEI) #147

Merged
merged 14 commits into from
Mar 8, 2024
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
name: CLI CPU Py-TGI Tests
name: CLI CPU Py-TXI Tests

on:
workflow_dispatch:
Expand All @@ -12,7 +12,7 @@ concurrency:
cancel-in-progress: true

jobs:
run_cli_cpu_py_tgi_tests:
run_cli_cpu_py_txi_tests:
runs-on: ubuntu-latest
steps:
- name: Free disk space
Expand All @@ -35,10 +35,13 @@ jobs:
- name: Install requirements
run: |
pip install --upgrade pip
pip install -e .[testing,py-tgi]
pip install -e .[testing,py-txi]

- name: Pull TGI docker image
run: docker pull ghcr.io/huggingface/text-generation-inference:latest

- name: Pull TEI docker image
run: docker pull ghcr.io/huggingface/text-embeddings-inference:cpu-latest

- name: Run tests
run: pytest -k "cli and cpu and py_tgi"
run: pytest -k "cli and cpu and py_txi"
36 changes: 36 additions & 0 deletions examples/tei_bert.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
defaults:
- backend: py-txi
- launcher: inline # default launcher
- benchmark: inference # default benchmark
- experiment # inheriting experiment schema
- _self_ # for hydra 1.1 compatibility
- override hydra/job_logging: colorlog # colorful logging
- override hydra/hydra_logging: colorlog # colorful logging

experiment_name: tei_bert

backend:
device: cpu
pooling: cls
model: bert-base-uncased

benchmark:
input_shapes:
batch_size: 64
sequence_length: 128

# hydra/cli specific settings
hydra:
run:
# where to store run results
dir: runs/${experiment_name}
sweep:
# where to store sweep results
dir: sweeps/${experiment_name}
job:
# change working directory to the run directory
chdir: true
env_set:
# set environment variable OVERRIDE_BENCHMARKS to 1
# to not skip benchmarks that have been run before
OVERRIDE_BENCHMARKS: 1
16 changes: 7 additions & 9 deletions examples/tgi_llama.yaml
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
defaults:
- backend: text-generation-inference # default backend
- benchmark: inference # default benchmark
- backend: py-txi
- launcher: inline # default launcher
- benchmark: inference # default benchmark
- experiment # inheriting experiment schema
- _self_ # for hydra 1.1 compatibility
- override hydra/job_logging: colorlog # colorful logging
Expand All @@ -10,18 +10,16 @@ defaults:
experiment_name: tgi_llama

backend:
device: cuda
device: cpu
device_ids: 0,1
device_map: true
model: TheBloke/Llama-2-7B-AWQ
quantization_scheme: awq
sharded: false
no_weights: true
model: NousResearch/Nous-Hermes-llama-2-7b

benchmark:
input_shapes:
batch_size: 1
batch_size: 4
sequence_length: 256
new_tokens: 1000
new_tokens: 100

# hydra/cli specific settings
hydra:
Expand Down
153 changes: 0 additions & 153 deletions optimum_benchmark/backends/py_tgi/backend.py

This file was deleted.

55 changes: 0 additions & 55 deletions optimum_benchmark/backends/py_tgi/config.py

This file was deleted.

Loading
Loading