Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Translation example with ctranslate2's Translator. #246

Open
uahmed93 opened this issue Sep 16, 2024 · 3 comments
Open

Translation example with ctranslate2's Translator. #246

uahmed93 opened this issue Sep 16, 2024 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@uahmed93
Copy link

As we have added support for HF model translation via CrossFit, we are working towards performance improvement with ctranslate2. This work depends on adding support for ctranslate2 in CrossFit, and then will need to create pipeline for this work in NDC.(Draft PR)

With a workaround for ctranslate2 in CrossFit, huge performance improvement was seen. On single GPU, following is the performance :

Experiment Standalone pytorch inference Standalone + ctranslate2 Crossfit+ctranslate2
Inference time ~1hr 50mins 23min 54sec 6min 29sec (including extra processing for workarund : 3sec)
BLEU score - 0.9585 0.9586

BLEU score was calculated w.r.t Standalone pytorch inference on 74058 sentences.

@VibhuJawa
Copy link
Collaborator

CC: @arhamm1 for awareness for the work here

@VibhuJawa VibhuJawa added the enhancement New feature or request label Sep 17, 2024
@uahmed93
Copy link
Author

Added an example notebook here

@Christina-Young-NVIDIA
Copy link
Collaborator

Moving to next sprint per Arham's approval.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants