
how to translate with multiple instances of the model using multiple GPUs #5

Open
StephennFernandes opened this issue Feb 15, 2024 · 1 comment


@StephennFernandes

Hey @pacman100,

The translation code could take days, given that there are multiple SFT datasets and multiple languages to translate into.

Is there a way to accelerate the code by launching multiple instances of the model on the same GPU, and repeating that across multiple GPUs, so that they collectively speed up the translation task?
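A common pattern for this (a sketch, not code from this repo; the GPU count, dataset size, and the `translate_shard` worker body are illustrative assumptions) is data parallelism: split the dataset into contiguous shards, then spawn one worker process per GPU, each pinned to its device via `CUDA_VISIBLE_DEVICES` before any CUDA context is created:

```python
import os
from multiprocessing import Process

def shard_bounds(n_items: int, n_shards: int):
    """Split n_items into n_shards contiguous (start, end) ranges,
    spreading any remainder over the first shards."""
    base, rem = divmod(n_items, n_shards)
    bounds, start = [], 0
    for i in range(n_shards):
        end = start + base + (1 if i < rem else 0)
        bounds.append((start, end))
        start = end
    return bounds

def translate_shard(gpu_id: int, start: int, end: int):
    # Pin this worker to one GPU before any CUDA work happens.
    os.environ["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    # Hypothetical worker body: load the translation model here and
    # translate dataset rows [start, end), writing to a per-shard file.
    # model = AutoModelForSeq2SeqLM.from_pretrained(...).to("cuda")
    # ...

if __name__ == "__main__":
    n_gpus = 4          # assumption: 4 GPUs in the server
    n_rows = 1_000_000  # assumption: total dataset rows
    procs = []
    for gpu_id, (start, end) in enumerate(shard_bounds(n_rows, n_gpus)):
        p = Process(target=translate_shard, args=(gpu_id, start, end))
        p.start()
        procs.append(p)
    for p in procs:
        p.join()
```

Because each worker writes its own output shard, the per-shard files can simply be concatenated at the end; no inter-process communication is needed during translation.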

@StephennFernandes
Author

Found a partial solution to the problem here: huggingface/datasets#6186

Still, I believe there is a better way: fill all the VRAM on a single GPU by creating multiple instances of the same model on that GPU, and then repeat the same on all available GPUs in the server to maximize FLOPs.
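The multiple-instances-per-GPU idea above can be sketched by treating each (GPU, replica) pair as an independent worker slot and giving each slot its own dataset shard. This is an illustrative sketch, not this repo's code; `replicas_per_gpu` is an assumption you would tune empirically by launching model copies on one GPU until VRAM is nearly full:

```python
from itertools import product

def assign_workers(n_gpus: int, replicas_per_gpu: int):
    """Enumerate (gpu_id, replica_id) worker slots.

    replicas_per_gpu is tuned by hand: the number of model copies
    that fit in one GPU's VRAM alongside activation memory.
    """
    return list(product(range(n_gpus), range(replicas_per_gpu)))

# Each slot then handles one dataset shard, e.g. with the shard
# splitting from the previous sketch:
# workers = assign_workers(n_gpus=4, replicas_per_gpu=3)  # 12 shards total
```

One caveat worth noting: multiple replicas on one GPU only help if a single model instance leaves the GPU underutilized (small model, small batches); otherwise a single instance with a larger batch size usually achieves the same throughput with less memory overhead.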
