You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
the translation code could take days given there are multiple SFT datasets and multiple languages to translate on.
is there a way to accelerate the code by launching multiple instances of the model on the same GPU, and launching the same on multiple GPUs such that it can collectively speed up the translation task ?
The text was updated successfully, but these errors were encountered:
found a partial solution to the problem here: huggingface/datasets#6186
still i believe there is a better way to populate all the vram on the single GPU until gpu is full by creating multiple instances of the same model on the same GPU multiple times, and further repeating the same on all available GPUs in the server to maximize flops
hey @pacman100
the translation code could take days given there are multiple SFT datasets and multiple languages to translate on.
is there a way to accelerate the code by launching multiple instances of the model on the same GPU, and launching the same on multiple GPUs such that it can collectively speed up the translation task ?
The text was updated successfully, but these errors were encountered: