Describe the bug
I have a dataset saved with save_to_disk. It is large (700 GB). The only way I have found to load it is load_from_disk, but that function copies the data to a tmp directory, so I run out of disk space. Is there an alternative that avoids the copy?
Steps to reproduce the bug
Load a dataset with load_from_disk after it has been saved with save_to_disk.
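To confirm that a copy into the temporary directory is what exhausts the disk, a check along these lines could be run before calling load_from_disk. It is only a minimal sketch using the Python standard library: the helper names dir_size_bytes and copy_would_fit are hypothetical, and it assumes (this is not confirmed in this thread) that the copy lands in the directory reported by tempfile.gettempdir().

```python
import os
import shutil
import tempfile


def dir_size_bytes(path):
    """Return the total size, in bytes, of all regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            # Skip symlinks so linked files are not counted.
            if not os.path.islink(file_path):
                total += os.path.getsize(file_path)
    return total


def copy_would_fit(dataset_dir, target_dir=None):
    """Check whether a full copy of dataset_dir would fit in target_dir.

    target_dir defaults to the system temporary directory, which is an
    assumption about where the copy is written.
    Returns a tuple (fits, needed_bytes, free_bytes).
    """
    if target_dir is None:
        target_dir = tempfile.gettempdir()
    needed = dir_size_bytes(dataset_dir)
    free = shutil.disk_usage(target_dir).free
    return needed <= free, needed, free
```

Calling copy_would_fit on the directory that was passed to save_to_disk returns whether a full copy would fit, along with the bytes needed and the bytes free. Python's tempfile module honours the TMPDIR environment variable, so if the copy really does go through tempfile, pointing TMPDIR at a larger volume may be a workaround; whether the library behaves this way is not established here.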
Expected behavior
The dataset should load without being copied to a tmp directory. Instead, loading runs out of disk space.
Environment info
latest version
Hello, this is an interesting issue. I have the same problem: I have a local dataset that I want to push to the Hub, but Hugging Face makes a copy of it.