The current implementation requires a flat CSV input file in which one line corresponds to one entry (see the README).
However, most datasets are not flat: image datasets, for example, require the data to be stored in external files.
We should consider:
The shuffle process shuffles the lines of the input CSV file, but image datasets have few files that each contain a lot of information, so shuffling at the line level would be much less effective. The same applies to the split method.
How should those files be mounted to the various processes?
An effective way of parsing those files.
Memory allocation for large datasets.
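To make the shuffle, split, and memory points concrete, here is a minimal sketch of one possible approach, not the project's actual implementation. It assumes each external file holds a known number of entries, builds an index of (file path, position) pairs, shuffles and splits that index rather than the files themselves, and loads entries lazily. All names (`build_index`, `shuffle_and_split`, `iter_entries`, `load_entry`) are hypothetical.

```python
import random
from typing import Callable, Dict, Iterator, List, Sequence, Tuple

# One index item points at a single entry inside an external file.
IndexItem = Tuple[str, int]


def build_index(entries_per_file: Dict[str, int]) -> List[IndexItem]:
    """List every (file path, position) pair without loading any data."""
    return [(path, i) for path, n in entries_per_file.items() for i in range(n)]


def shuffle_and_split(
    index: Sequence[IndexItem], train_fraction: float = 0.8, seed: int = 0
) -> Tuple[List[IndexItem], List[IndexItem]]:
    """Shuffle at the entry level, then cut the index into two parts."""
    rng = random.Random(seed)  # seeded so the split is reproducible
    shuffled = list(index)     # copy, so the caller's index is left untouched
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def iter_entries(
    index: Sequence[IndexItem], load_entry: Callable[[str, int], object]
) -> Iterator[object]:
    """Yield entries one at a time so the full dataset never sits in memory."""
    for path, position in index:
        # load_entry is a placeholder for whatever format-specific reader is used.
        yield load_entry(path, position)
```

Because only the lightweight index is shuffled and split, entries from the same file can land in different subsets, and memory use is bounded by what `load_entry` reads for a single entry. How the files are mounted to each process and how they are parsed is left open here, as in the list above.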