-
Hi! Yes, your approach to iteration is the right one. Regarding |
-
Hi!
I'm currently working on a tabular regression task and I'm using sklearn's `SGDRegressor`, which implements the `partial_fit` method.

I would like to ask if there is a simple way to iterate over the dataset (in batches, not single rows) and automatically split each batch into `X` and `y`, all without using the PyTorch `DataLoader`. Something like:

It would also be nice to automatically format the `X` and `y` batches as pandas and/or NumPy objects but, as far as I understand, at the moment this is only possible for map-style datasets.
At the moment, I am solving the issue as follows, but I'm wondering if there's a better way to do it:
For streaming datasets this works the same way, except that the batch is not converted to pandas (the conversion silently has no effect; no error is raised).
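For reference, the splitting step can be sketched independently of any particular dataset library. This is only a minimal sketch, assuming each batch arrives as a dict mapping column names to lists of values (the shape I believe batched iteration such as `Dataset.iter(batch_size=...)` yields in the `datasets` library). The helper `iter_xy_batches`, the generator `toy_batches`, and the column names `a`, `b`, and `target` are hypothetical names introduced for illustration.

```python
import numpy as np
from sklearn.linear_model import SGDRegressor


def iter_xy_batches(batches, feature_columns, target_column):
    """Yield (X, y) NumPy arrays from an iterable of column-oriented batches.

    Hypothetical helper: each batch is assumed to be a dict of
    column name -> list of values.
    """
    for batch in batches:
        # Stack the feature columns side by side: shape (batch_size, n_features).
        X = np.column_stack(
            [np.asarray(batch[name], dtype=float) for name in feature_columns]
        )
        y = np.asarray(batch[target_column], dtype=float)
        yield X, y


def toy_batches(columns, batch_size):
    """Stand-in for a real dataset iterator: yields dict batches of lists."""
    n_rows = len(columns["target"])
    for start in range(0, n_rows, batch_size):
        yield {k: v[start:start + batch_size].tolist() for k, v in columns.items()}


# Toy data: 10 rows with two features and a linear target.
rng = np.random.default_rng(0)
rows = {"a": rng.normal(size=10), "b": rng.normal(size=10)}
rows["target"] = 2.0 * rows["a"] - rows["b"]

model = SGDRegressor(random_state=0)
shapes = []
for X, y in iter_xy_batches(toy_batches(rows, 4), ["a", "b"], "target"):
    model.partial_fit(X, y)  # one incremental update per batch
    shapes.append((X.shape, y.shape))
```

Because the helper only needs an iterable of dict batches, the same loop should apply to map-style and streaming sources alike, as long as they yield batches in that shape.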
Thanks a lot!