add batch_size = 'all' option? #159

john-veillette · 2021-01-03T22:23:30Z

Hey devs, thanks for this package.

I see that **kwargs is being deprecated from .fit, which makes sense in the interest of making the API more consistent for grid search, etc. However, it still seems desirable to be able to specify batch_size based on the input dimensions in case users want to use vanilla rather than stochastic gradient descent. Maybe adding a batch_size = 'all' option in init would remove any need to specify batch_size in fit?

adriangb · 2021-01-03T23:14:33Z

Hi 👋, glad you're finding it useful!

Would all=X.shape[0]?

john-veillette · 2021-01-03T23:33:42Z

Yep! To replace the option to specify model.fit(X, y, batch_size = X.shape[0]) when fit(X, y, **kwargs) is removed, one could specify batch_size = 'all' on initialization. Should only take a few lines of code to implement.

adriangb · 2021-01-03T23:34:37Z

Agreed. Will leave this issue open to track.

john-veillette · 2021-01-03T23:35:24Z

Thanks!

stsievert · 2021-01-04T04:32:40Z

Thats a decent argument for keeping fit **kwargs. I think compatibility with Keras is important.

adriangb · 2021-02-15T06:32:11Z

Hey @john-veillette, I haven't implemented this yet because like @stsievert points out above, we may ended up keeping **kwargs.

That said, I do think there are some use cases where **kwargs doesn't cut it. The first that comes to mind is cross-val-score, where presumably the length of the dataset is changing but you have no control over the sizes / cannot pass `**kwargs.

For your use case, do you feel that you actually need batch_size="all" or could **kwargs work for you?

Edit: I did a sample implementation, for reference, in #194.

adriangb · 2021-02-16T15:15:17Z

In addition to the implementation in #194, this could also be implemented via #167. Reference implementation here: https://www.adriangb.com/scikeras/refs/pull/167/merge/notebooks/DataTransformers.html#7.-Dynamically-setting-batch_size

john-veillette · 2021-02-16T16:45:27Z

For my use case, **kwargs works just as well! That's how I currently have it implemented, anyhow.

adriangb · 2021-03-06T19:49:09Z

Implemented in #194

stsievert mentioned this issue Feb 15, 2021

Implement batch_size=-1 #194

Merged

adriangb mentioned this issue Feb 16, 2021

ENH: add dependency injection point to transform X & y together #167

Open

adriangb closed this as completed Mar 6, 2021

adriangb mentioned this issue Jul 10, 2021

Can't pickle trained model with callback to TensorBoard #236

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add batch_size = 'all' option? #159

add batch_size = 'all' option? #159

john-veillette commented Jan 3, 2021

adriangb commented Jan 3, 2021

john-veillette commented Jan 3, 2021

adriangb commented Jan 3, 2021

john-veillette commented Jan 3, 2021

stsievert commented Jan 4, 2021

adriangb commented Feb 15, 2021 •

edited

Loading

adriangb commented Feb 16, 2021

john-veillette commented Feb 16, 2021 •

edited

Loading

adriangb commented Mar 6, 2021

add batch_size = 'all' option? #159

add batch_size = 'all' option? #159

Comments

john-veillette commented Jan 3, 2021

adriangb commented Jan 3, 2021

john-veillette commented Jan 3, 2021

adriangb commented Jan 3, 2021

john-veillette commented Jan 3, 2021

stsievert commented Jan 4, 2021

adriangb commented Feb 15, 2021 • edited Loading

adriangb commented Feb 16, 2021

john-veillette commented Feb 16, 2021 • edited Loading

adriangb commented Mar 6, 2021

adriangb commented Feb 15, 2021 •

edited

Loading

john-veillette commented Feb 16, 2021 •

edited

Loading