-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support pyarrow large_list #7019
Merged
+643
−94
Merged
Changes from all commits
Commits
Show all changes
78 commits
Select commit
Hold shift + click to select a range
0545de4
Test polars round trip
albertvillanova 4f23eb0
Test Features.from_arrow_schema
albertvillanova c870450
Add large attribute to Sequence
albertvillanova 427f117
Update get_nested_type to support pa.large_list
albertvillanova 69f3548
Update generate_from_arrow_type to support pa.LargeListType
albertvillanova 9fdec4d
Fix typo
albertvillanova 84f3014
Rename test
albertvillanova 9ea8eaf
Merge remote-tracking branch 'upstream/main' into fix-6834-6984
albertvillanova df13687
Add require_polars to test
albertvillanova 9bc5182
Test from_polars large_list
albertvillanova 6345fdc
Merge remote-tracking branch 'upstream/main' into fix-6834-6984
albertvillanova 0d997cd
Update test array_cast with large list
albertvillanova d1bd580
Support large list in array_cast
albertvillanova 87bd7e3
Test cast_array_to_feature for large list
albertvillanova a772762
Support large list in cast_array_to_feature
albertvillanova 882d363
Merge remote-tracking branch 'upstream/main' into fix-6834-6984
albertvillanova 78b3a8f
Fix support large list in cast_array_to_feature
albertvillanova 300a5a9
Test save_to_disk with a dataset from polars with large_list
albertvillanova cd0901c
Test Features.reorder_fields_as with large Sequence
albertvillanova a2c7bd0
Fix Features.reorder_fields_as by using all Sequence params
albertvillanova d0e114c
Test save_to/load_from disk round trip with large_list dataset
albertvillanova 1f9f594
Test DatasetInfo.from_dict with large Sequence
albertvillanova a4eb288
Test Features to/from dict round trip with large Sequence
albertvillanova 9020ccf
Fix features generate_from_dict by using all Sequence params
albertvillanova 057d184
Remove debug comments
albertvillanova 8f3b02c
Test cast_array_to_feature with struct array
albertvillanova f6e528f
Fix cast_array_to_feature for struct array
albertvillanova 89d4366
Test cast_array_to_feature from/to the same Sequence feature dtype
albertvillanova eaf4c64
Fix cast_array_to_feature for the same Sequence feature dtype
albertvillanova 1f28c5f
Add more tests for dataset with large Sequence
albertvillanova 6f3604c
Merge branch 'main' into fix-6834-6984
albertvillanova 33a1a55
Remove Sequence.large
albertvillanova 6e6e9b7
Remove Sequence.large from tests
albertvillanova bfa8fae
Add LargeList to tests
albertvillanova 8215a61
Replace tests with Sequence.large with LargeList
albertvillanova 152d6dd
Replace Sequence.large with LargeList in test_dataset_info_from_dict
albertvillanova 632d1ea
Implement LargeList
albertvillanova 1f247bc
Test features to_yaml_list with LargeList
albertvillanova f08f216
Support LargeList in Features._to_yaml_list
albertvillanova a79e337
Test Features.from_dict with LargeList
albertvillanova b76aaa0
Support LargeList in Features.from_dict
albertvillanova a677143
Test Features from_yaml_list with LargeList
albertvillanova 31d22dd
Support LargeList in Features._from_yaml_list
albertvillanova 79772a6
Test get_nested_type with scalar/list features
albertvillanova af22e52
Support LargeList in get_nested_type
albertvillanova 0611fdc
Test generate_from_arrow_type with primitive/nested data types
albertvillanova a1eff5c
Support LargeList in generate_from_arrow_type
albertvillanova e72d8fe
Remove Sequence of dict from test cast_array_to_feature
albertvillanova bf646ac
Support LargeList in cast_array_to_feature
albertvillanova 78a9a78
Test Features.encode_example
albertvillanova 968364c
Test encode_nested_example with list types
albertvillanova 60465af
Support LargeList in encode_nested_example
albertvillanova 77aa27f
Test check_non_null_non_empty_recursive with list types
albertvillanova 19e9deb
Support LargeList in check_non_null_non_empty_recursive
albertvillanova b27a8a1
Test require_decoding with list types
albertvillanova 9ec883b
Support LargeList in require_decoding
albertvillanova ab8724b
Test decode_nested_example with list types
albertvillanova 30ba3bc
Support LargeList in decode_nested_example
albertvillanova b1a3db7
Test generate_from_dict with list types
albertvillanova b2a5789
Test Features.from_dict with list types
albertvillanova 7c39b51
Test _visit with list types
albertvillanova 48d143c
Support LargeList in _visit
albertvillanova 3968181
Test require_storage_cast with list types
albertvillanova 8e94ca0
Support LargeList in require_storage_cast
albertvillanova 40622e5
Refactor test_require_storage_cast_with_list_types
albertvillanova 1dea864
Test require_storage_embed with list types
albertvillanova c3bacba
Support LargeList in require_storage_embed
albertvillanova c055ff3
Fix test_features_reorder_fields_as
albertvillanova 823a049
Test Features.reorder_fields_as with list types
albertvillanova 45326a9
Test Features.reorder_fields_as with dict within list types
albertvillanova 9acf8d9
Support LargeList in Features.reorder_fields_as
albertvillanova 5c8646b
Test Features.flatten with list types
albertvillanova f11c56d
Test embed_array_storage with list types
albertvillanova 27d0f94
Support LargeList in embed_array_storage
albertvillanova 4821c24
Delete unused tf_utils.is_numeric_feature
albertvillanova 41f6068
Add LargeList docstring
albertvillanova bb6baf5
Add LargeList to main classes docs
albertvillanova 431694f
Address requested changes
albertvillanova File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
those changes are not necessary but I'm fine with keeping them
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I made them when implementing Sequence.large and decided to keep them for robustness in case we add some other attribute to Sequence in the future.