Add new tables #27

wjones127 · 2022-11-18T05:26:02Z

Description

~~Will rebase after #25 is merged.~~

Gets some, but not all, of the tables in Read test cases checklist #7

How was this patch tested?

Does this require an update to the documentation?

wjones127 · 2022-11-19T04:14:25Z

Makefile

@@ -25,7 +25,8 @@ lint-bandit: ## Run bandit
 	@echo "\n${BLUE}Running bandit...${NC}\n"
 	@${POETRY_RUN} bandit -r ${PROJ}

-lint-base: lint-flake8 lint-bandit ## Just run the linters without autolinting
+#lint-base: lint-flake8 lint-bandit ## Just run the linters without autolinting
+lint-base: lint-flake8 # TODO: Can we drop bandit?


@edmondop is there a compelling reason to include bandit in this project? I found it was giving irrelevant warnings, and since this project doesn't deal with any non-public data, it's unclear to me why we care that much about security.

wjones127 · 2022-12-09T15:53:26Z

There are some test failures. I'll be on vacation for a week, so won't get around to finishing this until around December 20.

MrPowers · 2022-12-09T18:28:58Z

@wjones127 - alright cool, have a nice vacation!! Let's finish this up and get it merged when you're back!

wjones127 · 2022-12-20T21:27:09Z

dat/generated_tables.py

@@ -35,7 +40,7 @@ def save_expected(case: TestCaseInfo, as_latest=False) -> None:
    # Need to ensure directory exists first
    os.makedirs(case.expected_root(version))

-    df.toPandas().to_parquet(case.expected_path(version))


It turned out toPandas().to_parquet() causes weird things to happen to timestamps, so it's better to stick to Spark here. With that call gone, I was able to eliminate the pandas dependency as well.

wjones127 · 2022-12-20T21:28:02Z

dat/generated_tables.py

+def create_nested_types(case: TestCaseInfo, spark: SparkSession):
+    schema = types.StructType([
+        types.StructField(
+            'pk', types.IntegerType()


chispa doesn't support ignoring sort order when comparing tables that contain map types, so we have to add a pk column that tests can sort on. I've added a note about this to the readme.
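
For readers who want the idea in miniature, here is a rough plain-Python analogy of why a scalar `pk` column helps. It is only a sketch: dicts stand in for map-typed columns, the row data is made up, and `rows_match_ignoring_order` is a hypothetical helper, not part of chispa or Spark.

```python
# Plain-Python analogy only: dicts stand in for Spark map columns, and
# rows_match_ignoring_order is a hypothetical helper, not a chispa function.


def rows_match_ignoring_order(actual, expected, key="pk"):
    """Compare two lists of row dicts after sorting both on a scalar key column."""

    def by_key(row):
        return row[key]

    return sorted(actual, key=by_key) == sorted(expected, key=by_key)


# Made-up rows with a scalar pk column and a map-like column.
expected = [
    {"pk": 0, "map": {"a": 1}},
    {"pk": 1, "map": {"b": 2}},
    {"pk": 2, "map": {"c": 3}},
]

# Same rows in a different order, as a reader implementation might return them.
actual = [expected[2], expected[0], expected[1]]

# Map-like values have no natural ordering, so sorting on them directly fails.
try:
    sorted(row["map"] for row in actual)
    sortable_on_map = True
except TypeError:
    sortable_on_map = False

# Sorting on the scalar pk column gives both sides the same stable order.
result = rows_match_ignoring_order(actual, expected)
print(sortable_on_map, result)
```

The same pattern carries over to the real tests: sort both the actual and expected tables on `pk` first, then run an order-sensitive comparison.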

wjones127 force-pushed the new-tables branch from b223e53 to b4d29b3 on November 19, 2022 04:07


MrPowers self-requested a review November 21, 2022 16:07

MrPowers approved these changes Nov 21, 2022


wjones127 force-pushed the new-tables branch from b4d29b3 to 4c3c62b on December 4, 2022 21:25

wjones127 added 2 commits December 4, 2022 13:45

feat: new reader tests

c4d2292

more tables

1876315

wjones127 force-pushed the new-tables branch from 4c3c62b to 1876315 on December 4, 2022 21:46

fix: don't assume out exists anymore

9cab12b

fix remaining tests

b91f42b


wjones127 marked this pull request as ready for review December 20, 2022 21:32

wjones127 requested a review from MrPowers December 20, 2022 21:32

wjones127 merged commit 59a47b6 into delta-incubator:master Jan 12, 2023

wjones127 deleted the new-tables branch January 12, 2023 01:53
