Thank you for all the details! This is super helpful. We can try to reproduce, but I think it's not a bad idea to raise this directly on sqlglot as well, to ensure the parser can handle these types of statements in general.
Hey all!
Describe the bug
We have a dbt core project that targets a BigQuery data warehouse. When ingesting the metadata into DataHub with include_column_lineage=true and prefer_sql_parser_lineage=true, the column-level lineage (CLL) is missing for some of the datasets. The table-level lineage is fine for all of them.
We've tracked down the issue to a row deduplication macro from dbt_utils. The problem seems to be in the generated SQL code:
When replacing it with a simple SELECT * FROM all_articles statement, the CLL is generated correctly.
We're not sure if this is specific to DataHub or should be reported to sqlglot instead. Let me know and I can create the issue there.
To Reproduce
We have created a repository with a small dbt project to reproduce it: https://github.com/Starkie/datahub-dbt-lineage-repro
Steps to reproduce the behavior:
<> with the correct ones for your model:
dbt source snapshot-freshness
dbt build
cp target/run_results.json target/run_results_backup.json
dbt docs generate
cp target/run_results_backup.json target/run_results.json
Expected behavior
We expected the column-level lineage between the source table and the final model to be generated.
Actual behavior
Desktop (please complete the following information):
OS: Windows 11
Browser: Firefox
Version: 131.03
DataHub version: 0.14.1
DataHub CLI version: 0.14.1