Dbt-core - dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

dbt Labs

Last update: Jan 8, 2023

Related tags

Data Analysis slack analytics dbt-viewpoint pypa business-intelligence elt data-modeling

Overview

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Understanding dbt

Analysts using dbt can transform their data by simply writing select statements, while dbt handles turning these statements into tables and views in a data warehouse.

These select statements, or "models", form a dbt project. Models frequently build on top of one another – dbt makes it easy to manage relationships between models, and visualize these relationships, as well as assure the quality of your transformations through testing.

Getting started

Install dbt
Read the introduction and viewpoint

Join the dbt Community

Be part of the conversation in the dbt Community Slack
Read more on the dbt Community Discourse

Reporting bugs and contributing code

Want to report a bug or request a feature? Let us know on Slack, or open an issue
Want to help us build dbt? Check out the Contributing Guide

Code of Conduct

Everyone interacting in the dbt project's codebases, issue trackers, chat rooms, and mailing lists is expected to follow the dbt Code of Conduct.

Comments

[Feature] dbt should know about metrics
Is there an existing feature request for this?

[X] I have searched the existing issues

Describe the Feature

dbt should know about metrics. A metric is a timeseries aggregation over a table that supports zero or more dimensions. Some examples of metrics include:

active users

churn rate

mrr (monthly recurring revenue)

dbt should support metric definitions as a new node type. Like exposures, metrics participate in the dbt DAG and can be expressed in yaml files. By defining metrics in dbt projects, analytics engineers can encode crucial business logic in tested, version controlled code. Further, these metrics definitions can be exposed to downstream tooling to drive consistency and precision in metric reporting.

The ecosystem

There is some prior art for defining metrics in dbt projects. In particular, see

The MetricQL implementation (using meta fields)

The Lightdash implementation (using meta fields)

While these two implementations differ (measures vs. metrics), more on that below, there exists a need in the community for a first-class way to define these metrics in dbt code. It is really neat to see that some folks have already made these definitions possible with dbt, but it would better if metrics were treated as well-defined nodes with field validation and helpful utilities inside of dbt Core.

Specification

A metric is a timeseries aggregation over a table that supports zero or more dimensions. These metrics can be encoded in schema.yml files. In the example below, a new_customers metric is defined as a count of customer records created in a given time grain.

# models/marts/product/schema.yml version: 2 models: - name: dim_customers ... metrics: - name: new_customers label: New Customers model: dim_customers description: "The number of paid customers who are using the product" type: count sql: user_id # superflous here, but shown as an example timestamp: signup_date time_grains: [day, week, month] dimensions: - plan - country filters: - field: is_paying value: true meta: {}

Given this information, a downstream process (or a dbt macro!) can generate a sql SELECT statement that correctly calculates this metric with a specified time grain and set of dimensions. Here is a breakdown of supported fields:

| Field | Description | Example | Required? | |-------------|-------------------------------------------------------------|---------------------------------|-----------| | name | A unique identifier for the metric | new_customers | yes | | model | The dbt model that powers this metric | dim_customers | yes | | label | A short for name / label for the metric | New Customers | no | | description | Long form, human-readable description for the metric | The number of customers who.... | no | | type | The type of calculation to perform when evaluating a metric | count_distinct | yes | | sql | The expression to aggregate/calculate over | user_id | yes | | timestamp | The time-based component of the metric | signup_date | yes | | time_grains | One or more "grains" at which the metric can be evaluated | [day, week, month] | yes | | dimensions | A list of dimensions to group or filter the metric by | [plan, country] | no | | filters | A list of filters to apply before calculating the metric | See below | no | | meta | Arbitrary key/value store | {team: Finance} | no |

Model reference

A reference to a dbt model. This model may be any "materialized" model, or a reference to an ephemeral model. Direct table references are not allowed, and alternate node types (seeds, snapshots) are not supported.

Metric types

The following metric types should be supported:

count

count_distinct

sum

average

min

max

In the future, alternative metric types (ratios, deltas, etc) should be supported in this model.

Filters

Filters should be defined as a list of dictionaries that define predicates for the metric. Filters are ANDed together. If more complex filtering is required, users can (and should) push that logic down into the underlying model.

filters: - field: is_paying value: true

Functional requirements

Metrics should participate in the dbt DAG as a distinct node type

Metric nodes should be accessible in the dbt Core compilation context via:

the graph.metrics variable

one or more accessor functions like metrics.find_by_name('...') (exact mechanism TBD)

Metric nodes should be emitted into the manifest.json artifact

Metrics should work with partial parsing

Metric nodes should be supported in node selection and should be selectable with the metric: selector

When listing nodes, existing graph operators (+, &, etc) should be supported

(in a different issue) Metrics should be surfaced in the dbt Docs website

dbt Core should not, itself, evaluate or calculate metrics. Instead, dbt Core should expose the definition of metrics to downstream tools or packages for evaluation and analysis. In this way, it is critical that dbt Core provides hooks into metrics that can be leveraged in both macro code, or by processes that consume dbt Core manifest.json files.

Describe alternatives you've considered

Don't implement metrics as distinct node types and keep encoding them in meta properties:

This information is untyped and semantically unrepresented in dbt, so it would be a net-improvement to instead create a first-class node type in dbt Core for these logical DAG nodes

Metrics vs. Measures

Metrics are strongly-typed objects. It is extremely common to see folks perform syntactically correct but semantically meaningless calculations over data. This looks like averaging an average, or adding two distinct counts together. You get a number back... but it's not a useful or meaningful result.

To that end, I think we should start with metrics instead of measures. The difference here (and maybe a strawperson of my own creation - tell me if you think so) is that measures are untyped aggregations, whereas metrics are rigorously defined summaries over well-defined datasets. The creation of metrics does not preclude us from teaching dbt about more generic types of aggregations in the future, but I'd prefer to start with a narrow set of functionality and expand over time. It is easy to remove constraints, but it is hard to add them 🙂

Include support for joins

Joins make metric calculations really complicated. dbt should absolutely know about foreign key relationships (outside of the existing relationships test) in the future, but this would be a meaningful expansion of scope for our first cut of this feature

While these joins would be semantically useful, they are not a blocker to defining metrics today. Join logic can be pushed down into model code (whether materialized or ephemeral). We should experiment with this single table paradigm, see how it feels, and then consider the best approaching for teaching dbt about semantic joins in the future.

Where metrics are defined

Should metrics be a property of a model? While that could be functional today, I think this would make it hard to extend metrics to work with joins (see above). Instead, declaring metrics as independent nodes that participate in the DAG is a more future-proof idea, and we'd probably do well to avoid the "patching" flow required to get schema tests (properties of models today) translated into their own independent nodes in the DAG.

Inheriting configuration from models

Should metrics be namespaced under a model? This would make it possible to define some "shared" properties for all of the metrics derived from a model (eg. valid dimensions, the time field, supported time grain). This would be good for ergonomics, but not a huge value-add IMO. I'd like to keep this simple for the initial implementation and then make decisions like this with some more information from the community around example use-cases.

Example:

metrics: - model: dim_customers # dimensions are shared for all metrics defined in terms of this model dimensions: - country - plan definitions: - name: new_customers - name: churned_customers

SQL calculations in dimensions

Should dimensions be allowed to provide arbitrary SQL expressions? I don't think so — that SQL is best encoded in model code, and it would be confusing and dissonant to break up dimension definitions across SQL and yaml files.

Example:

metrics: - name: dim_customers # This logic should be represented in the underlying model dimensions: - field: plan_type sql: case when plan in ('pro', 'extra pro') then 'paid' else 'free' end

Who will this benefit?

Analytics engineers - As with models, AEs will be able to define metric logic under version control. By colocating model and metric code, new metrics or changes to existing metrics can be made in a tested, versioned, documented, code reviewed environment - Further, dbt Core's built in lineage can surface information about how changes to an upstream model may impact a downstream metric

BI/Analytics tooling (and therein, data consumers) - Organizations use metrics to understand performance and make decisions. To that end, the correctness and precision of these metrics is really paramount! By defining metrics rigorously under version control, and then exposing their definitions globally, dbt Core can help ensure consistency in reporting.

The data ecosystem - There are so many tools, both existing and yet to be created, that can benefit from an open source mechanism for defining a semantic model on top of the data warehouse. I believe that this business logic is just too valuable and strategically important for end-users to be locked up in proprietary tooling. To that end, this feature, and future types of semantic logic like this, should be addressable in an open source way

Are you interested in contributing this feature?

I sure am :)

Anything else?

From around the internet:

https://basecase.vc/blog/headless-bi

https://benn.substack.com/p/the-data-os

https://benn.substack.com/p/metrics-layer

enhancement metrics
opened by drewbanin 48
Feature/dbt deps tarball
resolves #4205

add new dbt.deps type: url to internally hosted tarball #420

Continued from https://github.com/dbt-labs/dbt-core/pull/4220

Revision 3 added Nov 6 2022

Proposed solution for feature request 4205

Description

Enable direct linking to tarball urls in packages.yml, for example:

# manufactured test, since you'd want to use hub to install these # public tarball used here as example only! # this would usually be a tarball hosted on an internal network packages: - tarball: https://codeload.github.com/dbt-labs/dbt-utils/tar.gz/0.6.5 name: 'dbt_utils_065'

Rational:

dbt projects being self hosted on larger enterprise environments often don't have a connection to the internet (dbt hubs won't work).

dbt users on larger enterprise environments like to build internal private packages for non-public use (help out other dbt users in company with specific functionality)

git package install is not a good option at scale for larger enterprise environments

internal file hosting service (such as internal artifactory service or internal cloud storage buckets) can be easily configured to host packages for install during deployment, so lets give dbt users a way to install from a direct tar file link

Sketching out doc changes here: https://github.com/timle2/docs.getdbt.com/blob/dbt-docs-tarball-package-updates/website/docs/docs/building-a-dbt-project/package-management.md#tar-files

Checklist

[x] I have signed the CLA

[x] I have run this code in development and it appears to resolve the stated issue

[x] This PR includes tests, or tests are not required/relevant for this PR

[x] I have run changie new to create a changelog entry

[x] I have opened an issue to add/update docs~, or docs changes are not required/relevant for this PR~

https://github.com/dbt-labs/docs.getdbt.com/issues/2474

cla:yes Team:Language ready_for_review
opened by timle2 43
Gets columns to update from config for BQ and Snowflake
resolves #1862

Description

Incremental models currently default to updating all columns. For databases that support merge statements, this PR allows the user to pass in an update_columns config parameter to selectively update only a subset of columns by replacing the call to adapter.get_columns_in_relation in the materialization for BigQuery and Snowflake.

Checklist

[x] I have signed the CLA

[x] I have run this code in development and it appears to resolve the stated issue

[x] This PR includes tests, or tests are not required/relevant for this PR

[x] I have updated the CHANGELOG.md and added information about my change to the "dbt next" section.

cla:yes
opened by prratek 43
Adding Full Refresh on schema change option for model config
Goal

This is a work in progress Pull Request for the implementation of the following feature for DBT: https://github.com/fishtown-analytics/dbt/issues/1132

The specific aspect of the feature is for incremental merge models to have a config option that allows them to more gracefully handle schema changes in either the source or target models. through the on_schema_change optional config option, you can either set fail and dbt will throw an exception before it attempts to merge, or full_refresh and dbt will do a full_refresh mode run of the model if it detects a schema change.

Logic for schema change detection

The new config on_schema_change calls the adapter function has_schema_changed(). The function makes the following checks between the temp_relation and the target_relation generated when an incremental merge is attempted.

First compare length of temp_relation schema and target_relation schema, if one is longer, there are more rows and therefore the schemas do not match.

Test to see if all temp_relation column names appear in the target_relation column names, if not then a column name has changed and the schemas are different.

Test to see if the datatypes in temp_relation columns are different from target_relation. Only checks for the named type, not precision (so character varying(5) is only compared as character). Since dbt will handle type resolution only check for a breaking error.

Test to see if there is a target_relation column that does not appear in the temp_relation.

if these tests fail, no schema change detected

NOTE: Logging has been added to explain what changes were detected and show the schema of the temp and target relations.

Approach

The general approach I took for implementing the "on schema change" feature is to:

Add a new configuration key on_schema_change to the appropriate contract - core/dbt/contracts/graph/parsed.py

Create a new SchemaChangeException that will be thrown when a schema change is detected and should be failed - core/dbt/exceptions.py

Add a new abstract method to hold the logic for detecting a schema change for each type of DB - core/dbt/adapters/base/impl.py

Add a concrete implementation for the sql adapter - core/dbt/adapters/sql/impl.py

Add on_schema_change logic to sql adapter incremental materialization macro

Add a concrete implementation for the snowflake adapter -plugins/snowflake/dbt/adapters/snowflake/impl.py

Add on_schema_change logic to snowflake adapter incremental materialization macro

[WIP] Integration tests for multiple configuration setups

[Not started] Working unit tests and style cleaned up
opened by cfraleig 40
Future art for exposures
Describe the feature

What other properties should exposures get?

[x] tags

[x] meta

we do want some traditional meta fields (e.g. owner) to be required or top-level

still could be nice as a catch-all for structured key-value properties users would want to define, beyond what'd be available via tags or description

[ ] new type options:

"reverse pipeline", e.g. a census sync

users supplying their own string types

[ ] new maturity options

higher than high? "mission-critical"

...

Should exposures be ref-able?

exposures that depend on other exposures: one exposure for each Mode query / Looker view, one exposure for the dashboard that depends on those queries / views

models that depend on exposures: modeled input to data science --> data science as exposure --> modeled output

what exactly would ref('my_exposure') return?

Describe alternatives you've considered

We're likely to keep these bare-bones for a little while. I'm still curious to hear what community members want!

Who will this benefit?

Users of the exposures resource type, which is new in v0.18.1
exposures
opened by jtcohen6 34

[CT-346] [Bug] Snapshot never succeeds (but does all the work successfully)

Is there an existing issue for this?

[x] I have searched the existing issues

Current Behavior

When i run dbt snapshot my snapshot model is run, but the cli never moves on to the following task. As far as i can tell all the snapshot work is done successfully (the snapshot table is appended to etc).

This happens both locally and on dbt cloud. Exact same behaviour.

Expected Behavior

I expect the cli to either successfully take me back to an empty prompt, or continue with the next command.

Steps To Reproduce

Setup a snapshot model like this:

% snapshot snap__transaction_records_state %}

{{
    config(
      description='Snapshot of transaction records state for accounting automation project. Started 2022-02-21.',
      target_database='transfer-galaxy',
      target_schema='dbt_snapshots',
      unique_key='tx_id',
      strategy='check',
      check_cols='all',
      partition_by={'field': 'tx_date_day', 'data_type': 'date'},
    )
}}

select
    tx_id,
    date(tx_time_cet) as tx_date_day,
    tx_state_enum,
    tx_state,
from
    {{ ref('src_portal__transaction_records') }}

{% endsnapshot %}

Run "dbt snapshot'
Get terminal output like:

11:12:50  Running with dbt=1.0.3
11:12:50  Found 205 models, 100 tests, 1 snapshot, 13 analyses, 392 macros, 0 operations, 5 seed files, 80 sources, 1 exposure, 0 metrics
11:12:50  
11:12:51  Concurrency: 16 threads (target='default')
11:12:51  
11:12:51  1 of 1 START snapshot dbt_snapshots.snap__transaction_records_state............. [RUN]

Wait an eternity (until a timeout of some sort, in dbt cloud i think it's 13 hours), until the job is cancelled.

Relevant log output

Log output from dbt cloud for "dbt snapshot" here:

2022-03-10 11:12:50.561223 (MainThread): 11:12:50  Running with dbt=1.0.3
2022-03-10 11:12:50.561843 (MainThread): 11:12:50  running dbt with arguments Namespace(cls=<class 'dbt.task.snapshot.SnapshotTask'>, debug=None, defer=None, event_buffer_size=None, exclude=None, fail_fast=None, log_cache_events=False, log_format=None, partial_parse=None, printer_width=None, profile='user', profiles_dir='/tmp/jobs/47162168/.dbt', project_dir=None, record_timing_info=None, rpc_method='snapshot', select=None, selector_name=None, send_anonymous_usage_stats=None, single_threaded=False, state=None, static_parser=None, target='default', threads=None, use_colors=None, use_experimental_parser=None, vars='{}', version_check=None, warn_error=None, which='snapshot', write_json=None)
2022-03-10 11:12:50.562060 (MainThread): 11:12:50  Tracking: tracking
2022-03-10 11:12:50.571622 (MainThread): 11:12:50  Sending event: {'category': 'dbt', 'action': 'invocation', 'label': 'start', 'context': [<snowplow_tracker.self_describing_json.SelfDescribingJson object at 0x7f49a8616a90>, <snowplow_tracker.self_describing_json.SelfDescribingJson object at 0x7f49a8616850>, <snowplow_tracker.self_describing_json.SelfDescribingJson object at 0x7f49a8616c70>]}
2022-03-10 11:12:50.701971 (MainThread): 11:12:50  Partial parsing enabled: 0 files deleted, 0 files added, 0 files changed.
2022-03-10 11:12:50.702302 (MainThread): 11:12:50  Partial parsing enabled, no changes found, skipping parsing
2022-03-10 11:12:50.723178 (MainThread): 11:12:50  Sending event: {'category': 'dbt', 'action': 'load_project', 'label': '35add917-e38d-4443-a411-d0171182a210', 'context': [<snowplow_tracker.self_describing_json.SelfDescribingJson object at 0x7f4991fe4e50>]}
2022-03-10 11:12:50.951696 (MainThread): 11:12:50  Sending event: {'category': 'dbt', 'action': 'resource_counts', 'label': '35add917-e38d-4443-a411-d0171182a210', 'context': [<snowplow_tracker.self_describing_json.SelfDescribingJson object at 0x7f49a8719e50>]}
2022-03-10 11:12:50.952118 (MainThread): 11:12:50  Found 205 models, 100 tests, 1 snapshot, 13 analyses, 392 macros, 0 operations, 5 seed files, 80 sources, 1 exposure, 0 metrics
2022-03-10 11:12:50.964288 (MainThread): 11:12:50  
2022-03-10 11:12:50.964778 (MainThread): 11:12:50  Acquiring new bigquery connection "master"
2022-03-10 11:12:50.965712 (ThreadPoolExecutor-0_0): 11:12:50  Acquiring new bigquery connection "list_transfer-galaxy"
2022-03-10 11:12:50.966081 (ThreadPoolExecutor-0_0): 11:12:50  Opening a new connection, currently in state init
2022-03-10 11:12:51.221515 (ThreadPoolExecutor-1_0): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_risk_scoring"
2022-03-10 11:12:51.222081 (ThreadPoolExecutor-1_1): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_forecasting"
2022-03-10 11:12:51.222541 (ThreadPoolExecutor-1_0): 11:12:51  Opening a new connection, currently in state closed
2022-03-10 11:12:51.222941 (ThreadPoolExecutor-1_2): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_staging_zendesk"
2022-03-10 11:12:51.223551 (ThreadPoolExecutor-1_3): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_snapshots"
2022-03-10 11:12:51.223754 (ThreadPoolExecutor-1_1): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.224255 (ThreadPoolExecutor-1_4): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_staging_airtable"
2022-03-10 11:12:51.224911 (ThreadPoolExecutor-1_5): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_behavioural"
2022-03-10 11:12:51.225430 (ThreadPoolExecutor-1_6): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_seeds"
2022-03-10 11:12:51.225943 (ThreadPoolExecutor-1_2): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.226308 (ThreadPoolExecutor-1_7): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_referral"
2022-03-10 11:12:51.227004 (ThreadPoolExecutor-1_8): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_playground"
2022-03-10 11:12:51.227548 (ThreadPoolExecutor-1_9): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_staging"
2022-03-10 11:12:51.228091 (ThreadPoolExecutor-1_10): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_staging_accounting"
2022-03-10 11:12:51.228691 (ThreadPoolExecutor-1_11): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_accounting"
2022-03-10 11:12:51.229038 (ThreadPoolExecutor-1_3): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.229424 (ThreadPoolExecutor-1_12): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_outgoing"
2022-03-10 11:12:51.230062 (ThreadPoolExecutor-1_13): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_marts"
2022-03-10 11:12:51.230479 (ThreadPoolExecutor-1_4): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.230918 (ThreadPoolExecutor-1_14): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_events"
2022-03-10 11:12:51.231544 (ThreadPoolExecutor-1_15): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_intermediary"
2022-03-10 11:12:51.232549 (ThreadPoolExecutor-1_5): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.232995 (ThreadPoolExecutor-1_6): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.233441 (ThreadPoolExecutor-1_7): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.233739 (ThreadPoolExecutor-1_8): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.234178 (ThreadPoolExecutor-1_9): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.235224 (ThreadPoolExecutor-1_10): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.235596 (ThreadPoolExecutor-1_11): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.236978 (ThreadPoolExecutor-1_12): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.237314 (ThreadPoolExecutor-1_13): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.238763 (ThreadPoolExecutor-1_14): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.240000 (ThreadPoolExecutor-1_15): 11:12:51  Opening a new connection, currently in state init
2022-03-10 11:12:51.509916 (ThreadPoolExecutor-1_11): 11:12:51  Acquiring new bigquery connection "list_transfer-galaxy_dbt_staging_soluno"
2022-03-10 11:12:51.510480 (ThreadPoolExecutor-1_11): 11:12:51  Opening a new connection, currently in state closed
2022-03-10 11:12:51.664519 (MainThread): 11:12:51  Concurrency: 16 threads (target='default')
2022-03-10 11:12:51.664816 (MainThread): 11:12:51  
2022-03-10 11:12:51.667539 (Thread-18): 11:12:51  Began running node snapshot.transfergalaxy.snap__transaction_records_state
2022-03-10 11:12:51.667922 (Thread-18): 11:12:51  1 of 1 START snapshot dbt_snapshots.snap__transaction_records_state............. [RUN]
2022-03-10 11:12:51.668325 (Thread-18): 11:12:51  Acquiring new bigquery connection "snapshot.transfergalaxy.snap__transaction_records_state"
2022-03-10 11:12:51.668513 (Thread-18): 11:12:51  Began compiling node snapshot.transfergalaxy.snap__transaction_records_state
2022-03-10 11:12:51.668694 (Thread-18): 11:12:51  Compiling snapshot.transfergalaxy.snap__transaction_records_state
2022-03-10 11:12:51.673363 (Thread-18): 11:12:51  finished collecting timing info
2022-03-10 11:12:51.673586 (Thread-18): 11:12:51  Began executing node snapshot.transfergalaxy.snap__transaction_records_state
2022-03-10 11:12:51.692717 (Thread-18): 11:12:51  Opening a new connection, currently in state closed
2022-03-10 11:12:51.923759 (Thread-18): 11:12:51  On snapshot.transfergalaxy.snap__transaction_records_state: /* {"app": "dbt", "dbt_version": "1.0.3", "profile_name": "user", "target_name": "default", "node_id": "snapshot.transfergalaxy.snap__transaction_records_state"} */

    select CURRENT_TIMESTAMP() as snapshot_start
  
2022-03-10 11:12:52.774374 (Thread-18): 11:12:52  On snapshot.transfergalaxy.snap__transaction_records_state: /* {"app": "dbt", "dbt_version": "1.0.3", "profile_name": "user", "target_name": "default", "node_id": "snapshot.transfergalaxy.snap__transaction_records_state"} */
select * from (
            



select
    tx_id,
    date(tx_time_cet) as tx_date_day,
    tx_state_enum,
    tx_state,
from
    `transfer-galaxy`.`dbt_staging`.`src_portal__transaction_records`

        ) as __dbt_sbq
        where false
        limit 0
    
2022-03-10 11:12:53.703297 (Thread-18): 11:12:53  On snapshot.transfergalaxy.snap__transaction_records_state: /* {"app": "dbt", "dbt_version": "1.0.3", "profile_name": "user", "target_name": "default", "node_id": "snapshot.transfergalaxy.snap__transaction_records_state"} */
select * from (
            select * from `transfer-galaxy`.`dbt_snapshots`.`snap__transaction_records_state`
        ) as __dbt_sbq
        where false
        limit 0
    
2022-03-10 11:12:55.098683 (Thread-18): 11:12:55  On snapshot.transfergalaxy.snap__transaction_records_state: /* {"app": "dbt", "dbt_version": "1.0.3", "profile_name": "user", "target_name": "default", "node_id": "snapshot.transfergalaxy.snap__transaction_records_state"} */

        

  create or replace table `transfer-galaxy`.`dbt_snapshots`.`snap__transaction_records_state__dbt_tmp`
  partition by tx_date_day
  
  OPTIONS(
      expiration_timestamp=TIMESTAMP_ADD(CURRENT_TIMESTAMP(), INTERVAL 12 hour)
    )
  as (
    with snapshot_query as (

        



select
    tx_id,
    date(tx_time_cet) as tx_date_day,
    tx_state_enum,
    tx_state,
from
    `transfer-galaxy`.`dbt_staging`.`src_portal__transaction_records`


    ),

    snapshotted_data as (

        select *,
            tx_id as dbt_unique_key

        from `transfer-galaxy`.`dbt_snapshots`.`snap__transaction_records_state`
        where dbt_valid_to is null

    ),

    insertions_source_data as (

        select
            *,
            tx_id as dbt_unique_key,
            TIMESTAMP("2022-03-10 11:12:52.097775+00:00") as dbt_updated_at,
            TIMESTAMP("2022-03-10 11:12:52.097775+00:00") as dbt_valid_from,
            nullif(TIMESTAMP("2022-03-10 11:12:52.097775+00:00"), TIMESTAMP("2022-03-10 11:12:52.097775+00:00")) as dbt_valid_to,
            to_hex(md5(concat(coalesce(cast(tx_id as string), ''), '|',coalesce(cast(TIMESTAMP("2022-03-10 11:12:52.097775+00:00") as string), '')))) as dbt_scd_id

        from snapshot_query
    ),

    updates_source_data as (

        select
            *,
            tx_id as dbt_unique_key,
            TIMESTAMP("2022-03-10 11:12:52.097775+00:00") as dbt_updated_at,
            TIMESTAMP("2022-03-10 11:12:52.097775+00:00") as dbt_valid_from,
            TIMESTAMP("2022-03-10 11:12:52.097775+00:00") as dbt_valid_to

        from snapshot_query
    ),

    insertions as (

        select
            'insert' as dbt_change_type,
            source_data.*

        from insertions_source_data as source_data
        left outer join snapshotted_data on snapshotted_data.dbt_unique_key = source_data.dbt_unique_key
        where snapshotted_data.dbt_unique_key is null
           or (
                snapshotted_data.dbt_unique_key is not null
            and (
                (snapshotted_data.tx_id != source_data.tx_id
        or
        (
            ((snapshotted_data.tx_id is null) and not (source_data.tx_id is null))
            or
            ((not snapshotted_data.tx_id is null) and (source_data.tx_id is null))
        ) or snapshotted_data.tx_date_day != source_data.tx_date_day
        or
        (
            ((snapshotted_data.tx_date_day is null) and not (source_data.tx_date_day is null))
            or
            ((not snapshotted_data.tx_date_day is null) and (source_data.tx_date_day is null))
        ) or snapshotted_data.tx_state_enum != source_data.tx_state_enum
        or
        (
            ((snapshotted_data.tx_state_enum is null) and not (source_data.tx_state_enum is null))
            or
            ((not snapshotted_data.tx_state_enum is null) and (source_data.tx_state_enum is null))
        ) or snapshotted_data.tx_state != source_data.tx_state
        or
        (
            ((snapshotted_data.tx_state is null) and not (source_data.tx_state is null))
            or
            ((not snapshotted_data.tx_state is null) and (source_data.tx_state is null))
        ))
            )
        )

    ),

    updates as (

        select
            'update' as dbt_change_type,
            source_data.*,
            snapshotted_data.dbt_scd_id

        from updates_source_data as source_data
        join snapshotted_data on snapshotted_data.dbt_unique_key = source_data.dbt_unique_key
        where (
            (snapshotted_data.tx_id != source_data.tx_id
        or
        (
            ((snapshotted_data.tx_id is null) and not (source_data.tx_id is null))
            or
            ((not snapshotted_data.tx_id is null) and (source_data.tx_id is null))
        ) or snapshotted_data.tx_date_day != source_data.tx_date_day
        or
        (
            ((snapshotted_data.tx_date_day is null) and not (source_data.tx_date_day is null))
            or
            ((not snapshotted_data.tx_date_day is null) and (source_data.tx_date_day is null))
        ) or snapshotted_data.tx_state_enum != source_data.tx_state_enum
        or
        (
            ((snapshotted_data.tx_state_enum is null) and not (source_data.tx_state_enum is null))
            or
            ((not snapshotted_data.tx_state_enum is null) and (source_data.tx_state_enum is null))
        ) or snapshotted_data.tx_state != source_data.tx_state
        or
        (
            ((snapshotted_data.tx_state is null) and not (source_data.tx_state is null))
            or
            ((not snapshotted_data.tx_state is null) and (source_data.tx_state is null))
        ))
        )
    )

    select * from insertions
    union all
    select * from updates

  );
    
2022-03-10 11:13:00.608568 (Thread-18): 11:13:00  BigQuery adapter: Adding columns ([]) to table `transfer-galaxy`.`dbt_snapshots`.`snap__transaction_records_state`".
2022-03-10 11:13:01.249736 (Thread-18): 11:13:01  Writing runtime SQL for node "snapshot.transfergalaxy.snap__transaction_records_state"
2022-03-10 11:13:01.250481 (Thread-18): 11:13:01  On snapshot.transfergalaxy.snap__transaction_records_state: /* {"app": "dbt", "dbt_version": "1.0.3", "profile_name": "user", "target_name": "default", "node_id": "snapshot.transfergalaxy.snap__transaction_records_state"} */

      merge into `transfer-galaxy`.`dbt_snapshots`.`snap__transaction_records_state` as DBT_INTERNAL_DEST
    using `transfer-galaxy`.`dbt_snapshots`.`snap__transaction_records_state__dbt_tmp` as DBT_INTERNAL_SOURCE
    on DBT_INTERNAL_SOURCE.dbt_scd_id = DBT_INTERNAL_DEST.dbt_scd_id

    when matched
     and DBT_INTERNAL_DEST.dbt_valid_to is null
     and DBT_INTERNAL_SOURCE.dbt_change_type in ('update', 'delete')
        then update
        set dbt_valid_to = DBT_INTERNAL_SOURCE.dbt_valid_to

    when not matched
     and DBT_INTERNAL_SOURCE.dbt_change_type = 'insert'
        then insert (`tx_id`, `tx_date_day`, `tx_state_enum`, `tx_state`, `dbt_updated_at`, `dbt_valid_from`, `dbt_valid_to`, `dbt_scd_id`)
        values (`tx_id`, `tx_date_day`, `tx_state_enum`, `tx_state`, `dbt_updated_at`, `dbt_valid_from`, `dbt_valid_to`, `dbt_scd_id`)


  
2022-03-10 11:13:06.113359 (Thread-18): 11:13:06  On snapshot.transfergalaxy.snap__transaction_records_state: /* {"app": "dbt", "dbt_version": "1.0.3", "profile_name": "user", "target_name": "default", "node_id": "snapshot.transfergalaxy.snap__transaction_records_state"} */
drop table if exists `transfer-galaxy`.`dbt_snapshots`.`snap__transaction_records_state__dbt_tmp`
2022-03-10 11:13:06.726671 (Thread-18): 11:13:06  finished collecting timing info
2022-03-10 11:13:06.727242 (Thread-18): 11:13:06  Sending event: {'category': 'dbt', 'action': 'run_model', 'label': '35add917-e38d-4443-a411-d0171182a210', 'context': [<snowplow_tracker.self_describing_json.SelfDescribingJson object at 0x7f499065b280>]}
2022-03-10 14:11:42.039645 (MainThread): 14:11:42  [33mThe bigquery adapter does not support query cancellation. Some queries may still be running![0m
2022-03-10 14:11:42.040546 (MainThread): 14:11:42  
2022-03-10 14:11:42.040856 (MainThread): 14:11:42  [33mExited because of keyboard interrupt.[0m
2022-03-10 14:11:42.041085 (MainThread): 14:11:42  
2022-03-10 14:11:42.041310 (MainThread): 14:11:42  Done. PASS=0 WARN=0 ERROR=0 SKIP=0 TOTAL=0
2022-03-10 14:11:42.041545 (MainThread): 14:11:42  Connection 'master' was properly closed.
2022-03-10 14:11:42.041723 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_risk_scoring' was properly closed.
2022-03-10 14:11:42.041904 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_forecasting' was properly closed.
2022-03-10 14:11:42.042059 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_staging_zendesk' was properly closed.
2022-03-10 14:11:42.042208 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_snapshots' was properly closed.
2022-03-10 14:11:42.042409 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_staging_airtable' was properly closed.
2022-03-10 14:11:42.042577 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_behavioural' was properly closed.
2022-03-10 14:11:42.042729 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_seeds' was properly closed.
2022-03-10 14:11:42.042878 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_referral' was properly closed.
2022-03-10 14:11:42.043024 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_playground' was properly closed.
2022-03-10 14:11:42.043170 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_staging' was properly closed.
2022-03-10 14:11:42.043315 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_staging_accounting' was properly closed.
2022-03-10 14:11:42.043459 (MainThread): 14:11:42  Connection 'snapshot.transfergalaxy.snap__transaction_records_state' was properly closed.
2022-03-10 14:11:42.043603 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_outgoing' was properly closed.
2022-03-10 14:11:42.043746 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_marts' was properly closed.
2022-03-10 14:11:42.043890 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_events' was properly closed.
2022-03-10 14:11:42.044032 (MainThread): 14:11:42  Connection 'list_transfer-galaxy_dbt_intermediary' was properly closed.
2022-03-10 14:11:42.044218 (MainThread): 14:11:42  Flushing usage events
2022-03-10 14:11:42.065240 (MainThread):

Environment

- OS: macOS Big Sur version 11.1
- Python: 3.7.9
- dbt: 1.0

What database are you using dbt with?

bigquery

Additional Context

Snapshot model code:

`{% snapshot snap__transaction_records_state %}

{{ config( description='Snapshot of transaction records state for accounting automation project. Started 2022-02-21.', target_database='transfer-galaxy', target_schema='dbt_snapshots', unique_key='tx_id', strategy='check', check_cols='all', partition_by={'field': 'tx_date_day', 'data_type': 'date'}, ) }}

select tx_id, date(tx_time_cet) as tx_date_day, tx_state_enum, tx_state, from {{ ref('src_portal__transaction_records') }}

{% endsnapshot %} `

bug Team:Execution

opened by MaxKrog 32

Include hard-deletes when making snapshot

Continues the work of the closed PR #2355, which resolves #249.

Description

The idea is to select ids from the snapshotted table which no longer exist in the source table, and update these to set dbt_valid_to to the current timestamp. Marking them as hard deleted.

As of now, I didn't write any integration tests yet as I'm having difficulties running them locally. Any pointers on how these work would be nice. Running integration test (postgres) locally currently fails, but I'm unsure why.

scheduling tests via LoadScheduling

test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleSnapshotFiles::test__postgres_ref_snapshot 
test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleSnapshotFiles::test__postgres__simple_snapshot 
test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestCustomSnapshotFiles::test__postgres__simple_custom_snapshot 
test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleColumnSnapshotFiles::test_postgres_renamed_source 
[gw1] PASSED test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleSnapshotFiles::test__postgres_ref_snapshot 
test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestNamespacedCustomSnapshotFiles::test__postgres__simple_custom_snapshot_namespaced 
[gw2] FAILED test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleColumnSnapshotFiles::test_postgres_renamed_source 

============================================================================================ FAILURES =============================================================================================
___________________________________________________________________ TestSimpleColumnSnapshotFiles.test_postgres_renamed_source ____________________________________________________________________
[gw2] linux -- Python 3.6.9 /usr/app/.tox/integration-postgres-py36/bin/python

self = <test_simple_snapshot.TestSimpleColumnSnapshotFiles testMethod=test_postgres_renamed_source>

    @use_profile('postgres')
    def test_postgres_renamed_source(self):
>       self._run_snapshot_test()

test/integration/004_simple_snapshot_test/test_simple_snapshot.py:158: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
test/integration/004_simple_snapshot_test/test_simple_snapshot.py:135: in _run_snapshot_test
    self.run_dbt(['snapshot', '--vars', '{seed_name: seed_newcol}'])
test/integration/base.py:578: in run_dbt
    "dbt exit state did not match expected")
E   AssertionError: False != True : dbt exit state did not match expected
-------------------------------------------------------------------------------------- Captured logbook call --------------------------------------------------------------------------------------
[DEBUG] dbt: Acquiring new postgres connection "__test".
[DEBUG] dbt: Acquiring new postgres connection "__test".
[DEBUG] dbt: test connection "__test" executing: DROP SCHEMA IF EXISTS "test15998113637441561482_simple_snapshot_004" CASCADE
[DEBUG] dbt: Opening a new connection, currently in state init
[DEBUG] dbt: On __test: Close
[DEBUG] dbt: Acquiring new postgres connection "__test".
[DEBUG] dbt: Acquiring new postgres connection "__test".
[DEBUG] dbt: test connection "__test" executing: CREATE SCHEMA "test15998113637441561482_simple_snapshot_004"
[DEBUG] dbt: Opening a new connection, currently in state closed
[DEBUG] dbt: On __test: Close
[INFO] dbt: Invoking dbt with ['--strict', '--test-new-parser', 'seed', '--profiles-dir', '/tmp/dbt-int-test-1dn56bze', '--log-cache-events']
[INFO] dbt: Invoking dbt with ['--strict', '--test-new-parser', 'snapshot', '--profiles-dir', '/tmp/dbt-int-test-1dn56bze', '--log-cache-events']
[DEBUG] dbt: Acquiring new postgres connection "__test".
[DEBUG] dbt: test connection "__test" executing: select * from dbt.test15998113637441561482_simple_snapshot_004.my_snapshot
[DEBUG] dbt: Opening a new connection, currently in state init
[DEBUG] dbt: On __test: Close
[INFO] dbt: Invoking dbt with ['--strict', '--test-new-parser', 'snapshot', '--vars', '{seed_name: seed_newcol}', '--profiles-dir', '/tmp/dbt-int-test-1dn56bze', '--log-cache-events']
[DEBUG] dbt: Acquiring new postgres connection "__test".
[DEBUG] dbt: Acquiring new postgres connection "__test".
[DEBUG] dbt: test connection "__test" executing: DROP SCHEMA IF EXISTS "test15998113637441561482_simple_snapshot_004" CASCADE
[DEBUG] dbt: Opening a new connection, currently in state closed
[DEBUG] dbt: On __test: Close
[DEBUG] dbt: Connection '__test' was properly closed.
===================================================================================== slowest test durations ======================================================================================
9.03s call     test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleColumnSnapshotFiles::test_postgres_renamed_source
6.44s call     test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleSnapshotFiles::test__postgres_ref_snapshot

(0.00 durations hidden.  Use -vv to show these durations.)
===================================================================================== short test summary info =====================================================================================
FAILED test/integration/004_simple_snapshot_test/test_simple_snapshot.py::TestSimpleColumnSnapshotFiles::test_postgres_renamed_source - AssertionError: False != True : dbt exit state did not m...
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! xdist.dsession.Interrupted: stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
============================================================================= 1 failed, 1 passed in 60.78s (0:01:00) ==============================================================================
ERROR: InvocationError for command /bin/bash -c '/usr/app/.tox/integration-postgres-py36/bin/python -m pytest --durations 0 -v -m profile_postgres -s -x -m profile_postgres test/integration/004_simple_snapshot_test/test_simple_snapshot.py -n4 test/integration/*' (exited with code 2)
_____________________________________________________________________________________________ summary _____________________________________________________________________________________________
ERROR:   integration-postgres-py36: commands failed

Checklist

[x] I have signed the CLA
[x] I have run this code in development and it appears to resolve the stated issue
[x] This PR includes tests, or tests are not required/relevant for this PR
[x] I have updated the CHANGELOG.md and added information about my change to the "dbt next" section.

cla:yes

opened by joelluijmes 27

Propagate docs to the database

Feature

We should propagate docs to BQ table and column descriptions.

Feature description

Cool that dbt generates a web page and docs for your project. Unfortunately, it's a bit removed from where our org is used to looking for documentation. We use BQ extensively and rely on the table and column descriptions provided in the BQ UI.

Who will this benefit?

Anyone using BQ would benefit.
enhancement

opened by jakebiesinger 26
Pass schema in credentials to Postgresql

There is a property called schema in Postgresql credentials but it looks like DBT doesn't actually use it probably because Psycopg2 doesn't provide a native way to pass schema to Postgresql.

It's a bit tricky but there's actually a way to do that via options of libpq. I believe that we should support this feature since we have the schema parameter in Postgresql configuration.

opened by buremba 25
Wider google-cloud dependencies
resolves #2794

Description

As discussed in the issue. I can narrow the range of these as per that issue; figured I'd ensure tests pass before fixing the versions.

Checklist

[ ] I have signed the CLA

[ ] I have run this code in development and it appears to resolve the stated issue

[x] This PR includes tests, or tests are not required/relevant for this PR

[ ] I have updated the CHANGELOG.md and added information about my change to the "dbt next" section.

cla:yes
opened by max-sixty 24

Indefinite Snowflake auth loop when using browser authentication

Describe the bug

We encounter indefinite auth loop while running dbt with browser auth enabled

Steps To Reproduce

Install dbt 0.17.2 and run anything on Snowflake

Expected behavior

Expected two auth requests in case of MFA and dbt proceeding to transform. (non-MFA logins fail into the loop too)

Screenshots and log output

(dbt-transformations) <wiped out>@<wiped out> dbt-transformations % dbt run --models <wiped out>
Running with dbt=0.17.2
Found 129 models, 135 tests, 3 snapshots, 0 analyses, 140 macros, 0 operations, 1 seed file, 53 sources

Initiating login request with your identity provider. A browser window should have opened for you to complete the login. If you can't see it, check existing browser windows, or your OS settings. Press CTRL+C to abort and try again...
Initiating login request with your identity provider. A browser window should have opened for you to complete the login. If you can't see it, check existing browser windows, or your OS settings. Press CTRL+C to abort and try again...
Initiating login request with your identity provider. A browser window should have opened for you to complete the login. If you can't see it, check existing browser windows, or your OS settings. Press CTRL+C to abort and try again...
Initiating login request with your identity provider. A browser window should have opened for you to complete the login. If you can't see it, check existing browser windows, or your OS settings. Press CTRL+C to abort and try again...
Initiating login request with your identity provider. A browser window should have opened for you to complete the login. If you can't see it, check existing browser windows, or your OS settings. Press CTRL+C to abort and try again...
^C^C^Cctrl-c

System information

Which database are you using dbt with?

[ ] postgres
[ ] redshift
[ ] bigquery
[x] snowflake
[ ] other (specify: ____________)

The output of dbt --version:

installed version: 0.17.2
   latest version: 0.17.2

Up to date!

Plugins:
  - bigquery: 0.17.2
  - snowflake: 0.17.2
  - redshift: 0.17.2
  - postgres: 0.17.2

The operating system you're using: MacOS 10.15.5 (19F101)

The output of python --version: Python 3.7.7

Additional context

reverting dbt to 0.17.1 fixed the issue

bug

opened by eugene-nikolaev 24

[CT-1758] [Bug] dbt process is never reaped，occur a large number of zombie processes.
Is this a new bug in dbt-core?

[X] I believe this is a new bug in dbt-core

[X] I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

Every time I execute the dbt project, a python zombie process will be added.

I also found that other people used the dbt project to have this problem（https://github.com/python/cpython/issues/88887）. It may be a python problem, but I still want to confirm that dbt is also caused by this? Is there any good solution?

and i try to add threads=1, but It still appears zombie process.

Expected Behavior

occur a large number of zombie processes.

Steps To Reproduce

run dbt project by golang cmd.Command().

Relevant log output

No response

Environment

- OS: CentOS Linux release 7.9.2009 - Python: 3.9.13 - dbt:1.3.0

Which database adapter are you using with dbt?

postgres

Additional Context

No response
bug triage
opened by abeizn 0
[DO NOT MERGE] test cl/merge-main-click-cli branch

Opening this just for purposes of sanity testing the merge main to click branch.

Confirming that unit and integration tests (including new ones we're bringing in from main) would pass without major/unpredicted modifications to the feature/click-cli branch using the old CLI. Explanations of any changes necessary to get tests passing in the comments.

Note: mypy/flake8 errors are a result of some quick and dirty fixes here, nothing concerning.
cla:yes

opened by MichelleArk 1
[CT-1753] [Bug] Allow using agate 1.7.1 due to serious vulnerability in future
Is this a new bug in dbt-core?

[X] I believe this is a new bug in dbt-core

[X] I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

It has been recently found that the future package has a serious vulnerability and can lead to a denial of service (see details here https://nvd.nist.gov/vuln/detail/CVE-2022-40899). The current version of dbt-core is indirectly depended on it. The current version of dbt-core specifies the dependency on the agate package to be constrained: "agate>=1.6,<1.7.1" but agate <1.7.1 depended on the version parsedatetime that in its turn depended on future. The future repository has been dead since 2019 (latest release) and parsedatetime dropped the dependency on it: https://github.com/bear/parsedatetime/releases/tag/v2.5. agate reacted to that and also updated its version to 1.7.1 to use the latest version of parsedatetime without the future dependency: https://github.com/wireservice/agate/commit/52198daae198389649a44deb0ec2d1d41f6720c1.

Please, update your dependency on agate to allow version 1.7.1 and by doing so get rid of the dependency on future.

Expected Behavior

The dependency on agate is updated to allow version 1.7.1.

Steps To Reproduce

.

Relevant log output

No response

Environment

- OS: - Python: - dbt:

Which database adapter are you using with dbt?

No response

Additional Context

No response
bug dependencies security support_rotation
opened by piankris 2
[CT-1752] [Bug] Small typos related to the interactive profile set up
Is this a new bug in dbt-core?

[X] I believe this is a new bug in dbt-core

[X] I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

I believe interative is a typo of interactive

https://github.com/dbt-labs/dbt-core/blob/54538409509e0d677876f1102aa3bc0c67007278/core/dbt/cli/params.py#L272-L274 https://github.com/dbt-labs/dbt-core/blob/60f80056b1b09b35a22ac91c93319827bdac0e0e/core/dbt/main.py#L348-L356 https://github.com/dbt-labs/dbt-core/blob/b3440417ad1e38a40c488c3632a63f5d2e0c88f4/core/dbt/docs/build/html/index.html#L321-L325

Expected Behavior

Fix typos:

interative to interactive

Steps To Reproduce

Run dbt init --h

Relevant log output

N/A

Environment

- OS:Ubuntu 22.04.1 - Python: 3.10.6 - dbt: 1.1.3

Which database adapter are you using with dbt?

No response

Additional Context

N/A
bug good_first_issue
opened by nshuman1 0
[CT-1751] Config to optionally skip population of relation cache
Users should have the ability to turn off relation cache population, if they really need to. It should still be "on" by default.

$ dbt --no-populate-cache [run|test|...] $ DBT_POPULATE_CACHE=0|1 dbt [run|test|...]

# profiles.yml (UserConfig) config: populate_cache: true|false

This is different from entirely disabling or skipping over the cache — we're just skipping the population of the cache on startup. When dbt needs to run caching queries, I think it should still report "cache miss," and then cache the result of the metadata query, if it needs to be used again in the same invocation.

Where

https://github.com/dbt-labs/dbt-core/blob/54538409509e0d677876f1102aa3bc0c67007278/core/dbt/task/runnable.py#L407-L421

Who is it for?

Users in large projects, or of data warehouses that are quite slow at running metadata queries.

Users running into inexplicable issues with specific relations not showing up in the relation cache (https://github.com/dbt-labs/dbt-core/issues/6050)

Interactive compile & preview (#6358, #6359), which need to be blazing-fast

YMMV

End users will need to experiment with the approach that's most efficient for them, between:

full cache enabled

"cache selected only" (docs)

--no-populate-cache

I expect mileage may vary between dev, CI, and prod environments.

Questions

Would this break behavior around --defer, which expects to use the relation cache to determine if model X already exists in the dev schema, or should have its reference rewritten to use the schema defined in the other manifest?

https://github.com/dbt-labs/dbt-core/blob/54538409509e0d677876f1102aa3bc0c67007278/core/dbt/contracts/graph/manifest.py#L1018

Imagining a future where interactive compile/preview want to be both very fast, and able to correctly leverage --defer: We should also think more about making the adapter cache pluggable, as something that can live & persist outside of a single dbt-core invocation. It would be the responsibility of that other application wrapping dbt-core to handle cache invalidation (fun!).
enhancement Team:Execution Team:Adapters adapter_caching
opened by jtcohen6 0
Update agate requirement from <1.7.1,>=1.6 to >=1.6,<1.7.2 in /core
Updates the requirements on agate to permit the latest version.

Changelog

Sourced from agate's changelog.

1.7.1 - Jan 4, 2023

Allow parsedatetime 2.6.

1.7.0 - Jan 3, 2023

Add Python 3.11 support.

Add Python 3.10 support.

Drop Python 3.6 support (end-of-life was December 23, 2021).

Drop Python 2.7 support (end-of-life was January 1, 2020).

1.6.3 - July 15, 2021

feat: :meth:.Table.from_csv accepts a row_limit keyword argument. (#740)

feat: :meth:.Table.from_json accepts an encoding keyword argument. (#734)

feat: :meth:.Table.print_html accepts a max_precision keyword argument, like :meth:.Table.print_table. (#753)

feat: :class:.TypeTester accepts a null_values keyword argument, like individual data types. (#745)

feat: :class:.Min, :class:.Max and :class:.Sum (#735) work with :class:.TimeDelta.

feat: :class:.FieldSizeLimitError includes the line number in the error message. (#681)

feat: :class:.csv.Sniffer warns on error while sniffing CSV dialect.

fix: :meth:.Table.normalize works with basic processing methods. (#691)

fix: :meth:.Table.homogenize works with basic processing methods. (#756)

fix: :meth:.Table.homogenize casts compare_values and default_row. (#700)

fix: :meth:.Table.homogenize accepts tuples. (#710)

fix: :meth:.TableSet.group_by accepts input with no rows. (#703)

fix: :class:.TypeTester warns if a column specified by the force argument is not in the table, instead of raising an error. (#747)

fix: Aggregations return None if all values are None, instead of raising an error. Note that Sum, MaxLength and MaxPrecision continue to return 0 if all values are None. (#706)

fix: Ensure files are closed when errors occur. (#734)

build: Make PyICU an optional dependency.

Drop Python 3.5 support (end-of-life was September 13, 2020).

Drop Python 3.4 support (end-of-life was March 18, 2019).

1.6.2 - March 10, 2021

feat: :meth:.Date.__init__ and :meth:.DateTime.__init__ accepts a locale keyword argument (e.g. :code:en_US) for parsing formatted dates. (#730)

feat: :meth:.Number.cast casts True to 1 and False to 0. (#733)

fix: :meth:.utils.max_precision ignores infinity when calculating precision. (#726)

fix: :meth:.Date.cast catches OverflowError when type testing. (#720)

Included examples in Python package. (#716)

1.6.1 - March 11, 2018

feat: :meth:.Table.to_json can use Decimal as keys. (#696)

fix: :meth:.Date.cast and :meth:.DateTime.cast no longer parse non-date strings that contain date sub-strings as dates. (#705)

docs: Link to tutorial now uses version through Sphinx to avoid bad links on future releases. (#682)

... (truncated)

Commits

52198da build: Allow parsedatetime 2.6

cd63792 docs: Update changelog

32cce43 docs: Update badges

a1e2aaf docs: Update changelog

4d5267e ci: No duplicate builds

ec184b7 Merge pull request #772 from wireservice/python311

a82b43c chore: Fix misconversion from c83d726d535dcec08b0c5b1301b8671068c25b12

7945f1a chore: Simplify super() calls and elif/else after returns

9abb426 chore: Remove use of u''

c83d726 chore: Remove Python 2 code

Additional commits viewable in compare view

You can trigger a rebase of this PR by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

dependencies cla:yes python
opened by dependabot[bot] 3

Releases(v1.2.4)

v1.2.4(Jan 5, 2023)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.2.4.tar.gz(772.22 KB)
dbt-postgres-1.2.4.tar.gz(13.65 KB)
dbt_core-1.2.4-py3-none-any.whl(853.95 KB)
dbt_postgres-1.2.4-py3-none-any.whl(16.29 KB)
v1.1.3(Jan 5, 2023)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.1.3.tar.gz(758.06 KB)
dbt-postgres-1.1.3.tar.gz(11.93 KB)
dbt_core-1.1.3-py3-none-any.whl(832.15 KB)
dbt_postgres-1.1.3-py3-none-any.whl(12.72 KB)
v1.0.9(Jan 5, 2023)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.0.9.tar.gz(745.75 KB)
dbt-postgres-1.0.9.tar.gz(11.97 KB)
dbt_core-1.0.9-py3-none-any.whl(817.98 KB)
dbt_postgres-1.0.9-py3-none-any.whl(12.78 KB)
v1.3.2(Jan 4, 2023)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.3.2.tar.gz(796.79 KB)
dbt-postgres-1.3.2.tar.gz(14.03 KB)
dbt_core-1.3.2-py3-none-any.whl(882.64 KB)
dbt_postgres-1.3.2-py3-none-any.whl(17.19 KB)
v1.3.2rc1(Jan 4, 2023)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.3.2rc1.tar.gz(796.87 KB)
dbt-postgres-1.3.2rc1.tar.gz(14.01 KB)
dbt_core-1.3.2rc1-py3-none-any.whl(882.67 KB)
dbt_postgres-1.3.2rc1-py3-none-any.whl(17.22 KB)
v1.4.0b1(Dec 15, 2022)
dbt-core 1.4.0-b1 - December 15, 2022

Features

Added favor-state flag to optionally favor state nodes even if unselected node exists (#2968)

Update structured logging. Convert to using protobuf messages. Ensure events are enriched with node_info. (#5610)

Friendlier error messages when packages.yml is malformed (#5486)

Migrate dbt-utils current_timestamp macros into core + adapters (#5521)

Allow partitions in external tables to be supplied as a list (#5929)

extend -f flag shorthand for seed command (#5990)

This pulls the profile name from args when constructing a RuntimeConfig in lib.py, enabling the dbt-server to override the value that's in the dbt_project.yml (#6201)

Adding tarball install method for packages. Allowing package tarball to be specified via url in the packages.yaml. (#4205)

Added an md5 function to the base context (#6246)

Exposures support metrics in lineage (#6057)

Add support for Python 3.11 (#6147)

incremental predicates (#5680)

Fixes

Account for disabled flags on models in schema files more completely (#3992)

Add validation of enabled config for metrics, exposures and sources (#6030)

check length of args of python model function before accessing it (#6041)

Add functors to ensure event types with str-type attributes are initialized to spec, even when provided non-str type params. (#5436)

Allow hooks to fail without halting execution flow (#5625)

Clarify Error Message for how many models are allowed in a Python file (#6245)

After this, will be possible to use default values for dbt.config.get (#6309)

Use full path for writing manifest (#6055)

[CT-1284] Change Python model default materialization to table (#6345)

Repair a regression which prevented basic logging before the logging subsystem is completely configured. (#6434)

Docs

minor doc correction (dbt-docs/#5791)

Generate API docs for new CLI interface (dbt-docs/#5528)

(dbt-docs/#5880)

Fix rendering of sample code for metrics (dbt-docs/#323)

Alphabetize core/dbt/README.md (dbt-docs/#6368)

Under the Hood

Put black config in explicit config (#5946)

Added flat_graph attribute the Manifest class's deepcopy() coverage (#5809)

Add mypy configs so mypy passes from CLI (#5983)

Exception message cleanup. (#6023)

Add dmypy cache to gitignore (#6028)

Provide useful errors when the value of 'materialized' is invalid (#5229)

Clean up string formatting (#6068)

Fixed extra whitespace in strings introduced by black. (#1350)

Remove the 'root_path' field from most nodes (#6171)

Combine certain logging events with different levels (#6173)

Convert threading tests to pytest (#5942)

Convert postgres index tests to pytest (#5770)

Convert use color tests to pytest (#5771)

Add github actions workflow to generate high level CLI API docs (#5942)

Functionality-neutral refactor of event logging system to improve encapsulation and modularity. (#6139)

Consolidate ParsedNode and CompiledNode classes (#6383)

Prevent doc gen workflow from running on forks (#6386)

Fix intermittent database connection failure in Windows CI test (#6394)

Refactor and clean up manifest nodes (#6426)

Restore important legacy logging behaviors, following refactor which removed them (#6437)

Dependencies

Update pathspec requirement from ~=0.9.0 to >=0.9,<0.11 in /core (#5917)

Bump black from 22.8.0 to 22.10.0 (#6019)

Bump mashumaro[msgpack] from 3.0.4 to 3.1.1 in /core (#6108)

Update colorama requirement from <0.4.6,>=0.3.9 to >=0.3.9,<0.4.7 in /core (#6144)

Bump mashumaro[msgpack] from 3.1.1 to 3.2 in /core (#4904)

Contributors

@andy-clapson (dbt-docs/#5791)

@chamini2 (#6041)

@daniel-murray (#2968)

@dave-connors-3 (#5990)

@dbeatty10 (dbt-docs/#6368, #6394)

@devmessias (#6309)

@eve-johns (#6068)

@haritamar (#6246)

@jared-rimmer (#5486)

@josephberni (#2968)

@joshuataylor (#6147)

@justbldwn (#6245)

@luke-bassett (#1350)

@max-sixty (#5946, #5983, #6028)

@paulbenschmidt (dbt-docs/#5880)

@pgoslatara (#5929)

@racheldaniel (#6201)

@timle2 (#4205)

@dave-connors-3 (#5680)

Source code(tar.gz)
Source code(zip)
dbt-core-1.4.0b1.tar.gz(804.56 KB)
dbt-postgres-1.4.0b1.tar.gz(14.02 KB)
dbt_core-1.4.0b1-py3-none-any.whl(891.33 KB)
dbt_postgres-1.4.0b1-py3-none-any.whl(17.22 KB)
v1.3.1(Nov 16, 2022)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.3.1.tar.gz(796.53 KB)
dbt-postgres-1.3.1.tar.gz(14.04 KB)
dbt_core-1.3.1-py3-none-any.whl(882.36 KB)
dbt_postgres-1.3.1-py3-none-any.whl(17.19 KB)
v1.2.3(Nov 16, 2022)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.2.3.tar.gz(771.95 KB)
dbt-postgres-1.2.3.tar.gz(13.64 KB)
dbt_core-1.2.3-py3-none-any.whl(853.69 KB)
dbt_postgres-1.2.3-py3-none-any.whl(16.29 KB)
v1.3.0(Oct 12, 2022)
dbt-core 1.3.0 - Edgar Allan Poe (October 12, 2022)

Breaking Changes

Renaming Metric Spec Attributes (#5774, #5775)

Features

Add --defer flag to dbt compile & dbt docs generate (#4110, #4514)

Python model inital version (#5261, #5421)

allows user to include the file extension for .py models in the dbt run -m command. (#5289, #5295)

Incremental materialization refactor and cleanup (#5245, #5359)

Python models can support incremental logic (#0, #35)

Add reusable function for retrying adapter connections. Utilize said function to add retries for Postgres (and Redshift). (#5022, #5432)

merge_exclude_columns for incremental materialization (#5260, #5457)

add exponential backoff to connection retries on Postgres (and Redshift) (#5502, #5503)

use MethodName.File when value ends with .csv (#5578, #5581)

Make docs configurable in dbt_project.yml and add a node_color attribute to change the color of nodes in the DAG (#5333, #5397)

Adding ResolvedMetricReference helper functions and tests (#5567, #5607)

Check dbt-core version requirements when installing Hub packages (#5648, #5651)

Search current working directory for profiles.yml (#5411, #5717)

Adding the window parameter to the metric spec. (#5721, #5722)

Add invocation args dict to ProviderContext class (#5524, #5782)

Adds new cli framework (#5526, #5647)

Flags work with new Click CLI (#5529, #5790)

Add metadata env method to ProviderContext class (#5522, #5794)

Array macros (#5520, #5823)

Add enabled config to exposures and metrics (#5422, #5815)

Migrate dbt-utils current_timestamp macros into core + adapters (#5521, #5838)

add -fr flag shorthand (#5878, #5879)

add type_boolean as a data type macro (#5739, #5875)

Support .dbtignore in project root to ignore certain files being read by dbt (#5733, #5897)

This conditionally no-ops warehouse connection at compile depending on an env var, disabling introspection/queries during compilation only. This is a temporary solution to more complex permissions requirements for the semantic layer. (#5936, #5926)

Fixes

Remove the default 256 characters limit on postgres character varying type when no limitation is set (#5238, #5292)

Include schema file config in unrendered_config (#5338, #5344)

Add context to compilation errors generated while rendering generic test configuration values. (#5294, #5393)

Resolves #5351 - Do not consider shorter varchar cols as schema changes (#5351, #5395)

Rename try to strict for more intuitiveness (#5475, #5477)

on_shchma_change fail verbosity enhancement (#5504, #5505)

Ignore empty strings passed in as secrets (#5312, #5518)

Fix handling of top-level exceptions (#5564, #5560)

Fix error rendering docs block in metrics description (#5585, #5603)

Extended validations for the project names (#5379, #5620)

Use sys.exit instead of exit (#5621, #5627)

Finishing logic upgrade to Redshift for name truncation collisions. (#5586, #5656)

multiple args for ref and source (#5634, #5635)

Fix Unexpected behavior when chaining methods on dbt-ref'ed/sourced dataframes (#5646, #5677)

Fix typos of comments in core/dbt/adapters/ (#5690, #5693)

Include py.typed in MANIFEST.in. This enables packages that install dbt-core from pypi to use mypy. (#5703, #5703)

Removal of all .coverage files when using make clean command (#5633, #5759)

Remove temp files generated by unit tests (#5631, #5749)

Fix warnings as errors during tests (#5424, #5800)

Prevent event_history from holding references (#5848, #5858)

Account for disabled flags on models in schema files more completely (#3992, #5868)

ConfigSelectorMethod should check for bools (#5890, #5889)

shorthand for full refresh should be one character (#5878, #5908)

Fix macro resolution order during static analysis for custom generic tests (#5720, #5907)

Fix race condition when invoking dbt via lib.py concurrently (#5919, #5921)

check length of args of python model function before accessing it (#6041, #6042)

Docs

Update dependency inline-source from ^6.1.5 to ^7.2.0 (dbt-docs/#299, dbt-docs/#291)

Update dependency jest from ^26.2.2 to ^28.1.3 (dbt-docs/#299, dbt-docs/#291)

Update dependency underscore from ^1.9.0 to ^1.13.4 (dbt-docs/#299, dbt-docs/#291)

Update dependency webpack-cli from ^3.3.12 to ^4.7.0 (dbt-docs/#299, dbt-docs/#291)

Update dependency webpack-dev-server from ^3.1.11 to ^4.9.3 (dbt-docs/#299, dbt-docs/#291)

Searches no longer require perfect matches, and instead consider each word individually. my model or model my will now find my_model, without the need for underscores (dbt-docs/#143, dbt-docs/#145)

Support the renaming of SQL to code happening in dbt-core (dbt-docs/#299, dbt-docs/#292)

Leverages docs.node_color from dbt-core to color nodes in the DAG (dbt-docs/#44, dbt-docs/#281)

Refer to exposures by their label by default. (dbt-docs/#306, dbt-docs/#307)

Under the Hood

Added language to tracked fields in run_model event (#5571, #5469)

Update mashumaro to 3.0.3 (#4940, #5118)

Add python incremental materialization test (#0000, #5571)

Save use of default env vars to manifest to enable partial parsing in those cases. (#5155, #5589)

add more information to log line interop test failures (#5658, #5659)

Add supported languages to materializations (#5569, #5695)

Migrate integration test 014 but also fix the snapshot hard delete test's timezone logic and force all integration tests to run flags.set_from_args to force environment variables are accessible to all integration test threads. (#5760, #5760)

Support dbt-metrics compilation by rebuilding flat_graph (#5525, #5786)

Reworking the way we define the window attribute of metrics to match freshness tests (#5722, #5793)

Add PythonJobHelper base class in core and add more type checking (#5802, #5802)

The link did not go to the anchor directly, now it does (#5813, #5814)

remove key as reserved keyword from test_bool_or (#5817, #5818)

Convert default selector tests to pytest (#5728, #5820)

Compatibiltiy for metric attribute renaming (#5807, #5825)

remove source quoting setting in adapter tests (#5836, #5839)

Add name validation for metrics (#5456, #5841)

Validate exposure name and add label (#5606, #5844)

Adding validation for metric expression attribute (#5871, #5873)

Profiling and Adapter Management work with Click CLI (#5531, #5892)

Reparse references to deleted metric (#5444, #5920)

Dependencies

Upgrade to Jinja2==3.1.2 from Jinja2==2.11.3 (#4748, #5465)

Bump mypy from 0.961 to 0.971 (#4904, #5495)

Remove pin for MarkUpSafe from >=0.23,<2.1 (#5506, #5507)

Dependency

Bump python from 3.10.5-slim-bullseye to 3.10.6-slim-bullseye in /docker (#4904, #5623)

Bump mashumaro[msgpack] from 3.0.3 to 3.0.4 in /core (#4904, #5649)

Bump black from 22.6.0 to 22.8.0 (#4904, #5750)

Bump python from 3.10.6-slim-bullseye to 3.10.7-slim-bullseye in /docker (#4904, #5805)

Contributors

@Goodkat (#5581, #5518, #5620)

@Ilanbenb (#5505)

@b-per (#5397, dbt-docs/#281)

@bbroeksema (#5749)

@callum-mcdata (#5775, #5607, #5722, #5793, #5825, #5873)

@danielcmessias (#5889)

@dave-connors-3 (#5457, #5879, #5908)

@dbeatty10 (#5717, #5823)

@drewbanin (#5921, dbt-docs/#292)

@epapineau (#5395)

@graciegoheen (#5823)

@jared-rimmer (#5782, #5794, #5759)

@jeremyyeo (#5477)

@joellabes (dbt-docs/#145)

@jpmmcneill (#5875)

@kadero (#4514)

@leoebfolsom (#5295)

@matt-winkler (#5397, dbt-docs/#281)

@nicholasyager (#5393)

@panasenco (#5703)

@racheldaniel (#5926)

@sdebruyn (#5814, #5818, #5839)

@shrodingers (#5292)

@sungchun12 (#5397, dbt-docs/#281)

@tomasfarias (#5432)

@varun-dc (#5627)

@yoiki (#5693)

@chamini2 (#6042)

Edgar Allan Poe (1809 – 1849)

An homage to Edgar Allan Poe—a famous Philadelphian (for a time), and the namesake of this dbt Core release.

Once upon a midday yawning, which I’d squandered, pip and pawning, Over many a strange and spurious module of forgotten code— While I squinted, much still missing, suddenly there came a hissing, As of something softly sneaking, sliding through my headphone port— “’Tis some static," I muttered, "hissing through my headphone port—
                Only this and nothing more.” Ah, to recount distinctly, it was in eerie October; And each separate spooling thread kept spinning as the seconds soared. Eagerly I wished ‘twould finish;—vainly I had sought to diminish The time they took in so running—running toward the fateful Core— For the well-grained factful marts whom the analysts name as “Core”—
                Dimless here for evermore. Out now I unplugged the cable, when, with first a hiss and then hooray, There appeared a wisened Python of the quaint old days of yore; Not the least salutation made it; not a minute stopped or stayed it; But, with look of staff or sov’reign, coiled beside my headphone port— Coiled upon an O’Reilly just beside my headphone port—
                Coiled, and stared, and nothing more. Then this wry ophidian, beguiling my surprise into smiling, By the sparkling winking of its reptilian esprit, “Though thy eye be bright and shining, thou,” I said, “art sure a wise one, Novel new and clever Python slithering through Production’s quay— Tell me what has orchestrated thee on this Day’s Production spree!"                 Quoth the Python, “dbt.”
Source code(tar.gz)
Source code(zip)
dbt-core-1.3.0.tar.gz(795.95 KB)
dbt-postgres-1.3.0.tar.gz(14.01 KB)
dbt_core-1.3.0-py3-none-any.whl(882.12 KB)
dbt_postgres-1.3.0-py3-none-any.whl(17.19 KB)
v1.3.0rc2(Oct 3, 2022)
dbt-core 1.3.0-rc2 - October 03, 2022

Features

Migrate dbt-utils current_timestamp macros into core + adapters (#5521, #5838)

Fixes

Account for disabled flags on models in schema files more completely (#3992, #5868)

Source code(tar.gz)
Source code(zip)
dbt-core-1.3.0rc2.tar.gz(795.95 KB)
dbt-postgres-1.3.0rc2.tar.gz(14.02 KB)
dbt_core-1.3.0rc2-py3-none-any.whl(882.15 KB)
dbt_postgres-1.3.0rc2-py3-none-any.whl(17.22 KB)
v1.2.2(Oct 3, 2022)

Release notes
Source code(tar.gz)
Source code(zip)
dbt-core-1.2.2.tar.gz(771.73 KB)
dbt-postgres-1.2.2.tar.gz(13.64 KB)
dbt_core-1.2.2-py3-none-any.whl(853.48 KB)
dbt_postgres-1.2.2-py3-none-any.whl(16.29 KB)
v1.3.0rc1(Sep 28, 2022)
dbt-core 1.3.0-rc1 - September 28, 2022

Breaking Changes

Renaming Metric Spec Attributes (#5774, #5775)

Features

merge_exclude_columns for incremental materialization (#5260, #5457)

Search current working directory for profiles.yml (#5411, #5717)

Adding the window parameter to the metric spec. (#5721, #5722)

Add invocation args dict to ProviderContext class (#5524, #5782)

Adds new cli framework (#5526, #5647)

Flags work with new Click CLI (#5529, #5790)

Add metadata env method to ProviderContext class (#5522, #5794)

Array macros (#5520, #5823)

Add enabled config to exposures and metrics (#5422, #5815)

add -fr flag shorthand (#5878, #5879)

add type_boolean as a data type macro (#5739, #5875)

Support .dbtignore in project root to ignore certain files being read by dbt (#5733, #5897)

This conditionally no-ops warehouse connection at compile depending on an env var, disabling introspection/queries during compilation only. This is a temporary solution to more complex permissions requirements for the semantic layer. (#5936, #5926)

Fixes

Fix typos of comments in core/dbt/adapters/ (#5690, #5693)

Include py.typed in MANIFEST.in. This enables packages that install dbt-core from pypi to use mypy. (#5703, #5703)

Removal of all .coverage files when using make clean command (#5633, #5759)

Remove temp files generated by unit tests (#5631, #5749)

Fix warnings as errors during tests (#5424, #5800)

Prevent event_history from holding references (#5848, #5858)

ConfigSelectorMethod should check for bools (#5890, #5889)

shorthand for full refresh should be one character (#5878, #5908)

Fix macro resolution order during static analysis for custom generic tests (#5720, #5907)

Fix race condition when invoking dbt via lib.py concurrently (#5919, #5921)

Docs

Refer to exposures by their label by default. (dbt-docs/#306, dbt-docs/#307)

Under the Hood

Migrate integration test 014 but also fix the snapshot hard delete test's timezone logic and force all integration tests to run flags.set_from_args to force environment variables are accessible to all integration test threads. (#5760, #5760)

Support dbt-metrics compilation by rebuilding flat_graph (#5525, #5786)

Reworking the way we define the window attribute of metrics to match freshness tests (#5722, #5793)

Add PythonJobHelper base class in core and add more type checking (#5802, #5802)

The link did not go to the anchor directly, now it does (#5813, #5814)

remove key as reserved keyword from test_bool_or (#5817, #5818)

Convert default selector tests to pytest (#5728, #5820)

Compatibiltiy for metric attribute renaming (#5807, #5825)

remove source quoting setting in adapter tests (#5836, #5839)

Add name validation for metrics (#5456, #5841)

Validate exposure name and add label (#5606, #5844)

Adding validation for metric expression attribute (#5871, #5873)

Profiling and Adapter Management work with Click CLI (#5531, #5892)

Reparse references to deleted metric (#5444, #5920)

Dependency

Bump black from 22.6.0 to 22.8.0 (#4904, #5750)

Bump python from 3.10.6-slim-bullseye to 3.10.7-slim-bullseye in /docker (#4904, #5805)

Contributors

@bbroeksema (#5749)

@callum-mcdata (#5775, #5722, #5793, #5825, #5873)

@danielcmessias (#5889)

@dave-connors-3 (#5457, #5879, #5908)

@dbeatty10 (#5717, #5823)

@drewbanin (#5921)

@graciegoheen (#5823)

@jared-rimmer (#5782, #5794, #5759)

@jpmmcneill (#5875)

@panasenco (#5703)

@racheldaniel (#5926)

@sdebruyn (#5814, #5818, #5839)

@yoiki (#5693)

Source code(tar.gz)
Source code(zip)
dbt-core-1.3.0rc1.tar.gz(794.87 KB)
dbt-postgres-1.3.0rc1.tar.gz(13.92 KB)
dbt_core-1.3.0rc1-py3-none-any.whl(880.79 KB)
dbt_postgres-1.3.0rc1-py3-none-any.whl(16.91 KB)
v1.3.0b2(Aug 29, 2022)
dbt-core 1.3.0-b2 - August 29, 2022

Features

Add --defer flag to dbt compile & dbt docs generate (#4110, #4514)

use MethodName.File when value ends with .csv (#5578, #5581)

Make docs configurable in dbt_project.yml and add a node_color attribute to change the color of nodes in the DAG (#5333, #5397)

Adding ResolvedMetricReference helper functions and tests (#5567, #5607)

Check dbt-core version requirements when installing Hub packages (#5648, #5651)

Fixes

Remove the default 256 characters limit on postgres character varying type when no limitation is set (#5238, #5292)

Include schema file config in unrendered_config (#5338, #5344)

Resolves #5351 - Do not consider shorter varchar cols as schema changes (#5351, #5395)

on_shchma_change fail verbosity enhancement (#5504, #5505)

Fix error rendering docs block in metrics description (#5585, #5603)

Extended validations for the project names (#5379, #5620)

Use sys.exit instead of exit (#5621, #5627)

Finishing logic upgrade to Redshift for name truncation collisions. (#5586, #5656)

multiple args for ref and source (#5634, #5635)

Fix Unexpected behavior when chaining methods on dbt-ref'ed/sourced dataframes (#5646, #5677)

Docs

Leverages docs.node_color from dbt-core to color nodes in the DAG (dbt-docs/#44, dbt-docs/#281)

Under the Hood

Save use of default env vars to manifest to enable partial parsing in those cases. (#5155, #5589)

add more information to log line interop test failures (#5658, #5659)

Add supported languages to materializations (#5569, #5695)

Dependency

Bump python from 3.10.5-slim-bullseye to 3.10.6-slim-bullseye in /docker (#4904, #5623)

Bump mashumaro[msgpack] from 3.0.3 to 3.0.4 in /core (#4904, #5649)

Contributors

@Goodkat (#5581, #5620)

@Ilanbenb (#5505)

@b-per (#5397, dbt-docs/#281)

@callum-mcdata (#5607)

@epapineau (#5395)

@kadero (#4514)

@matt-winkler (#5397, dbt-docs/#281)

@shrodingers (#5292)

@sungchun12 (#5397, dbt-docs/#281)

@varun-dc (#5627)

Source code(tar.gz)
Source code(zip)
dbt-core-1.3.0b2.tar.gz(782.78 KB)
dbt-postgres-1.3.0b2.tar.gz(13.94 KB)
dbt_core-1.3.0b2-py3-none-any.whl(865.62 KB)
dbt_postgres-1.3.0b2-py3-none-any.whl(16.90 KB)
v1.2.1(Aug 25, 2022)
dbt-core 1.2.1 - August 25, 2022

Fixes

Fix handling of top-level exceptions (#5564, #5560)

Fix error rendering docs block in metrics description (#5585, #5603)

Use sys.exit instead of exit (#5621, #5627)

Finishing logic upgrade to Redshift for name truncation collisions. (#5586, #5656)

Contributors

@varun-dc (#5627)

Source code(tar.gz)
Source code(zip)
dbt-core-1.2.1.tar.gz(771.13 KB)
dbt-postgres-1.2.1.tar.gz(13.65 KB)
dbt_core-1.2.1-py3-none-any.whl(852.89 KB)
dbt_postgres-1.2.1-py3-none-any.whl(16.29 KB)
v1.2.1rc2(Aug 18, 2022)
dbt-core 1.2.1-rc2 - August 18, 2022

Fixes

Finishing logic upgrade to Redshift for name truncation collisions. (#5586, #5656)

Source code(tar.gz)
Source code(zip)
dbt-core-1.2.1rc2.tar.gz(771.22 KB)
dbt-postgres-1.2.1rc2.tar.gz(13.66 KB)
dbt_core-1.2.1rc2-py3-none-any.whl(852.93 KB)
dbt_postgres-1.2.1rc2-py3-none-any.whl(16.32 KB)
v1.2.1rc1(Aug 11, 2022)
dbt-core 1.2.1-rc1 - August 10, 2022

Fixes

Fix handling of top-level exceptions (#5564, #5560)

Fix error rendering docs block in metrics description (#5585, #5603)

Use sys.exit instead of exit (#5621, #5627)

Contributors

@varun-dc (#5627)

Source code(tar.gz)
Source code(zip)
dbt-core-1.2.1rc1.tar.gz(770.87 KB)
dbt-postgres-1.2.1rc1.tar.gz(13.64 KB)
dbt_core-1.2.1rc1-py3-none-any.whl(852.64 KB)
dbt_postgres-1.2.1rc1-py3-none-any.whl(16.32 KB)
v1.3.0b1(Jul 29, 2022)
dbt-core 1.3.0-b1 - July 29, 2022

Features

Python model inital version (#5261, #5421)

allows user to include the file extension for .py models in the dbt run -m command. (#5289, #5295)

Incremental materialization refactor and cleanup (#5245, #5359)

Python models can support incremental logic (#0, #35)

Add reusable function for retrying adapter connections. Utilize said function to add retries for Postgres (and Redshift). (#5022, #5432)

add exponential backoff to connection retries on Postgres (and Redshift) (#5502, #5503)

Fixes

Add context to compilation errors generated while rendering generic test configuration values. (#5294, #5393)

Rename try to strict for more intuitiveness (#5475, #5477)

Ignore empty strings passed in as secrets (#5312, #5518)

Fix handling of top-level exceptions (#5564, #5560)

Docs

Update dependency inline-source from ^6.1.5 to ^7.2.0 (#5574, #5577)

Update dependency jest from ^26.2.2 to ^28.1.3 (#5574, #5577)

Update dependency underscore from ^1.9.0 to ^1.13.4 (#5574, #5577)

Update dependency webpack-cli from ^3.3.12 to ^4.7.0 (#5574, #5577)

Update dependency webpack-dev-server from ^3.1.11 to ^4.9.3 (#5574, #5577)

Searches no longer require perfect matches, and instead consider each word individually. my model or model my will now find my_model, without the need for underscores (#5574, #5577)

Support the renaming of SQL to code happening in dbt-core (#5574, #5577)

Under the Hood

Added language to tracked fields in run_model event (#5571, #5469)

Update mashumaro to 3.0.3 (#4940, #5118)

Add python incremental materialization test (#0000, #5571)

Dependencies

Upgrade to Jinja2==3.1.2 from Jinja2==2.11.3 (#4748, #5465)

Bump mypy from 0.961 to 0.971 (#4904, #5495)

Remove pin for MarkUpSafe from >=0.23,<2.1 (#5506, #5507)

Contributors

@Goodkat (#5518)

@drewbanin (#5577)

@jeremyyeo (#5477)

@joellabes (#5577)

@leoebfolsom (#5295)

@nicholasyager (#5393)

@tomasfarias (#5432)

Source code(tar.gz)
Source code(zip)
dbt-core-1.3.0b1.tar.gz(778.17 KB)
dbt-postgres-1.3.0b1.tar.gz(13.88 KB)
dbt_core-1.3.0b1-py3-none-any.whl(860.41 KB)
dbt_postgres-1.3.0b1-py3-none-any.whl(16.86 KB)
v1.1.2(Jul 29, 2022)
dbt-core 1.1.2 - July 29, 2022

Fixes

Define compatibility for older manifest versions when using state: selection methods (#5213, #5346)

Under the Hood

Add annotation to render_value method reimplemented in #5334 (#4796, #5382)

Source code(tar.gz)
Source code(zip)
dbt-core-1.1.2.tar.gz(757.83 KB)
dbt-postgres-1.1.2.tar.gz(11.93 KB)
dbt_core-1.1.2-py3-none-any.whl(831.91 KB)
dbt_postgres-1.1.2-py3-none-any.whl(12.72 KB)
v1.2.0(Jul 26, 2022)
dbt-core 1.2.0 - Henry George (July 26, 2022)

Features

Add selector method when reading selector definitions (#4821, #4827)

Add set and zip function to contexts (#2345, #5107)

Adds itertools to modules Jinja namespace (#5130, #5140)

allow target as an option in profile_template.yml (#5179, #5184)

seed: Add new macro get_csv_sql (#5206, #5207)

Grants as Node Configs (#5189, #5230)

Adds file selectors and support for file selectors in the default method selector (#5240, #5241)

Move cross-db macros from dbt-utils into dbt-core global project (#4813, #5265)

Prettify duration message at the end of execution (#5253, #5364)

Early return from dbt init if no available adapters (#5365, #5366)

Allow customizing target-path and log-path through environment variables and CLI flags. (#5399, #5402)

Move type_* macros from dbt-utils into dbt-core, with tests (#5317, #5428)

Add support for ratio metrics (#4884, #5027)

Allow users to define grants as a reasonable default in the dbt_project.yml or within each model sql or yml file combined. (#5263, #5369)

Add reusable function for retrying adapter connections. Utilize said function to add retries for Postgres (and Redshift). (#5022, #5432)

Fixes

Adding new cols to check_cols in snapshots (#3146, #4893)

Truncate relation names when appending a suffix that will result in len > 63 characters using make_temp_relation and make_backup_relation macros (#2869, #4921)

Restore ability to utilize updated_at for check_cols snapshots (#5076, #5077)

Use yaml renderer (with target context) for rendering selectors (#5131, #5136)

Fix retry logic to return values after initial try (#5023, #5137)

Scrub secret env vars from CommandError in exception stacktrace (#5151, #5152)

Ensure the metric name does not contain spaces (#4572, #5173)

When parsing 'all_sources' should be a list of unique dirs (#5120, #5176)

Add warning if yaml contains duplicate keys (#5114, #5146)

Modifying the drop_test_schema to work better with Redshift issues around locked tables and current transactions (#5200, #5198)

Fix column comparison in snapshot_check_all_get_existing_columns for check-strategy snapshots with explicit check_cols defined (#5222, #5223)

Changed how --select state:modified detects changes for macros nodes depend on (#5202, #5224)

Fix column comparison in snapshot_check_all_get_existing_columns to use adapter.get_columns_in_relation (#5222, #5232)

Remove docs file from manifest when removing doc node (#4146, #5270)

Remove duplicate dbt script entry (#5314, #5304)

Change node ancestor/descendant algo, fixes issue where downstream models aren't run when using networkx >= 2.8.1 (#5286, #5326)

Fixing Windows color regression (#5191, #5327)

Define compatibility for older manifest versions when using state: selection methods (#5213, #5346)

Add inheritance to materialization macro resolution (#4646, #5348)

Improve pluralizations for Documentation and SqlOperation NodeTypes (#5352, #5356)

Properly use quotes for Snowflake snapshots when checking all columns (#2975, #5389)

fixes handling of RESET color code with USE_COLORS=False (#5288, #5394)

Remove duplicate key checking introduced in 1.2.0a1 (#5331, #5403)

Rename try to strict for more intuitiveness (#5475, #5477)

Docs

Fixed sample SQL Code for sources when no database is defined (#5255, #5446)

Add support for file: selector in DAG viz (#5255, #5446)

[Snyk] Upgrade prismjs from 1.27.0 to 1.28.0 (#5255, #5446)

Run build and tests in CI checks (#5255, #5446)

Improve metrics DAG viz and documentation page (#5255, #5446)

Upgrade cytoscape.js fork (#5255, #5446)

Under the Hood

Migrating 005_simple_seed to the new test framework. (#200, #5013)

Convert 029_docs_generate tests to new framework (#5035, #5058)

Move package deprecation check outside of package cache (#5068, #5069)

removal of scaffold first attempt and create_adapter_plugin.py as they are deprecated new scaffold can be found https://github.com/dbt-labs/dbt-database-adapter-scaffold (#4980, #5117)

Mypy -> 0.942 + fixed import logic to allow for full mypy coverage (#4805, #5171)

Converted dbt list tests to pytest (#5049, #5178)

Fix: Call str and repr for UnsetProfileConfig without a RuntimeException (#5081, #5209)

Improve tracking error logging message (#5197, #5211)

Clean up materialization logic: more consistent relation names, loading from cache (#2869, #4921)

Use the default Python version for local dev and test instead of requiring Python 3.8 (#5257, #5269)

Fix test for context set function (#5266, #5272)

Fix pip upgrade step in CI for Windows (#5321, #5320)

Fix unit test test_graph_selection (#5323, #5324)

Update context readme + clean up context code" (#4796, #5334)

removed script meant for snowflake to snowflake (#5361, #5362)

Added the suggested RegEx to check the SemVer string within a package dependency and improved invalid version error handling. (#5201, #5370)

Add annotation to render_value method reimplemented in #5334 (#4796, #5382)

Bump manifest version to v6 (#5417, #5430)

Add tests for SQL grants (#5437, #5447)

Dependencies

Bump ubuntu from 20.04 to 22.04 (#4904, #5141)

Bumping hologram version (#5219, #5218)

Bump mypy from 0.942 to 0.961 (#4904, #5337)

Bump python from 3.10.3-slim-bullseye to 3.10.5-slim-bullseye in /docker (#4904, #5367)

Update colorama requirement from <0.4.5,>=0.3.9 to >=0.3.9,<0.4.6 in /core (#4904, #5388)

Bump black from 22.3.0 to 22.6.0 (#4904, #5420)

Security

Move string interpolation of "secret" env vars outside of Jinja context. Update "contexts" README (#4796, #5334)

Contributors

@GtheSheep (#4893)

@NicolasPA (#5211)

@adamantike (#5207)

@alexrosenfeld10 (#5184)

@b-per (#5446)

@bd3dowling (#5140)

@callum-mcdata (#5027, #5446)

@danieldiamond (#4827)

@darin-reify (#5394)

@dbeatty10 (#5265, #5077)

@drewbanin (#5027, #5446, #5446, #5446)

@epapineau (#4921)

@fivetran-joemarkiewicz (#5370)

@groodt (#5304)

@isidentical (#5402)

@jared-rimmer (#5364)

@jeremyyeo (#5107, #5146, #5403, #5477)

@jwills (#5241, #5269)

@pdebelak (#5356)

@pquadri (#5389)

@tomasfarias (#5432, #5209)

@ulisesojeda (#5366)

@volkangurel (#5348)

Henry George (1839 – 1897)

Thanks to @heysweet for writing this biography of Henry George —a famous Philadelphian, and the namesake of this dbt Core release.

“Men like Henry George are rare unfortunately. One cannot imagine a more beautiful combination of intellectual keenness, artistic form and fervent love of justice. Every line is written as if for our generation. The spreading of these works is a really deserving cause, for our generation especially has many and important things to learn from Henry George.”

— Albert Einstein

The boardgame of Monopoly has long been credited to Philadelphian Charles Darrow — a rags to riches story of a man down on his luck struck with a brilliant idea. In reality, Charles Darrow stole the idea while playing a boardgame **at a dinner party with friends. Poet, comedian, actress, and engineer Lizzie Magie designed the original, called The Landlord’s Game, which was intended to demonstrate the political theories of Henry George (Georgism). While the Capitalist ruleset was the more popular and ended up as the base of Monopoly itself, the game was designed with an alternate ruleset for how to play, which involved all players being rewarded during wealth creation. These two playstyles demonstrate the harm of natural monopolies and the societal benefit that comes to all from a land value tax. (Charles Darrow went on to become the first millionaire game designer, and though Magie had a patent, she made $500 from the game she designed).

Henry George was a prominent political economist and journalist, who advocated strongly for women’s suffrage, universal basic income, a land value tax, free or affordable public services in place of natural monopolies (utilities, mass transit, libraries), and campaign finance reform.

His first book Progress and Poverty (1879) popularized many of the ideas of Georgism (alternatively Geoism or the single tax movement), which argued that people should own the value they produce themselves, and that rent on land, natural resources, and the commons should belong equally to all members of society, to be paid out in the form of a citizen’s dividend. A leading figure of the Progressive Movement John Peter Altgeld wrote that George “made almost as great an impression on the economic thought of the age as Darwin did on the world of science.”

While George’s ideas were popular across the political spectrum among socialists, libertarians, and conservatives alike, George never won a political election. Henry ran for Mayor of New York City as the candidate for the United Labor Party in New York City’s mayoral election, losing to Theodore Roosevelt, came in third in the election for Secretary of State of New York, and in 1897 during his second run for mayor of New York City, succumbed to a fatal stroke during the campaign.

Henry George was born in Philadelphia on September 2, 1839 and died of a stroke four days before his second run for the mayor of New York City on October 29, 1897.
Source code(tar.gz)
Source code(zip)
dbt-core-1.2.0.tar.gz(770.68 KB)
dbt-postgres-1.2.0.tar.gz(13.64 KB)
dbt_core-1.2.0-py3-none-any.whl(852.49 KB)
dbt_postgres-1.2.0-py3-none-any.whl(16.29 KB)
v1.2.0rc2(Jul 20, 2022)
dbt-core 1.2.0-rc2 - July 20, 2022

Features

Add reusable function for retrying adapter connections. Utilize said function to add retries for Postgres (and Redshift). (#5022, #5432)

Fixes

Rename try to strict for more intuitiveness (#5475, #5477)

Contributors

@jeremyyeo (#5477)

@tomasfarias (#5432)

Source code(tar.gz)
Source code(zip)
dbt-core-1.2.0rc2.tar.gz(770.76 KB)
dbt-postgres-1.2.0rc2.tar.gz(13.66 KB)
dbt_core-1.2.0rc2-py3-none-any.whl(852.53 KB)
dbt_postgres-1.2.0rc2-py3-none-any.whl(16.32 KB)
v1.1.2rc1(Jul 20, 2022)
dbt-core 1.1.2-rc1 - July 20, 2022

Fixes

Define compatibility for older manifest versions when using state: selection methods (#5213, #5346)

Under the Hood

Add annotation to render_value method reimplemented in #5334 (#4796, #5382)

Source code(tar.gz)
Source code(zip)
dbt-core-1.1.2rc1.tar.gz(757.86 KB)
dbt-postgres-1.1.2rc1.tar.gz(11.94 KB)
dbt_core-1.1.2rc1-py3-none-any.whl(831.95 KB)
dbt_postgres-1.1.2rc1-py3-none-any.whl(12.75 KB)
v1.2.0rc1(Jul 11, 2022)
dbt-core 1.2.0-rc1 - July 11, 2022

Features

Allow customizing target-path and log-path through environment variables and CLI flags. (#5399, #5402)

Move type_* macros from dbt-utils into dbt-core, with tests (#5317, #5428)

Add support for ratio metrics (#4884, #5027)

Allow users to define grants as a reasonable default in the dbt_project.yml or within each model sql or yml file combined. (#5263, #5369)

Fixes

Add inheritance to materialization macro resolution (#4646, #5348)

Improve pluralizations for Documentation and SqlOperation NodeTypes (#5352, #5356)

Properly use quotes for Snowflake snapshots when checking all columns (#2975, #5389)

fixes handling of RESET color code with USE_COLORS=False (#5288, #5394)

Docs

Fixed sample SQL Code for sources when no database is defined (#5255, #5446)

Add support for file: selector in DAG viz (#5255, #5446)

[Snyk] Upgrade prismjs from 1.27.0 to 1.28.0 (#5255, #5446)

Run build and tests in CI checks (#5255, #5446)

Improve metrics DAG viz and documentation page (#5255, #5446)

Upgrade cytoscape.js fork (#5255, #5446)

Under the Hood

Added the suggested RegEx to check the SemVer string within a package dependency and improved invalid version error handling. (#5201, #5370)

Add annotation to render_value method reimplemented in #5334 (#4796, #5382)

Bump manifest version to v6 (#5417, #5430)

Add tests for SQL grants (#5437, #5447)

Dependencies

Bump mypy from 0.942 to 0.961 (#4904, #5337)

Update colorama requirement from <0.4.5,>=0.3.9 to >=0.3.9,<0.4.6 in /core (#4904, #5388)

Bump black from 22.3.0 to 22.6.0 (#4904, #5420)

Security

Move string interpolation of "secret" env vars outside of Jinja context. Update "contexts" README (#4796, #5334)

Contributors

@b-per (#5446)

@callum-mcdata (#5027, #5446)

@darin-reify (#5394)

@drewbanin (#5027, #5446, #5446, #5446)

@fivetran-joemarkiewicz (#5370)

@isidentical (#5402)

@pdebelak (#5356)

@pquadri (#5389)

@volkangurel (#5348)

Source code(tar.gz)
Source code(zip)
dbt-core-1.2.0rc1.tar.gz(769.41 KB)
dbt-postgres-1.2.0rc1.tar.gz(13.49 KB)
dbt_core-1.2.0rc1-py3-none-any.whl(851.22 KB)
dbt_postgres-1.2.0rc1-py3-none-any.whl(16.15 KB)
v1.2.0b1(Jun 24, 2022)
dbt-core 1.2.0-b1 - June 24, 2022

Features

Add selector method when reading selector definitions (#4821, #4827)

Add set and zip function to contexts (#2345, #5107)

Adds itertools to modules Jinja namespace (#5130, #5140)

allow target as an option in profile_template.yml (#5179, #5184)

seed: Add new macro get_csv_sql (#5206, #5207)

Grants as Node Configs (#5189, #5230)

Adds file selectors and support for file selectors in the default method selector (#5240, #5241)

Move cross-db macros from dbt-utils into dbt-core global project (#4813, #5265)

Prettify duration message at the end of execution (#5253, #5364)

Early return from dbt init if no available adapters (#5365, #5366)

Fixes

Adding new cols to check_cols in snapshots (#3146, #4893)

Truncate relation names when appending a suffix that will result in len > 63 characters using make_temp_relation and make_backup_relation macros (#2869, #4921)

Restore ability to utilize updated_at for check_cols snapshots (#5076, #5077)

Use yaml renderer (with target context) for rendering selectors (#5131, #5136)

Fix retry logic to return values after initial try (#5023, #5137)

Scrub secret env vars from CommandError in exception stacktrace (#5151, #5152)

Ensure the metric name does not contain spaces (#4572, #5173)

When parsing 'all_sources' should be a list of unique dirs (#5120, #5176)

Add warning if yaml contains duplicate keys (#5114, #5146)

Modifying the drop_test_schema to work better with Redshift issues around locked tables and current transactions (#5200, #5198)

Fix column comparison in snapshot_check_all_get_existing_columns for check-strategy snapshots with explicit check_cols defined (#5222, #5223)

Changed how --select state:modified detects changes for macros nodes depend on (#5202, #5224)

Fix column comparison in snapshot_check_all_get_existing_columns to use adapter.get_columns_in_relation (#5222, #5232)

Remove docs file from manifest when removing doc node (#4146, #5270)

Remove duplicate dbt script entry (#5314, #5304)

Change node ancestor/descendant algo, fixes issue where downstream models aren't run when using networkx >= 2.8.1 (#5286, #5326)

Fixing Windows color regression (#5191, #5327)

Define compatibility for older manifest versions when using state: selection methods (#5213, #5346)

Remove duplicate key checking introduced in 1.2.0a1 (#5331, #5403)

Under the Hood

Migrating 005_simple_seed to the new test framework. (#200, #5013)

Convert 029_docs_generate tests to new framework (#5035, #5058)

Move package deprecation check outside of package cache (#5068, #5069)

removal of scaffold first attempt and create_adapter_plugin.py as they are deprecated new scaffold can be found https://github.com/dbt-labs/dbt-database-adapter-scaffold (#4980, #5117)

Mypy -> 0.942 + fixed import logic to allow for full mypy coverage (#4805, #5171)

Converted dbt list tests to pytest (#5049, #5178)

Fix: Call str and repr for UnsetProfileConfig without a RuntimeException (#5081, #5209)

Improve tracking error logging message (#5197, #5211)

Clean up materialization logic: more consistent relation names, loading from cache (#2869, #4921)

Use the default Python version for local dev and test instead of requiring Python 3.8 (#5257, #5269)

Fix test for context set function (#5266, #5272)

Fix pip upgrade step in CI for Windows (#5321, #5320)

Fix unit test test_graph_selection (#5323, #5324)

Update context readme + clean up context code" (#4796, #5334)

removed script meant for snowflake to snowflake (#5361, #5362)

Dependencies

Bump ubuntu from 20.04 to 22.04 (#4904, #5141)

Bumping hologram version (#5219, #5218)

Bump python from 3.10.3-slim-bullseye to 3.10.5-slim-bullseye in /docker (#4904, #5367)

Contributors

@GtheSheep (#4893)

@NicolasPA (#5211)

@adamantike (#5207)

@alexrosenfeld10 (#5184)

@bd3dowling (#5140)

@danieldiamond (#4827)

@dbeatty10 (#5265, #5077)

@dependabot[bot] (#5141, #5367)

@epapineau (#4921)

@groodt (#5304)

@jared-rimmer (#5364)

@jeremyyeo (#5107, #5146, #5403)

@jwills (#5241, #5269)

@tomasfarias (#5209)

@ulisesojeda (#5366)

Source code(tar.gz)
Source code(zip)
dbt-core-1.2.0b1.tar.gz(762.49 KB)
dbt-postgres-1.2.0b1.tar.gz(13.39 KB)
dbt_core-1.2.0b1-py3-none-any.whl(842.83 KB)
dbt_postgres-1.2.0b1-py3-none-any.whl(16.04 KB)
v1.1.1(Jun 15, 2022)
dbt-core 1.1.1 - June 15, 2022

Fixes

Relax minimum supported version of MarkupSafe (#4745, #5039)

When parsing 'all_sources' should be a list of unique dirs (#5120, #5176)

Remove docs file from manifest when removing doc node (#4146, #5270)

Fixing Windows color regression (#5191, #5327)

Under the Hood

Move string interpolation of "secret" env vars outside of Jinja context. Update "contexts" README (#4796, #5334)

Dependencies

Bumping hologram version (#5219, #5218)

Pin networkx to <2.8.4 for v1.1 patches (#5286, #5334)

Contributors

@adamantike (#5039)

Source code(tar.gz)
Source code(zip)
dbt-core-1.1.1.tar.gz(757.41 KB)
dbt-postgres-1.1.1.tar.gz(11.93 KB)
dbt_core-1.1.1-py3-none-any.whl(831.57 KB)
dbt_postgres-1.1.1-py3-none-any.whl(12.72 KB)
v1.0.8(Jun 15, 2022)
dbt-core 1.0.8 - June 15, 2022

Under the Hood

Move string interpolation of "secret" env vars outside of Jinja context. Update "contexts" README (#4796, #5334)

Source code(tar.gz)
Source code(zip)
dbt-core-1.0.8.tar.gz(745.29 KB)
dbt-postgres-1.0.8.tar.gz(11.99 KB)
dbt_core-1.0.8-py3-none-any.whl(817.51 KB)
dbt_postgres-1.0.8-py3-none-any.whl(12.78 KB)
v1.1.1rc2(Jun 7, 2022)
dbt-core 1.1.1-rc2 - June 07, 2022

Fixes

Remove docs file from manifest when removing doc node (#4146, #5270)

Fixing Windows color regression (#5191, #5327)

Under the Hood

Update context readme + clean up context code" (#4796, #5334)

Dependencies

Pin networkx to <2.8.4 for v1.1 patches (#5286, #5334)

Source code(tar.gz)
Source code(zip)
dbt-core-1.1.1rc2.tar.gz(757.47 KB)
dbt-postgres-1.1.1rc2.tar.gz(11.97 KB)
dbt_core-1.1.1rc2-py3-none-any.whl(831.61 KB)
dbt_postgres-1.1.1rc2-py3-none-any.whl(12.75 KB)
v1.1.1rc1(May 20, 2022)
dbt-core 1.1.1-rc1 - May 20, 2022

Fixes

Relax minimum supported version of MarkupSafe (#4745, #5039)

When parsing 'all_sources' should be a list of unique dirs (#5120, #5176)

Dependencies

Bumping hologram version (#5219, #5218)

Contributors

@adamantike (#5039)

Source code(tar.gz)
Source code(zip)
dbt-core-1.1.1rc1.tar.gz(757.48 KB)
dbt-postgres-1.1.1rc1.tar.gz(11.96 KB)
dbt_core-1.1.1rc1-py3-none-any.whl(831.59 KB)
dbt_postgres-1.1.1rc1-py3-none-any.whl(12.75 KB)
v1.0.7(May 20, 2022)
dbt-core 1.0.7 - May 20, 2022

Dependencies

Lowering networkx dependency range due to new version's breaking change (#5254, #5280)

Source code(tar.gz)
Source code(zip)
dbt-core-1.0.7.tar.gz(745.15 KB)
dbt-postgres-1.0.7.tar.gz(12.01 KB)
dbt_core-1.0.7-py3-none-any.whl(817.42 KB)
dbt_postgres-1.0.7-py3-none-any.whl(12.78 KB)
v1.1.0(Apr 28, 2022)
dbt Core 1.1.0 - Gloria Casarez (April 28, 2022)

📖 Docs: "Upgrading to v1.1"

Breaking Changes

There are no breaking changes to code in dbt projects or packages.

Relevant to consumers of dbt metadata:

The manifest schema version has updated to v5. The only change is to the default value of config for parsed nodes (#5033, #5032)

The structured logging interface has updated to v2. The node_info dictionary has moved underneath data, on events where it is available. Previously, it was a top-level key. (#4505)

Relevant to maintainers of adapter plugins:

The abstractmethods get_response and execute now only return a connection.AdapterReponse in type hints. (Previously, they could return a string.) We encourage you to update your methods to return an object of class AdapterResponse, or implement a subclass specific to your adapter (#4499, #4869)

Internal adapter methods set_relations_cache + _relations_cache_for_schemas each take an additional argument, for use with experimental CACHE_SELECTED_ONLY config (#4688, #4860)

We have a new framework for testing dbt adapters. Docs: "Testing a new adapter"

Features

Change behaviour of non_null test so that it only selects all columns if --store-failures is enabled. (#4769, #4777)

Testing framework for dbt adapter testing (#4730, #4846)

Allow unique key to take a list implementation for postgres/redshift (#4738, #4858)

Add --cache_selected_only flag to cache schema object of selected models only. (#4688, #4860)

Support custom names for generic tests (#3348, #4898)

Added Support for Semantic Versioning (#4453, #4644)

New Dockerfile to support specific db adapters and platforms. See docker/README.md for details (#4495, #4487)

Allow unique_key to take a list (#2479, #4618)

Add --quiet global flag and print Jinja function (#3451, #4701)

Add space before justification periods (#4737, #4744)

Enable dbt jobs to run downstream models based on fresher sources. Compare the source freshness results between previous and current state. If any source is fresher and/or new in current vs. previous state, dbt will run and test the downstream models in scope. Example command: dbt build --select source_status:fresher+ (#4050, #4256)

converting unique key as list tests to new pytest format (#4882, #4958)

Add a variable called selected_resources in the Jinja context containing a list of all the resources matching the nodes for the --select, --exclude and/or --selector parameters. (#3471, #5001)

Support the DO_NOT_TRACK environment variable from the consoledonottrack.com initiative (#3540, #5000)

Add --no-print global flag (#4710, #4854)

add enabled as a source config (#3662, #5008)

Fixes

Fix bug causing empty node level meta, snapshot config errors (#4459, #4726)

Inconsistent timestamps between inserted/updated and deleted rows in snapshots (#4347, #4513)

Catch all Requests Exceptions on deps install to attempt retries. Also log the exceptions hit. (#4849, #4865)

Use cli_vars instead of context to create package and selector renderers (#4876, #4878)

depend on new dbt-extractor version with fixed github links (#4891, #4890)

Update bumpervsion config to stop looking for missing setup.py (#-1, #4896)

Corrected permissions settings for docker release workflow (#4902, #4903)

User wasn't asked for permission to overwite a profile entry when running init inside an existing project (#4375, #4447)

Add project name validation to dbt init (#4490, #4536)

Allow override of string and numeric types for adapters. (#4603, #4604)

A change in secret environment variables won't trigger a full reparse (#4650, #4665)

Fix misspellings and typos in docstrings (#4904, #4545)

Catch more cases to retry package retrieval for deps pointing to the hub. Also start to cache the package requests. (#4849, #4982)

Make the warning message for a full event deque more descriptive (#4962, #5011)

Fix hard delete snapshot test (#4916, #5020)

Restore ability to utilize updated_at for check_cols snapshots (#5076, #5077)

Use yaml renderer (with target context) for rendering selectors (#5131, #5136)

Fix retry logic to return values after initial try (#5023, #5137)

Scrub secret env vars from CommandError in exception stacktrace (#5151, #5152)

Docs

Resolve errors related to operations preventing DAG from generating in the docs. Also patch a spark issue to allow search to filter accurately past the missing columns. (#4578, #4763)

Fixed capitalization in UI for exposures of type: ml (#4984, #4995)

List packages and tags in alphabetical order (#4984, #4995)

Bump jekyll from 3.8.7 to 3.9.0 (#4984, #4995)

Updated docker README to reflect necessity of using BuildKit (#4990, #5018)

Under the Hood

Automate changelog generation with changie (#4652, #4743)

add performance regression testing runner without orchestration (#4021, #4602)

Fix broken links for changelog generation and tweak GHA to only post a comment once when changelog entry is missing. (#4848, #4857)

Add support for Python 3.10 (#4562, #4866)

Enable more dialects to snapshot sources with added columns, even those that don't support boolean datatypes (#4488, #4871)

Add Graph Compilation and Adapter Cache tracking (#4625, #4912)

Testing cleanup (#3648, #4509)

Clean up test deprecation warnings (#3988, #4556)

Use mashumaro for serialization in event logging (#4504, #4505)

Drop support for Python 3.7.0 + 3.7.1 (#4584, #4585)

Drop support for Python <3.7.2 (#4584, #4643)

Re-format codebase (except tests) using pre-commit hooks (#3195, #4697)

Add deps module README (#4904, #4686)

Initial conversion of tests to pytest (#4690, #4691)

Fix errors in Windows for tests/functions (#4782, #4767)

Create a dbt.tests.adapter release when releasing dbt and postgres (#4812, #4948)

update docker image to use python 3.10.3 (#4904, #4963)

updates black to 22.3.0 which fixes dependency incompatibility when running with precommit. (#4904, #4972)

Adds config util for ad-hoc creation of project objs or dicts (#4808, #4981)

Remove TableComparison and convert existing calls to use dbt.tests.util (#4778, #4986)

Remove unneeded create_schema in snapshot materialization (#4742, #4993)

Added .git-blame-ignore-revs file to mask re-formmating commits from git blame (#5004, #5019)

Convert version tests to pytest (#5024, #5026)

Updating tests and docs to show that we now support Python 3.10 (#4974, #5025)

Update --version output and logic (#4724, #5029)

Contributors

@agoblet (#5000)

@alswang18 (#4644)

@amirkdv (#4536)

@anaisvaillant (#4256)

@b-per (#5001)

@dbeatty10 (#5077)

@ehmartens (#4701)

@joellabes (#4744)

@jonstacks (#4995)

@kadero (#4513)

@karunpoudel (#4860, #4860)

@kazanzhy (#4545)

@matt-winkler (#4256)

@mdesmet (#4604)

@NiallRees (#4447)

@pgoslatara (#4995)

@poloaraujo (#4854)

@sungchun12 (#4256)

@willbowditch (#4777)

Gloria Casarez (1971-2014)

Thanks to @glin7 for writing this biography of Gloria Casarez—a famous Philadelphian, and the namesake of this dbt Core release.

Born in Philadelphia in 1971, Gloria Casarez was a force for equality in her hometown.

Under her leadership as Philly's first director of LGBT Affairs between 2008-2014, the city became first in the nation to offer tax credits to companies providing domestic partner and transgender health benefits—the broadest LGBTQ protections in the country at the time.

This topped a lifetime of community organizing that included leading GALAEI (Gay and Lesbian Latino AIDS Initiative), where she launched Philly's first mobile HIV testing centers, and co-chairing the board of Prevention Point (a needle exchange program), as well as years of housing and antipoverty activism.

Casarez was known for her persistence as an activist, and inspired devotion in kind from her community of supporters. When property developers painted over a Philadelphia mural of Casarez in late 2020, a local artist responded by recreating the tribute on a building merely steps away. After the second mural was also torn down, a third—and standing—homage was painted on the GALAEI building in 2021. Philadelphia's City Hall unveiled a historical marker honoring Casarez's legacy in the same year.
Source code(tar.gz)
Source code(zip)
dbt-core-1.1.0.tar.gz(757.37 KB)
dbt-postgres-1.1.0.tar.gz(11.96 KB)
dbt_core-1.1.0-py3-none-any.whl(831.49 KB)
dbt_postgres-1.1.0-py3-none-any.whl(12.74 KB)
v1.0.6(Apr 27, 2022)
Fixes

Use yaml renderer (with target context) for rendering selectors (#5131, #5136)

Fix retry logic to return values after initial try (#5023, #5137)

Scrub secret env vars from CommandError in exception stacktrace (#5151, #5152)

Under the Hood

Move package deprecation check outside of package cache (#5068, #5069)

Source code(tar.gz)
Source code(zip)
dbt-core-1.0.6.tar.gz(745.22 KB)
dbt-postgres-1.0.6.tar.gz(12.01 KB)
dbt_core-1.0.6-py3-none-any.whl(817.44 KB)
dbt_postgres-1.0.6-py3-none-any.whl(12.79 KB)