dagster-polars has been added as an integration. Thanks @danielgafni!
[dagster-dbt] @dbt_assets now supports loading projects with semantic models.
[dagster-dbt] @dbt_assets now supports loading projects with model versions.
[dagster-dbt] get_asset_key_for_model now supports retrieving asset keys for seeds and snapshots. Thanks @aksestok!
[dagster-duckdb] The Dagster DuckDB integration supports DuckDB version 0.10.0.
[UPath I/O manager] If a non-partitioned asset is updated to have partitions, the file containing the non-partitioned asset data will be deleted when the partitioned asset is materialized, rather than raising an error.
Fixed an issue where creating a backfill of assets with dynamic partitions and a backfill policy would sometimes fail with an exception.
Fixed an issue with the type annotations on the @asset decorator causing a false positive in Pyright strict mode. Thanks @tylershunt!
[ui] On the asset graph, nodes are slightly wider allowing more text to be displayed, and group names are no longer truncated.
[ui] Fixed an issue where the groups in the asset graph would not update after an asset was switched between groups.
[dagster-k8s] Fixed an issue where setting the security_context field on the k8s_job_executor didn't correctly set the security context on the launched step pods. Thanks @krgn!
Observable source assets can now yield ObserveResults with no data_version.
You can now include FreshnessPolicys on observable source assets. These assets will be considered “Overdue” when the latest value for the “dagster/data_time” metadata value is older than what’s allowed by the freshness policy.
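The "Overdue" check described above can be pictured as a comparison between the latest data time and the allowed lag. This is a minimal stdlib-only sketch, not Dagster's implementation: the helper name `is_overdue` is hypothetical, and it assumes the policy is expressed as a maximum lag in minutes.

```python
from datetime import datetime, timedelta, timezone


def is_overdue(data_time: datetime, maximum_lag_minutes: float, now: datetime) -> bool:
    """Hypothetical helper: True when data_time is older than the allowed lag."""
    return now - data_time > timedelta(minutes=maximum_lag_minutes)


now = datetime(2024, 3, 1, 12, 0, tzinfo=timezone.utc)

# Observed 30 minutes ago with a 60 minute allowance: still fresh.
fresh = is_overdue(datetime(2024, 3, 1, 11, 30, tzinfo=timezone.utc), 60, now)

# Observed 2 hours ago with a 60 minute allowance: overdue.
stale = is_overdue(datetime(2024, 3, 1, 10, 0, tzinfo=timezone.utc), 60, now)

print(fresh, stale)  # False True
```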
[ui] In Dagster Cloud, a new feature flag allows you to enable an overhauled asset overview page with a high-level stakeholder view of the asset’s health, properties, and column schema.
[kubernetes] Fixed an issue where the Kubernetes agent would sometimes leave dangling Kubernetes services if the agent was interrupted while it was being terminated.
Within a backfill or within auto-materialize, when submitting runs for partitions of the same assets, runs are now submitted in lexicographical order of partition key, instead of in an unpredictable order.
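Lexicographical order compares partition keys as strings, which matters for keys that look numeric. The sketch below uses plain Python sorting with made-up partition keys to show what that ordering implies; it does not call any Dagster API.

```python
# ISO-formatted date keys sort chronologically when compared as strings.
date_keys = ["2024-01-10", "2024-01-02", "2023-12-31"]
print(sorted(date_keys))  # ['2023-12-31', '2024-01-02', '2024-01-10']

# Numeric-looking keys are compared character by character, so "10" sorts before "2".
numeric_keys = ["10", "9", "2"]
print(sorted(numeric_keys))  # ['10', '2', '9']
```

If numeric ordering matters, zero-padding the keys (for example "02", "09", "10") keeps string order and numeric order aligned.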
[dagster-k8s] Include k8s pod debug info in run worker failure messages.
[dagster-dbt] Events emitted by DbtCliResource now include metadata from the dbt adapter response. This includes fields like rows_affected, query_id from the Snowflake adapter, or bytes_processed from the BigQuery adapter.
A previous change prevented asset backfills from grouping multiple assets into the same run when using BackfillPolicies under certain conditions. While the backfills would still execute in the proper order, this could lead to more individual runs than necessary. This has been fixed.
[dagster-k8s] Fixed an issue introduced in the 1.6.4 release where upgrading the Helm chart without upgrading the Dagster version used by user code caused failures in jobs using the k8s_job_executor.
[instigator-tick-logs] Fixed an issue where invoking context.log.exception in a sensor or schedule did not properly capture exception information.
[asset-checks] Fixed an issue where additional dependencies for dbt tests modeled as Dagster asset checks were not properly being deduplicated.
[dagster-dbt] Fixed an issue where dbt model, seed, or snapshot names with periods were not supported.
@observable_source_asset-decorated functions can now return an ObserveResult. This allows including metadata on the observation, in addition to a data version. This is currently only supported for non-partitioned assets.
[auto-materialize] A new AutoMaterializeRule.skip_on_not_all_parents_updated_since_cron rule allows you to construct AutoMaterializePolicys that wait for all parents to be updated after the latest tick of a given cron schedule.
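The waiting behavior of this rule can be sketched as follows. This is an illustration of the idea only: `should_skip` is a hypothetical helper, the latest cron tick is passed in as a precomputed datetime rather than derived from a cron string, and the asset names are made up.

```python
from datetime import datetime, timezone
from typing import Dict, Optional


def should_skip(parent_updated_at: Dict[str, Optional[datetime]], latest_tick: datetime) -> bool:
    """Hypothetical check: skip while any parent has not updated since the latest tick."""
    return any(
        updated is None or updated < latest_tick
        for updated in parent_updated_at.values()
    )


latest_tick = datetime(2024, 3, 1, 0, 0, tzinfo=timezone.utc)
parents = {
    "orders": datetime(2024, 3, 1, 0, 5, tzinfo=timezone.utc),
    "customers": datetime(2024, 2, 29, 23, 50, tzinfo=timezone.utc),  # before the tick
}
skip_before = should_skip(parents, latest_tick)

# Once the lagging parent updates after the tick, the asset is no longer skipped.
parents["customers"] = datetime(2024, 3, 1, 0, 10, tzinfo=timezone.utc)
skip_after = should_skip(parents, latest_tick)

print(skip_before, skip_after)  # True False
```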
[Global op/asset concurrency] Ops and assets now take run priority into account when claiming global op/asset concurrency slots.
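One way to picture priority-aware slot claiming is a priority queue in which higher-priority requests are served first and ties fall back to arrival order. The `SlotQueue` class below is a hypothetical stdlib-only sketch of that idea, not the mechanism Dagster uses internally.

```python
import heapq
from itertools import count
from typing import List, Tuple


class SlotQueue:
    """Hypothetical queue handing out a fixed number of concurrency slots."""

    def __init__(self, slots: int) -> None:
        self.free_slots = slots
        self._pending: List[Tuple[int, int, str]] = []
        self._arrival = count()  # tie-breaker: earlier requests win

    def request(self, step_key: str, priority: int = 0) -> None:
        # heapq is a min-heap, so negate the priority to pop the highest first.
        heapq.heappush(self._pending, (-priority, next(self._arrival), step_key))

    def claim_available(self) -> List[str]:
        claimed = []
        while self.free_slots > 0 and self._pending:
            _, _, step_key = heapq.heappop(self._pending)
            self.free_slots -= 1
            claimed.append(step_key)
        return claimed


queue = SlotQueue(slots=2)
queue.request("low_a", priority=0)
queue.request("high", priority=5)
queue.request("low_b", priority=0)

claimed = queue.claim_available()
print(claimed)  # ['high', 'low_a']
```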
Dagster now warns when a solid, pipeline, or other definition is created with an invalid name (for example, a Python keyword). This warning will become an error in the 0.9.13 release.
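A check of this kind might look like the sketch below. It is an assumption-laden illustration, not Dagster's code: `check_definition_name` and the `VALID_NAME` pattern are hypothetical, and only the Python-keyword case is taken from the note above.

```python
import keyword
import re
import warnings

# Assumed pattern for acceptable names: letters, digits, and underscores only.
VALID_NAME = re.compile(r"^[A-Za-z0-9_]+$")


def check_definition_name(name: str) -> bool:
    """Hypothetical validator: warn and return False for an invalid name."""
    if keyword.iskeyword(name) or not VALID_NAME.match(name):
        warnings.warn(f"{name!r} is not a valid definition name", stacklevel=2)
        return False
    return True


# Record warnings so the example shows how many names were flagged.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    results = {name: check_definition_name(name) for name in ["my_solid", "class", "my-solid"]}

print(results)      # {'my_solid': True, 'class': False, 'my-solid': False}
print(len(caught))  # 2
```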
Community Contributions
Added an int type to EventMetadataEntry (Thanks @ChocoletMousse!)
Added a build_composite_solid_definition method to Lakehouse (Thanks @sd2k!)
Improved broken link detection in Dagster docs (Thanks @keyz!)
New
Improvements to log filtering on Run view in Dagit
Improvements to instance level scheduler page
Log engine events when pipeline termination is initiated
Bugfixes
Syntax errors in user code now display the file and line number with the error in Dagit
Dask executor no longer fails when using intermediate_storage
In the Celery K8s executor, we now mark the step as failed when the step job fails
Changed the DagsterInvalidAssetKey error so that it no longer fails upon being thrown
Documentation
Added API docs for dagster-dbt experimental library
Fixed some cosmetic issues with docs.dagster.io
Added code snippets from Solids examples to test path, and fixed some inconsistencies regarding parameter ordering
Changed to using markers instead of exact line numbers to mark out code snippets
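Marker-based extraction can be sketched as below: the snippet is whatever sits between a start comment and an end comment, so it survives edits that shift line numbers. The helper `extract_snippet` and the marker names are hypothetical and are not taken from the docs tooling.

```python
def extract_snippet(source: str, start_marker: str, end_marker: str) -> str:
    """Return the lines strictly between the first start marker and the next end marker."""
    lines = source.splitlines()
    start = next(i for i, line in enumerate(lines) if start_marker in line)
    end = next(i for i, line in enumerate(lines) if i > start and end_marker in line)
    return "\n".join(lines[start + 1 : end])


# Example source held as a string; it is only parsed for markers, never executed.
EXAMPLE_SOURCE = """\
from dagster import solid

# start_hello_marker
@solid
def hello(_):
    return "hello"
# end_hello_marker

print("not part of the snippet")
"""

snippet = extract_snippet(EXAMPLE_SOURCE, "# start_hello_marker", "# end_hello_marker")
print(snippet)
```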
[dagster-dask] Removed the compute option from Dask DataFrame materialization configs for all output types. Setting this option to False (default True) would result in a future that is never computed, leading to missing materializations
Console log messages are now streamlined to live on a single line per message
Added better messaging around $DAGSTER_HOME if it is not set or is improperly set up when starting up a Dagster instance
Tools for exporting a file for debugging a run have been added:
dagster debug export - a new CLI entry added for exporting a run by id to a file
dagit-debug - a new CLI added for loading dagit with a run to debug
dagit now has a button to download the debug file for a run via the action menu on the runs page
The dagster api grpc command now defaults to the current working directory if none is specified
Added retries to dagster-postgres connections
Fixed faulty warning message when invoking the same solid multiple times in the same context
Added ability to specify custom liveness probe for celery workers in kubernetes deployment
Bugfixes
Fixed a bug where Dagster types like List/Set/Tuple/Dict/Optional were not displaying properly in Dagit logs
Fixed endless spinners on dagit --empty-workspace
Fixed incorrect snapshot banner on pipeline view
Fixed visual overlapping of overflowing dagit logs
Fixed a bug where hanging runs when executing against a gRPC server could cause the Runs page to be unable to load
Fixed a bug in celery integration where celery tasks could return None when an iterable is expected, causing errors in the celery execution loop.
Experimental
[lakehouse] Each time a Lakehouse solid updates an asset, it automatically generates an AssetMaterialization event
[lakehouse] Lakehouse computed_assets now accept a version argument that describes the version of the computation
Setting the “dagster/is_memoized_run” tag to true will cause the run to skip any steps whose versions match the versions of outputs produced in prior runs.
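The skipping rule for memoized runs can be illustrated by comparing each step's current version against the version stored from prior runs. In this sketch the function `steps_to_execute`, the step names, and the version strings are all hypothetical; it shows the comparison only, not how Dagster computes or stores versions.

```python
from typing import Dict, List


def steps_to_execute(current_versions: Dict[str, str], stored_versions: Dict[str, str]) -> List[str]:
    """Keep only the steps whose current version has no matching stored output version."""
    return [
        step
        for step, version in current_versions.items()
        if stored_versions.get(step) != version
    ]


current = {"load": "a1", "clean": "b2", "train": "c3"}
stored = {"load": "a1", "clean": "b1"}  # "train" has never produced an output

to_run = steps_to_execute(current, stored)
print(to_run)  # ['clean', 'train']
```

Here "load" is skipped because its version matches the stored one, "clean" reruns because its version changed, and "train" runs because no prior output exists.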
[dagster-dbt] Solids for running dbt CLI commands
Added extensive documentation to illuminate how versions are computed
Added versions for step inputs from config, default values, and from other step outputs
Support for the Dagster step selection DSL: reexecute_pipeline now takes step_selection, which accepts queries like *solid_a.compute++ (i.e., solid_a.compute, all of its ancestors, its immediate descendants, and their immediate descendants). steps_to_execute is deprecated and will be removed in 0.10.0.
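To make the query syntax concrete, the sketch below resolves a selection such as *solid_a.compute++ against a small, made-up step graph. It is a stdlib-only illustration and does not call reexecute_pipeline; the graph, the `select_steps` helper, and the treatment of leading `+` as "one upstream level per plus" are assumptions made by symmetry with the trailing form described above.

```python
import re
from typing import Dict, Optional, Set

# Hypothetical step graph: step key -> keys of the steps it depends on.
UPSTREAM: Dict[str, Set[str]] = {
    "load.compute": set(),
    "solid_a.compute": {"load.compute"},
    "solid_b.compute": {"solid_a.compute"},
    "solid_c.compute": {"solid_b.compute"},
    "solid_d.compute": {"solid_c.compute"},
}

# "*" means unlimited depth; each "+" means one more level.
QUERY = re.compile(r"^(?P<up>\*|\+*)(?P<key>[A-Za-z0-9_.]+)(?P<down>\*|\+*)$")


def invert(graph: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Turn an upstream mapping into a downstream mapping."""
    downstream: Dict[str, Set[str]] = {key: set() for key in graph}
    for key, parents in graph.items():
        for parent in parents:
            downstream[parent].add(key)
    return downstream


def walk(graph: Dict[str, Set[str]], start: str, depth: Optional[int]) -> Set[str]:
    """Collect steps reachable from start within depth hops (None means unlimited)."""
    found: Set[str] = set()
    frontier, hops = {start}, 0
    while frontier and (depth is None or hops < depth):
        frontier = {nxt for key in frontier for nxt in graph[key]} - found
        found |= frontier
        hops += 1
    return found


def select_steps(query: str, upstream: Dict[str, Set[str]]) -> Set[str]:
    match = QUERY.match(query)
    if match is None or match["key"] not in upstream:
        raise ValueError(f"cannot resolve step selection {query!r}")

    def depth(token: str) -> Optional[int]:
        return None if token == "*" else len(token)

    key = match["key"]
    selected = {key}
    selected |= walk(upstream, key, depth(match["up"]))
    selected |= walk(invert(upstream), key, depth(match["down"]))
    return selected


selection = select_steps("*solid_a.compute++", UPSTREAM)
print(sorted(selection))
# ['load.compute', 'solid_a.compute', 'solid_b.compute', 'solid_c.compute']
```

In this graph solid_d.compute is three levels downstream of solid_a.compute, so the two trailing plus signs leave it out of the selection.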
Community contributions
[dagster-databricks] Improved setup of Databricks environment (Thanks @sd2k!)
Fixed a bug where pipeline-level hooks were not correctly applied on a pipeline subset.
Improved error messages when execute command can't load a code pointer.
Fixed a bug that prevented serializing Spark intermediates with configured intermediate storages.
Dagit
Enabled subset reexecution via Dagit when part of the pipeline is still running.
Made Schedules clickable and link to View All page in the schedule section.
Various Dagit UI improvements.
Experimental
[lakehouse] Added CLI command for building and executing a pipeline that updates a given set of assets: house update --module package.module --assets my_asset*