Release 453-e LTS (30 Aug 2024)#
Dell Data Analytics Engine, powered by Starburst Enterprise platform (SEP) 453-e LTS is the follow up release to the 443-e LTS release.
It contains all improvements from Starburst Enterprise releases since the 443-e LTS release.
It includes all improvements from the following Trino releases:
Highlights since 443-e#
Added native filesystem support for object sorage.
Added Query result caching.
Promoted Schema discovery to general availability with added UI support.
Added support for location grants with built-in access control.
Breaking Changes#
SEP now requires JDK 22 to run.
The following configuration properties and catalog configuration properties have been removed. These properties must be removed from all configurations or the cluster fails to start:
delta.max-initial-splits
delta.max-initial-split-size
legacy.materialized-view-grace-period
The
PARTITION_COLUMN
andPARTITION_VALUE
arguments for theflush_metadata_cache
procedure of the Hive connector have been renamed toPARTITION_COLUMNS
andPARTITION_VALUES
, respectively. The old argument names no longer work.The BigQuery connector now enables Arrow serializaion by default, which requires a JVM configuration. You must either add
--add-opens=java.base/java.nio=ALL-UNNAMED
to yourjvm.config
or set thebigquery.arrow-serialization.enabled
catalog configuration property tofalse
in your BigQuery catalog configurations.Phoenix versions 5.1.x and earlier are no longer supported by the Phoenix connector.
Warp Speed custom warmup rules are no longer stored in an external RDBMS. Warmup rules are now saved in object storage in a location specified in
warp-speed.objectstore.store.path
. In order to allow an existing warmup configuration to continue uninterrupted, you must export any existing rules from the external database and import them to object storage, then update your catalog configuration with the object store path.The
optimizer.mark-distinct-strategy
andoptimizer.optimize-mixed-distinct-aggregations
configuration properties have been removed and are replaced by optimizer.distinct-aggregations-strategy.The
bigquery.parallelism
configuration property has been removed as scan parallelism is now enabled by default on the BigQuery connector.The
hive.experimental.schema-discovery.enabled
catalog configuration property has been removed as schema discovery is enabled by default. You must remove this property from the cluster configuration or SEP fails to start.
453-e initial changes#
General#
Released Schema discovery in general availability.
Added support for running schema discovery in the Starburst Enterprise web UI.
Security#
Added IAM role support for query result caching.
Delta Lake connector#
Added native filesystem support for cloud storage systems. See the migration guide for more information on switching to native Azure storage, Google Cloud storage, and S3 storage support.
Hive connector#
Added native filesystem support for cloud storage systems. See the migration guide for more information on switching to native Azure storage, Google Cloud storage, and S3 storage support.
Added support for excluding tables from caching with the
hive.file-status-cache-tables.excluded
catalog configuration property.
Hudi connector#
Added native filesystem support for cloud storage systems. See the migration guide for more information on switching to native Azure storage, Google Cloud storage, and S3 storage support.
Iceberg connector#
Added native filesystem support for cloud storage systems. See the migration guide for more information on switching to native Azure storage, Google Cloud storage, and S3 storage support.
SAP HANA connector#
Added support for parallel read operations.
Snowflake connector#
Added support for enforcing
DEFINER
security authorization during query execution for views when using impersonation.
Teradata connector#
Added support for enforcing
DEFINER
security authorization during query execution for views when using impersonation.
453-e.1 changes (30 August 2024)#
Fixed failure when a user-defined type name contains uppercase characters.
Fixed the Trino username incorrectly defaulting to the name of the user running the Trino process when no username is specified.
Fixed query failure when file-based network topology is configured with the
node-scheduler.network-topology.file
configuration property.Fixed failure for queries involving
json_parse()
and a cast to array, map, or row.
453-e.2 changes (13 Sep 2024)#
Fixed an issue which prevented
sysadmin
role from using Schema Discovery UX. To run schema discovery, the service account referenced byschema-discovery.starburst-user
must be listed instarburst.access-control.authorized-users
.Fixed a bug that caused cluster metrics to be created with incorrect intervals and subsequently led to loss of cluster metrics data.
Fixed query failures when Parquet files contain column names that only differ in case.
Fixed memory tracking issue for aggregations that could cause worker crashes with out-of-memory errors.
Fixed UI regression caused by incorrect property name.
Fixed Run and troubleshoot feature when
insights.authorized-groups
configuration property contains authorized groups.Fixed numeric overflow during managed statistics computation for large tables in Teradata mode session.
453-e.3 was skipped.
453-e.4 changes (18 Oct 2024)#
Enabled Warp Speed REST extensions by default.
Fixed failures with queries using table functions when
parent-project-id
is defined.Fixed OpenX JSON decoding a JSON array line that resulted in data being written to the wrong output column.
Fixed reading large Prometheus responses.
Fixed query failure when
bigquery.service-cache-ttl
configuration property is not0ms
andcase-insensitive-name-matching
is enabled.Fixed rare bug causing long planning times when Hive metastore caching is enabled.
Fixed failures for
count(*)
queries with predicates containing non-ASCII strings.Fixed failure when
bigquery.case-insensitive-name-matching
is enabled andbigquery.case-insensitive-name-matching.cache.ttl
is set to0m
.
453-e.5 changes (4 Nov 2024)#
Added support for
tinyint
orsmallint
tointeger
type coercion in Iceberg.Use
hive.metastore.partition-batch-size.max
config property value insync_partition_metadata
procedure. The default batch size is changed to 100 from 1000.Updated Iceberg connector migration procedure to use nullable columns by default.
Fixed query failure in MaxCompute connector when additional projects are specified with
maxcompute.additional-projects
.
453-e.6 changes (13 Nov 2024)#
Fixed memory leak in
InMemoryEventClient
within cache service.Fixed incorrect results when reading array columns and
bigquery.arrow-serialization.enabled
is set totrue
.
453-e.7 changes (27 Nov 2024)#
Improved performance when using various string functions in queries involving joins.
Fixed insert of invalid time zone data for tables using the
TIMESTAMP WITH TIME ZONE
type.Fixed incorrect results for queries filtering on a partition columns and the
NAME
column mapping is used.Fixed server error responses printing unprocessed user input.
Fixed query failures on impersonated Hive during delegated authentication checks.
453-e.8 changes (13 Dec 2024)#
Updated query result caching to use session property managers to resolve session property defaults and ensure that they are consistently applied.
Fixed incorrect column length of
VARCHAR
type in SingleStore version 8.Fixed failure of S3 file listing of buckets that enforce requester pays.
Fixed incorrect quoting of output values when the
CSV_UNQUOTED
orCSV_HEADER_UNQUOTED
format is used.