Skip to content

Starburst

Starburst is a powerful, enterprise-grade data analytics platform that allows you to query and analyze data from a wide variety of sources, all in a single, unified environment. It's built on top of the open-source Trino (formerly Presto) query engine, which is known for its speed, scalability, and flexibility.

Unlike traditional database management systems that require you to move and transform your data into a centralized repository, Starburst lets you query data where it lives - whether that's in a data warehouse, a data lake, or even disparate databases and file systems across your organization. This "data mesh" approach helps you avoid the time and cost of complex ETL (extract, transform, load) processes, while still giving you the ability to access and analyze all your relevant data.

Starburst parameters

Required parameters are in red and Optional parameters are in blue.


  • host, the IP address or URL of your Starburst catalog

  • port, the port number where your Starburst catalog is located
    default: 8080

  • catalog, the catalog to write new data to
    if tables do not exist in the catalog.schema location already, we'll auto-create them

  • schema, the schema to write new data to
    default: public, if tables do not exist in the catalog.schema location already, we'll auto-create them

  • authentication, the type of authentication mechanism to use to connect to Starburst
    ( allowed: basic, jwt )

  • username, your Starburst username

  • secret, the secret value to pass to the authentication mechanism
    this will be either a password or jwt

  • load_strategy, how to write new data into existing tables
    default: APPEND ( allowed: APPEND, TRUNCATE, UPSERT )
How do I use the Starburst syncer in commands?

cs_tools tools searchable bi-server --syncer starburst://host=0.0.0.0&catalog=...&schema=cs_tools&authentication=basic&username=admin&load_strategy=upsert

- or -

cs_tools tools searchable bi-server --syncer starburst://definition.toml

Definition TOML Example

definition.toml

[configuration]
host = "0.0.0.0"
catalog = "..."
schema = "cs_tools"
authentication = "basic"
username = "admin"
load_strategy = "upsert"