ThoughtSpot Software
Release: 7.0

High Availability (HA) and resilience

Follow these guidelines to ensure high availability (HA) of the ThoughtSpot application and resilience to node failure.

Requirements for node resilience

The cluster must have at least 3 nodes.
The cluster must have spare capacity; if one node fails, the remaining nodes must be able to host and serve all loaded data.
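
The two requirements above can be expressed as a quick capacity check. This is a minimal sketch, not a ThoughtSpot tool: the function name, the per-node capacity figures, and the use of gigabytes are assumptions made for illustration. It takes the worst case, which is losing the node with the largest capacity.

```python
from typing import Sequence


def survives_single_node_failure(
    node_capacities_gb: Sequence[float], loaded_data_gb: float
) -> bool:
    """Return True if the cluster can lose any one node and still hold all data.

    Hypothetical helper: capacities and data size are illustrative inputs,
    not values read from a real cluster.
    """
    # Requirement 1: the cluster must have at least 3 nodes.
    if len(node_capacities_gb) < 3:
        return False
    # Requirement 2: spare capacity. The worst case is losing the largest node,
    # so the remaining nodes must hold all loaded data without it.
    remaining_gb = sum(node_capacities_gb) - max(node_capacities_gb)
    return remaining_gb >= loaded_data_gb


if __name__ == "__main__":
    print(survives_single_node_failure([256, 256, 256], 500))
    print(survives_single_node_failure([256, 256, 256], 600))
```

With three 256 GB nodes, the first call passes because the two surviving nodes offer 512 GB for 500 GB of data; the second fails because 600 GB no longer fits.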

What happens during node failure

When a node loses connection with the main service manager process, it becomes unhealthy.
ThoughtSpot migrates all migratable services that run on the failed node to other (healthy) nodes. For all practical purposes, ThoughtSpot ignores the failed node until it reports itself as healthy.
ThoughtSpot rebalances and redistributes the data served from the failed node onto healthy nodes. Healthy nodes read the data from the HDFS storage layer into the in-memory database processes.
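
The redistribution step can be pictured with a toy model. The sketch below is an illustration only and does not describe ThoughtSpot's actual rebalancing algorithm: the notion of named "shards", the node names, and the least-loaded placement rule are all assumptions. It shows the general idea of moving everything a failed node served onto the remaining healthy nodes.

```python
import heapq
from typing import Dict, List


def redistribute(
    assignments: Dict[str, List[str]], failed: str
) -> Dict[str, List[str]]:
    """Move every shard from the failed node onto the least-loaded healthy nodes.

    Hypothetical model: "shards" and the placement rule are assumptions,
    not ThoughtSpot internals.
    """
    # Copy the healthy nodes' assignments so the input is not mutated.
    healthy = {n: list(s) for n, s in assignments.items() if n != failed}
    if not healthy:
        raise ValueError("no healthy nodes left to take over the data")
    # Min-heap keyed by current shard count; the node name breaks ties.
    heap = [(len(shards), node) for node, shards in healthy.items()]
    heapq.heapify(heap)
    for shard in assignments.get(failed, []):
        count, node = heapq.heappop(heap)
        healthy[node].append(shard)
        heapq.heappush(heap, (count + 1, node))
    return healthy


if __name__ == "__main__":
    cluster = {"n1": ["a", "b"], "n2": ["c"], "n3": ["d", "e"]}
    print(redistribute(cluster, failed="n3"))
```

In this model the failed node simply disappears from the result, which mirrors the statement above that the failed node is ignored until it reports itself as healthy again.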

Disruption: impact on users

Redistributing the data for the affected tables, and loading it from the HDFS layer onto the remaining healthy nodes, is not instantaneous. Users may therefore notice an impact on their experience while the failover is in progress.