Prepare the system and ThoughtSpot clusters for installation.
1. Set up hosts for the ThoughtSpot cluster
2. [Optional] Set up AWS Systems Manager Agent
3. Partition the hosts
4. Enable the hosts to download Amazon Linux 2 packages
5. Enable an Ansible Control Server
6. Disable SELinux or run it in permissive mode
7. Ensure tmp has permission 777
8. Remove Defaults requiretty from /etc/sudoers

Set up hosts for the ThoughtSpot cluster

Set up hosts for the ThoughtSpot cluster on Amazon Web Services.

  1. Refer to AWS configuration options for the exact specification for the hosts:
    • CPU
    • memory
    • disks
  2. Refer to Set up AWS resources for ThoughtSpot to create and launch your AWS virtual machines.
    • Ensure that your AMI is an Amazon Linux 2 image.

[Optional] Set up AWS Systems Manager Agent

If you plan to use the AWS SSM agent as an alternative to SSH, create a new IAM role while creating VMs. This IAM role must have an SSM policy to grant AWS SSM permission to perform actions on your instances. Refer to Create an IAM instance profile for Systems Manager.

You must install the SSM agent on each node. Refer to Manually install SSM Agent on EC2 instances for Linux, if the SSM agent is not already on each node.

Partition the hosts

As you create VMs, ensure that all ThoughtSpot hosts have the following partitions on the root drive.

Partition Drive type Description Minimum size
OS partition SSD (root drive) Root partition for OS, /tmp 100GB
Allocate at least 50GB to /tmp
Export partition SSD (root drive)  Stores ThoughtSpot objects, hdfs logs, service logs, and so on 200GB

Enable the hosts to download Amazon Linux 2 packages

Repositories

  • Yum repositories: you must enable the following Yum repositories in your cluster: epel, nux-desktop, google-cloud-sdk, and azure-cli.
  • Python repository: for Python, ensure the machine is able to reach the PyPI repository located at https://pypi.python.org/.
  • R repository: for R, ensure the machine is able to reach the CRAN repository located at cran.rstudio.com.

Make sure that you can download Amazon Linux 2 packages to all hosts, either from the official package repositories, or from a mirror repository owned and managed by your organization.

If you cannot access the Amazon Linux 2 repositories, there is no mirror repository in your organization, or you are unable to access Yum, Python, or R repositories, please contact ThoughtSpot Support.

Official package repositories

If the hosts of your ThoughtSpot cluster can access external repositories, either directly or through a proxy, your cluster is online. You can then proceed to download Yum, Python, and R package repositories.

Internal mirror repository

If the hosts of your ThoughtSpot cluster have access to an internal repository that mirrors the public repositories, copy the Yum, Python, and R package repositories to your hosts.

Enable an Ansible Control Server

Configure an Ansible Control Server, on a separate host, to run the Ansible playbook that ThoughtSpot supplies. You must install both rsync and Ansible on the Ansible Control Server host.

Disable SELinux or run it in permissive mode

ThoughtSpot does not support policies that enforce SELinux. We recommend that you disable SELinux, or run it in permissive mode.

Ensure tmp has permission 777

The /tmp directory must have the 777 permission.

Remove Defaults requiretty from /etc/sudoers

The /etc/sudoers file must not have the Defaults requiretty line. This line can cause cluster creation to fail.

Next steps

Next, get ThoughtSpot artifacts.