Running the Tools
Now that we've configured our environment, let's explore the tools!
You can find all of the tools by either visiting the documentation or by typing..
Archiver: Maintaining metadata
Archiver is a tool that helps you find and clean up stale answers and liveboards in your ThoughtSpot platform.
As the number of users grows, so does the content that they create. Even though no data from your data sources is stored within ThoughtSpot, maintaining unused references to this content still requires resources. Additionally, excess metadata can slow down the backup, restore, and cluster upgrade processes.
From the available commands, we can come up with an administrative process to help maintain a clean environment. You can use the framework below with Archiver.
Run Archiver identify to isolate inactive content. Label the tag name as clean-up.
Communicate to your ThoughtSpot User community that the admin team will be performing a cleanup activity.
Communication to Users
In an effort to keep ThoughtSpot running smoothly, Answers and Liveboards which haven't been viewed in over 120 days will be marked for deletion.
If you wish to keep stale content, please remove the clean-up tag from your Answers and Liveboards.
Wait a minimum of one week before performing any cleanup actions.
Maintain an active dialogue with your User community during this time, in case they have questions or concerns.
Run Archiver remove, targeting the tag name clean-up.
Strongly consider using the --export-tml option so you can restore any content that users wish they had saved during the deprecation cycle.
Try it out
Let's run the command to identify all content which hasn't been viewed within the past 3 months or created in the last 4 months. We'll use the tag clean-up to communicate to our Users that this content is targeted for removal.
We'll also ignore all of our Training content, since we haven't run a new user onboarding this quarter, and our Certified datasets. It's unlikely that Certified content hasn't been viewed in the past 90 days, but it's better to be safe than sorry!
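The selection rule described above can be sketched in a few lines of Python. This is only an illustration of the filtering logic, not how Archiver is implemented: the `Content` record, its field names, and the `is_stale` helper are all hypothetical, the tag names `Training` and `Certified` are assumed from the prose, and the 90- and 120-day windows simply restate the 3- and 4-month thresholds.

```python
from dataclasses import dataclass, field
from datetime import date, timedelta


@dataclass
class Content:
    """Hypothetical stand-in for an Answer or Liveboard; not a CS Tools type."""
    name: str
    created: date
    last_viewed: date
    tags: set = field(default_factory=set)


def is_stale(item, today, viewed_days=90, created_days=120,
             ignore_tags=frozenset({"Training", "Certified"})):
    """Return True when the item would be a candidate for the clean-up tag."""
    if item.tags & ignore_tags:
        return False  # protected content is skipped regardless of activity
    viewed_recently = item.last_viewed >= today - timedelta(days=viewed_days)
    created_recently = item.created >= today - timedelta(days=created_days)
    return not viewed_recently and not created_recently


# Made-up sample data, purely for illustration.
today = date(2024, 6, 1)
catalog = [
    Content("Old sales answer", date(2023, 1, 10), date(2023, 9, 1)),
    Content("Onboarding liveboard", date(2023, 1, 10), date(2023, 9, 1), {"Training"}),
    Content("New pipeline answer", date(2024, 5, 1), date(2024, 5, 1)),
]
stale_names = [c.name for c in catalog if is_stale(c, today)]
print(stale_names)
```

With this sample data, only the first item qualifies: the second is protected by its tag, and the third was both created and viewed recently.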
Using --config
Since we set our non-prod configuration as the default, usage of --config non-prod is optional here.
CS Tools will attempt to use the default configuration first, but if we wanted to run Archiver against another cluster (e.g. Production), we can override it with the --config option.
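The precedence described here, where an explicit --config value wins and the default is used otherwise, can be pictured with a small sketch. The `resolve_config` function and the `production` configuration name are hypothetical and only model the behavior described above; they are not part of CS Tools.

```python
from typing import Optional


def resolve_config(cli_config: Optional[str], default_config: Optional[str]) -> str:
    """Pick a configuration name: an explicit --config value wins over the default."""
    if cli_config is not None:
        return cli_config
    if default_config is not None:
        return default_config
    raise ValueError("no --config given and no default configuration is set")


# No --config passed: fall back to the default.
print(resolve_config(None, "non-prod"))
# --config passed: the explicit value overrides the default.
print(resolve_config("production", "non-prod"))
```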
Administrative Responsibility
The maintainers of CS Tools recognize that these tools give you a lot of power.
For this reason, with tools that perform bulk management, it's important to have fail-safes before executing the commands.
With Archiver, you'll be prompted to continue with your operation before affecting potentially hundreds of Users.
Report on Archiver operations
In highly active ThoughtSpot deployments, the identification process can find more content than the admin team is comfortable investigating directly in the web user interface.
On all three Archiver commands, you'll find a --syncer option which reports information about the type of content that we'll tag or remove, as well as who owns it. This information can be incredibly helpful to inspect and will help you communicate with those Users directly.
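As a rough idea of how such a report could support direct communication, the sketch below groups report rows by owner so each User can be sent a list of their tagged content. The row layout (`content_type`, `name`, `owner`) and the sample values are assumptions made for illustration; the actual columns depend on what Archiver writes through the Syncer.

```python
from collections import defaultdict

# Hypothetical report rows; the real columns depend on the Syncer output.
rows = [
    {"content_type": "answer", "name": "Old sales answer", "owner": "alice"},
    {"content_type": "liveboard", "name": "Q1 pipeline", "owner": "bob"},
    {"content_type": "answer", "name": "Churn draft", "owner": "alice"},
]


def group_by_owner(report_rows):
    """Map each owner to the list of their tagged content, in report order."""
    grouped = defaultdict(list)
    for row in report_rows:
        grouped[row["owner"]].append(f'{row["content_type"]}: {row["name"]}')
    return dict(grouped)


by_owner = group_by_owner(rows)
for owner, items in sorted(by_owner.items()):
    print(f"{owner}: {len(items)} item(s) tagged clean-up")
```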
In the next section, we'll learn about Syncers and how you can leverage them to manage all the information that Archiver produces.