Populating an intermediate table
After you create an intermediate table, you must populate it to materialize data. You can also repopulate an intermediate table to refresh the data when base tables are updated.
Each populate operation creates a new version of the intermediate table. The previous version remains available during the refresh. If the refresh fails, the existing data is not affected.
Note
Populating an intermediate table decrements access budgets on all referenced base tables, including transitive dependencies.
To populate an intermediate table
-
Open the AWS Clean Rooms console at https://console.aws.amazon.com/cleanrooms/
. -
In the left navigation pane, choose Collaborations.
-
Choose the collaboration, and then choose the Tables tab.
-
Choose the intermediate table that you want to populate.
-
Choose Populate.
-
In the Populate dialog, review the stored analysis (SQL query or analysis template reference).
-
Configure the following settings:
-
Query compute payer – If your collaboration has multiple query compute payers, select the one that pays for query compute costs for the populate operation.
-
Worker type – The instance type for the populate job (default: CR.1X).
-
Number of workers – The number of instances to use (2–128, default: 16).
-
(Optional) Spark properties – Custom Spark runtime configuration.
-
-
Choose Populate.
Tracking progress
To view the status of a populate operation, choose the Analysis tab on the intermediate table details page. You can view the following information:
-
Protected query ID (linked to query details)
-
Status
-
Worker type
-
Number of workers
-
Billed CRPU hours