Manage datasets
The Data Management page is where you manage all encrypted datasets in your organization—from uploads to workspace sharing and access controls. As shown in the following figure, you can:
- View and manage all datasets in your organization.
- Add datasets to your organization.
- Share datasets with Workspaces for collaboration.
- Adjust the datasets table view to suit your needs.
The Data Management table lists all datasets in your organization and includes the following details:
- Name: The dataset's name, set by the member who added it.
- Uploaded By: The name of the member who connected the dataset.
- Uploaded On: The date the dataset was added to Opaque.
- Data Source: The origin of the dataset (e.g., Local Upload, AWS, Azure, Google Cloud, or Result).
By default, datasets are sorted in ascending order by creation date (oldest to newest). Use the column headers to sort by name, member, date, or data source. Click the three-dot icon on the right side of each entry for additional details.
To access the Data Management page, select Data Management from the primary navigation bar.
Search and filter datasets
To narrow the list of datasets in the table, search by dataset name or apply filters:
- To search by name, enter a full or partial dataset name in the search field. Searches are case-sensitive.
- To apply additional filters, click the filter icon next to the search field:
    - Date (From / To): Show only datasets connected during a specific date range.
    - Data owner (Uploaded By): Filter by one or more users who connected the datasets.
    - Data source: Limit results to specific cloud providers, such as AWS or Google Cloud.
You can also sort datasets by column headers (e.g., alphabetical by name or newest first).
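The search, filter, and sort behavior described above can be pictured with a small Python model. This is only an illustrative sketch, not Opaque's API: the `Dataset` record, `filter_datasets`, `sort_by_upload_date`, and the sample rows are all hypothetical names invented for this example. It also assumes that a partial name matches anywhere in the dataset name, that the date range is inclusive, and that filters combine with AND. None of those details is stated on this page.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class Dataset:
    # Hypothetical record mirroring the table columns; not an Opaque type.
    name: str
    uploaded_by: str
    uploaded_on: date
    data_source: str


def filter_datasets(
    datasets: Iterable[Dataset],
    name_query: str = "",
    date_from: Optional[date] = None,
    date_to: Optional[date] = None,
    uploaded_by: Optional[Set[str]] = None,
    data_sources: Optional[Set[str]] = None,
) -> List[Dataset]:
    """Return the datasets that satisfy every filter that is set."""
    matches = []
    for ds in datasets:
        # Case-sensitive substring match: "sales" does not match "Sales".
        if name_query and name_query not in ds.name:
            continue
        # Assumed inclusive date range on the upload date.
        if date_from is not None and ds.uploaded_on < date_from:
            continue
        if date_to is not None and ds.uploaded_on > date_to:
            continue
        # One or more uploaders / data sources may be selected.
        if uploaded_by is not None and ds.uploaded_by not in uploaded_by:
            continue
        if data_sources is not None and ds.data_source not in data_sources:
            continue
        matches.append(ds)
    return matches


def sort_by_upload_date(datasets: Iterable[Dataset], newest_first: bool = False) -> List[Dataset]:
    """Sort by upload date; oldest first mirrors the default table order."""
    return sorted(datasets, key=lambda ds: ds.uploaded_on, reverse=newest_first)


# Made-up sample rows for illustration only.
SAMPLE = [
    Dataset("Sales_2023", "Alice", date(2024, 1, 10), "AWS"),
    Dataset("sales_archive", "Bob", date(2024, 2, 5), "Local Upload"),
    Dataset("Claims", "Alice", date(2024, 3, 1), "Google Cloud"),
]

# Searching for "Sales" skips "sales_archive" because matching is case-sensitive.
capitalized_only = filter_datasets(SAMPLE, name_query="Sales")
```

In this model, searching for `Sales` returns only `Sales_2023`, which is why a search that unexpectedly comes back empty is worth retrying with different capitalization.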
View dataset information
To view details about a dataset:
- Navigate to the Data Management page.
- Click the three-dot icon in the dataset's row.
This opens the General Information panel.
The General Information panel displays the following information:
- Dataset name: The name of the dataset.
- Uploaded On: The date and time the dataset was added.
- Data Source: The origin of the data file (e.g., local upload, cloud storage).
- Uploaded By: The user who uploaded the dataset.
- Description: A brief summary of the dataset.
- Has a test dataset: Indicates whether a test dataset exists and, if so, its type:
    - Synthetic Dataset: Reflects the original data's distributions and format.
    - Dummy Dataset: Generated based on column names and types; may not match the original format.
- Size: The size of the dataset in bytes.
- Associated Workspaces: Lists the workspaces with access to the dataset and provides the option to revoke access from workspaces.
- Schema: Displays the number of accessible columns in the dataset with an option to view the schema.
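Because the Size field is reported in raw bytes, large datasets can be hard to compare at a glance. The helper below is a minimal sketch for converting a byte count into a more readable figure. `format_size` is a hypothetical name, not part of Opaque, and the choice of binary units (1 KiB = 1024 bytes) is an assumption.

```python
def format_size(num_bytes: int) -> str:
    """Format a byte count using binary units (assumed: 1 KiB = 1024 bytes)."""
    if num_bytes < 0:
        raise ValueError("size must be non-negative")
    size = float(num_bytes)
    units = ("B", "KiB", "MiB", "GiB", "TiB")
    for unit in units:
        # Stop at the first unit that keeps the value below 1024,
        # or at the largest unit in the table.
        if size < 1024 or unit == units[-1]:
            break
        size /= 1024
    # Whole bytes are shown without a decimal part.
    return f"{int(size)} B" if unit == "B" else f"{size:.1f} {unit}"


print(format_size(1536))
```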
Revoke dataset access
To revoke dataset access from a workspace:
- Navigate to the Data Management page.
- Click the three-dot icon in the dataset's row.
- In the General Information panel, under Associated Workspaces, click Revoke next to the workspace name.
Revoking access prevents any job in the workspace (new, existing, or pending) from using the dataset, including pending jobs that have not yet been executed.
Download a dataset
Only data owners can download data files. To download a data file:
- On the Data Management page, select the dataset.
- Click the Download icon in the top-right corner.
The dataset will be decrypted and downloaded to your local computer.
Remove a dataset
To remove a dataset from your organization:
- Navigate to the Data Management page.
- Select the dataset or datasets you want to remove.
- Click the Trash icon in the top-right corner.
!!! warning "Beware"
    This action permanently removes the dataset from Opaque and from all associated workspaces.