Skip to main content

Cluster

The Cluster skill lets you divide groups of abstract data into classes of similar data. Unlike Train, the Cluster skill doesn't require a labeled dataset.

Format

Cluster has a single format with several variations: Cluster data (excluding | including) <columns>.

Parameters

Cluster uses the following parameters:

  • columns (optional). The columns to exclude or include in your cluster. Not selecting a column will use all columns in the dataset to cluster data.

Output

If data is successfully clustered, a preview message appears in the conversation history. This message contains a link to preview the ClusterResults dataset. An output also appears in the Chart tab displaying the Cluster Centroids, Models, Scores, and Pipeline Report.

Examples

To cluster the "Titanic Dataset" in it's entirety, enter Cluster data.

To cluster the Age, Fare, and Gender columns, enter Cluster data including Age, Fare, Gender.

Feedback