Cluster
The Cluster
skill lets you divide groups of abstract data into classes of similar data. Unlike Train
, the Cluster
skill doesn't require a labeled dataset.
Format
Cluster
has a single format with several variations: Cluster data (excluding | including) <columns>
.
Parameters
Cluster
uses the following parameters:
columns
(optional). The columns to exclude or include in your cluster. Not selecting a column will use all columns in the dataset to cluster data.
Output
If data is successfully clustered, a preview message appears in the conversation history. This message contains a link to preview the ClusterResults dataset. An output also appears in the Chart tab displaying the Cluster Centroids, Models, Scores, and Pipeline Report.
Examples
To cluster the "Titanic Dataset" in it's entirety, enter Cluster data
.
To cluster the Age, Fare, and Gender columns, enter Cluster data including Age, Fare, Gender
.