Discovering new intents
#
Discovering intents with uncertaintyIn this video we explain how to use the `uncertainty` metric to discover new intents in the unlabeled data of an existing workspace.
- You'll need a workspace with at least a dozen or more intents that each have a significant amount of training data.
- Make sure to have trained your NLU models.
- Sort your unlabeled data by
uncertainty
. - Select one of the top results that has appeal.
- Sort by
similarity to stash
and select more relevant results.
`Uncertainty` surfaces utterances that your model has a very difficult time understanding. However, one of the caveats to this metric is that it does not take frequency or density into account. This means that it may surface outlier utterances that have little to no business value being labeled because they only occurred once in your entire data set.
#
Discovering intents with densityIn this video we explain how to use `density` sorting with clustering to discover new intents in your unlabeled data.
- Sort your unlabeled data by
density
. - Select one of the top items.
- Sort by
similarity to stash
and activate clustering. - Select relevant clusters.
`Density` is an interesting metric because it surfaces utterances that have many siblings (_occur often in your data_).