Extends a dataset with the distance in meters between lat/long fields and a reference point.
Given a dataset and a categorical field, finds the minimum scale required to create class purity in the cluster with k = number of classes.