A Dataset is a collection of examples. Each example in turn is comprise of one or more features, and possibly a label.
Datasets in MarkovML are immutable and static.

A single dataset can be segmented or unsegmented. ML engineers frequently divide datasets into different segments to be used for training, testing, and/or validating a model. MarkovML gives you the ability to specify segments of a dataset and provides insights into how segments compare with one another.

Data Families

A Data Family is a group of one or more Datasets that share similar characteristics.
It is often useful to be able to group related Datasets together; Data Families allow you to do just this.