4c01876. Token vocabulary from training data set ; radial sampling of high-dimensional latent vector oversphere of given radius pseudocode; training data set exploratory data analysis; training and ...