(10/23) The point is we can map words, and collections of words (like essay answers) to points in some many-dimensional space (e.g., 300 dimensions for word2vec). Mapped into such spaces, words/answers with similar content are close to each other. See e.g., https://projector.tensorflow.org