In word sense disambiguation and named-entity disambiguation, an important assumption is that a document consists of related concepts and entities.
There are millions of concepts and entities, what makes some related but not others? This question is difficult and I don’t have the definitive answer. But it is a good start to list some classes of relatedness. Continue reading