Weighting Tags and Paths in XML Documents According to Their Topic Generalization