AnintegratedcollectionofJavacodeusefulforstatisticalnaturallanguageprocessing,documentclassification,clustering,informationextraction.