Data analysis of sougou.500w.utf8 with MR and Hive