Goodreads Book Datasets With User Rating 10M dataset
Goodreads is familiar to most readers: someone looking for a book typically searches for its title on the site first, then reads its reviews and ratings. These datasets suit two tasks: building a book recommendation system based on 10 million books, and using the description column for NLP.

The dataset consists of the following files:
book1-100k.csv
book100k-200k.csv
book200k-300k.csv
book300k-400k.csv
book400k-500k.csv
book500k-600k.csv
book600k-700k.csv
book700k-800k.csv
book800k-900k.csv
book900k-1000k.csv
book1000k-1100k.csv
book1100k-1200k.csv
book1200k-1300k.csv
book1300k-1400k.csv
book1400k-1500k.csv
book1500k-1600k.csv
book1600k-1700k.csv
book1700k-1800k.csv
book1800k-1900k.csv
book1900k-2000k.csv
user_rating_0_to_1000.csv
user_rating_1000_to_2000.csv
user_rating_2000_to_3000.csv
user_rating_3000_to_4000.csv
user_rating_4000_to_5000.csv
user_rating_5000_to_6000.csv
user_rating_6000_to_11000.csv
book3000k-4000k.csv
book4000k-5000k.csv
book2000k-3000k.csv
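Because the data is split across many CSV parts, a first step for either task is usually to read the parts as one stream. The following is a minimal sketch using only the Python standard library. It assumes the archive has been extracted into a single directory and that the files are UTF-8 encoded with a header row; neither is confirmed by the listing. The function names `iter_rows` and `count_rows` and the added `source_file` key are illustrative, and no column names are assumed.

```python
import csv
import glob
import os


def iter_rows(directory, pattern):
    """Yield every row of each CSV in `directory` matching `pattern` as a dict.

    Assumption: files are UTF-8 and start with a header row.
    """
    for path in sorted(glob.glob(os.path.join(directory, pattern))):
        with open(path, newline="", encoding="utf-8") as handle:
            for row in csv.DictReader(handle):
                # Record which part the row came from (illustrative extra key).
                row["source_file"] = os.path.basename(path)
                yield row


def count_rows(directory, pattern):
    """Count data rows across all matching parts without loading them into memory."""
    return sum(1 for _ in iter_rows(directory, pattern))
```

With the archive extracted to a hypothetical directory `data`, `iter_rows("data", "book*.csv")` would walk the book parts and `iter_rows("data", "user_rating_*.csv")` the rating parts. Streaming rows rather than loading everything at once keeps memory use low, which matters here because some parts are well over 100 MB.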
File list
94125.zip
(approximately 30 files)
book4000k-5000k.csv (171.1 MB)
book100k-200k.csv (8.42 MB)
user_rating_1000_to_2000.csv (2.14 MB)
user_rating_0_to_1000.csv (2.55 MB)
book300k-400k.csv (8.32 MB)
user_rating_5000_to_6000.csv (772 KB)
book1800k-1900k.csv (28.37 MB)
book400k-500k.csv (8.07 MB)
book800k-900k.csv (44.2 MB)
book600k-700k.csv (68.69 MB)