Hi, @guedes-joaofelipe! Thank you for your issue, but we can't reproduce the problem here. So could you please check your dataset and your environment again?
@EliverQ I had the same problem when converting the Yelp dataset on Windows.
Traceback (most recent call last):
  File "run.py", line 40, in <module>
    datasets.convert_inter()
  File "D:\学业\研究生\数据集\数据集转换程序\RecSysDatasets-master\conversion_tools\src\extended_dataset.py", line 4581, in convert_inter
    for _ in fin:
UnicodeDecodeError: 'gbk' codec can't decode byte 0x8b in position 1909: illegal multibyte sequence
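The 'gbk' in this message suggests the file was opened without an explicit encoding, in which case Python falls back to the locale's preferred encoding (GBK on a Chinese-language Windows install). Below is a minimal sketch of the usual workaround: passing `encoding` to `open()`. The helper `count_lines` and the temporary sample file are hypothetical and exist only for illustration, and treating the Yelp file as UTF-8 is an assumption, not something confirmed in this thread.

```python
import os
import tempfile


def count_lines(path, encoding="utf-8"):
    """Count lines, decoding with an explicit encoding instead of the locale default."""
    with open(path, "r", encoding=encoding) as fin:
        return sum(1 for _ in fin)


# Illustrative data only: two lines containing non-ASCII characters, stored as UTF-8.
sample = "café\nnaïve\n"
with tempfile.NamedTemporaryFile("wb", suffix=".txt", delete=False) as tmp:
    tmp.write(sample.encode("utf-8"))
    demo_path = tmp.name

n_lines = count_lines(demo_path)  # decodes as UTF-8 regardless of the OS locale
os.remove(demo_path)
```

If the conversion script opens the file the same way, adding the `encoding` argument at that `open()` call may be enough, provided the data really is UTF-8.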
I followed the instructions in Readme.md to download and convert the MovieLens dataset, but I got the following error:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 2892: invalid continuation byte
I changed the pd.read_csv call in conversion_tools/src/extended_dataset.py (line 52) to include an encoding argument, which fixed the problem:
pd.read_csv(self.item_file, delimiter=self.item_sep, header=None, engine='python', encoding="ISO-8859-1")
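For anyone who would rather not hardcode the encoding, here is a rough sketch of trying UTF-8 first and falling back to ISO-8859-1 only when decoding fails. The helper `first_working_encoding` and the sample row are made up for illustration and are not part of the conversion tools. Note that ISO-8859-1 maps every byte to a character, so it never raises; it is a last resort and can silently produce wrong characters if the file is actually in some other encoding.

```python
def first_working_encoding(raw, candidates=("utf-8", "ISO-8859-1")):
    """Return the first candidate encoding that decodes `raw` without error."""
    for enc in candidates:
        try:
            raw.decode(enc)
        except UnicodeDecodeError:
            continue  # this encoding cannot represent the bytes; try the next one
        return enc
    raise ValueError("none of the candidate encodings could decode the data")


# Made-up sample row: in ISO-8859-1 the character 'é' is the single byte 0xe9,
# which is not valid UTF-8 when followed by a plain ASCII byte.
sample = "1::Les Misérables (1995)::Drama\n".encode("ISO-8859-1")
detected = first_working_encoding(sample)
```

The returned name could then be passed as the `encoding` argument of `pd.read_csv`, after reading the file's bytes once with `open(path, "rb")`.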