You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Call HcclAllReduce(send_buf, recv_buf, count, PDDataTypeToHcclDataType(data_type), PDReduceOpToHcclReduceOp(op), reinterpret_cast<HcclComm>(comm), reinterpret_cast<aclrtStream>(stream)) failed : 5 at file /root/PaddleCustomDevice/backends/npu/runtime/runtime.cc line 881
E40024: 2024-07-26-11:47:38.607.914 Failed call Python Func/Meathod [get_binfile_sha256_hash_from_c], Reason[SystemError: PY_SSIZE_T_CLEAN macro must be defined for '#' formats
]
Possible Cause: The Python Func/Meathod does not exist.
LAUNCH INFO 2024-07-26 11:57:05,580 Exit code -11
单卡训练时遇到的问题:
训练代码会在中途卡住不动,不清楚是什么原因造成的。
The text was updated successfully, but these errors were encountered:
Paddle版本:
CANN版本:
8.0.RC1
操作系统版本:
Ubuntu 20.04.3 LTS
lora_stf_argument.json
为:训练脚本:
报错信息如下:
训练代码会在中途卡住不动,不清楚是什么原因造成的。
The text was updated successfully, but these errors were encountered: