quantitative model request #2

Open
callmehanyu opened this issue Aug 15, 2024 · 0 comments

callmehanyu commented Aug 15, 2024

Could you provide a q4f16 quantized version of the model? I tried swapping in a q4f16 file (decoder_model_merged_q4f16.onnx) myself, but loading it failed with this error:

Unhandled exception. Microsoft.ML.OnnxRuntime.OnnxRuntimeException: [ErrorCode:Fail] Load model from ./models/decoder_model_merged_q4f16.onnx failed:/Users/runner/work/1/s/onnxruntime/core/graph/graph.cc:1421 void onnxruntime::Graph::InitializeStateFromModelFileGraphProto() This is an invalid model. Subgraph output (logits) is an outer scope value being returned directly. Please update the model to add an Identity node between the outer scope value and the subgraph output.
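
The error message asks for an Identity node between the outer-scope value and the subgraph output. A minimal sketch of that rewrite with the `onnx` Python package follows. It has not been confirmed to fix this particular file: it assumes the outer-scope value is genuinely available where the subgraph runs, and the `_identity` suffix and the function names are hypothetical choices made for illustration.

```python
def passthrough_outputs(output_names, node_output_names, local_names=()):
    """Return the subgraph outputs that nothing inside the subgraph defines.

    An output is treated as locally defined if a node in the subgraph
    produces it, or if it is one of the subgraph's own inputs/initializers.
    Anything else must come from the outer scope.
    """
    defined = set(node_output_names) | set(local_names)
    return [name for name in output_names if name not in defined]


def add_identity_nodes(graph):
    """Recursively patch subgraphs (e.g. If branches); return the patch count."""
    # Imported lazily so the pure helper above works without onnx installed.
    from onnx import AttributeProto, helper

    patched = 0
    for node in graph.node:
        for attr in node.attribute:
            if attr.type == AttributeProto.GRAPH:
                subgraphs = [attr.g]
            elif attr.type == AttributeProto.GRAPHS:
                subgraphs = list(attr.graphs)
            else:
                continue
            for sub in subgraphs:
                patched += add_identity_nodes(sub)  # handle nested subgraphs
                produced = [o for n in sub.node for o in n.output]
                local = [i.name for i in sub.input]
                local += [t.name for t in sub.initializer]
                outer = set(passthrough_outputs(
                    [o.name for o in sub.output], produced, local))
                for out in sub.output:
                    if out.name in outer:
                        # Hypothetical naming scheme; assumes no collision.
                        new_name = out.name + "_identity"
                        sub.node.extend([
                            helper.make_node("Identity", [out.name], [new_name])
                        ])
                        out.name = new_name
                        patched += 1
    return patched


def patch_file(src_path, dst_path):
    """Load a model, insert the Identity nodes, and save a patched copy."""
    import onnx

    model = onnx.load(src_path)
    count = add_identity_nodes(model.graph)
    onnx.save(model, dst_path)
    return count
```

Calling `patch_file("./models/decoder_model_merged_q4f16.onnx", out_path)` with an output path of your choosing would write a patched copy. Renaming the subgraph output is safe for `If` branches because branch outputs are matched to the `If` node's outputs by position, not by name.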
