Skip to content

Conversation

@SinanGncgl
Copy link
Contributor

Pull Request Template

Checklist

  • Confirmed that cargo run-checks command has been executed.

Related Issues/PRs

#3812

Changes

Problem: ONNX models >2GB cannot be loaded due to protobuf size limits. The protobuf format stores tensor data inline, causing failures for large models.

Solution: Added external data support where tensor weights are stored in separate .bin files. The .onnx file contains only metadata (location, offset, length, checksum). This enables loading models of any size while maintaining full backward compatibility.

Testing

Tested with small units which has 2.68GB of model.

@antimora
Copy link
Collaborator

antimora commented Oct 8, 2025

@SinanGncgl , thanks for taking this up. However, I am working on a major refactoring of onnx-ir that would affect your PR. The refactor also covers needed infrastructure #3596

This PR would need to wait till the refactor is complete. Sorry if this wasn't communicated earlier.

@SinanGncgl
Copy link
Contributor Author

@SinanGncgl , thanks for taking this up. However, I am working on a major refactoring of onnx-ir that would affect your PR. The refactor also covers needed infrastructure #3596

This PR would need to wait till the refactor is complete. Sorry if this wasn't communicated earlier.

No worries I can mark it as draft, thanks for letting me know.

@SinanGncgl SinanGncgl marked this pull request as draft October 8, 2025 22:51
@SinanGncgl SinanGncgl closed this by deleting the head repository Oct 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants