-
Notifications
You must be signed in to change notification settings - Fork 57
Add a Heat aware DistributedSampler for torch usage. #1807
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Add a Heat aware DistributedSampler for torch usage. #1807
Conversation
… on a process local basis.
for more information, see https://pre-commit.ci
|
Thank you for the PR! |
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #1807 +/- ##
==========================================
- Coverage 91.96% 91.62% -0.34%
==========================================
Files 88 88
Lines 13496 13697 +201
==========================================
+ Hits 12411 12550 +139
- Misses 1085 1147 +62
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
Thank you for the PR! |
1 similar comment
|
Thank you for the PR! |
|
Thank you for the PR! |
|
When using the normal comm.Bcast the Bcast only works the first time and the seconds time not anymore. ❯ mpirun -np 2 python test.py
0 tensor([2, 4, 3, 0, 1], dtype=torch.int32)
1 tensor([2, 4, 3, 0, 1], dtype=torch.int32)
...
1 tensor([455, 0, 0, 0, 32], dtype=torch.int32)
0 tensor([1, 4, 3, 2, 0], dtype=torch.int32) |
|
Thank you for the PR! |
…_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem
|
Thank you for the PR! |
for more information, see https://pre-commit.ci
|
Thank you for the PR! |
|
Thank you for the PR! |
for more information, see https://pre-commit.ci
…_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem
…_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem
for more information, see https://pre-commit.ci
- Add Comments for init - made idxmap creation faster
…mentation_of_a_Heat_Aware_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem' into features/1789-Implementation_of_a_Heat_Aware_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem
for more information, see https://pre-commit.ci
- remove randperm for correction
- Added Train, val split function using the sampler.
for more information, see https://pre-commit.ci
…_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem
…_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem
…ct_randomization_of_tests_set_seed_in_basic_test_case_class set seed in basic test class
…_DistributedSampler_for_Interoperability_with_Pytorch_Ecosystem
|
Thank you for the PR! |
|
TODO
|
Merge PR #1917 before this one!
Due Diligence
Description
Issue/s resolved: #1789
Changes proposed:
Add a Heat aware DistributedSampler for usage for PyTorch use cases
Update: Added a train_val_split function.
Type of change
Does this change modify the behaviour of other functions? If so, which?
no