[SIMD] Model weights updating using AVX Instructions #11
Conversation
I ran some performance tests using the perf-testing branch and got the following results:
| Code version | Num weights | Num Threads | Time (s) |
|---|---|---|---|
| Non SIMD | 6000 | 1 | 132.79 |
| SIMD | 6000 | 1 | 134.97 |
| Non SIMD | 15000 | 1 | 843.7 |
| SIMD | 15000 | 1 | 858.7 |
I'll continue investigating why there is no performance improvement.
    if (k == g_accumulator.size() - 1 && iters_sum > 0) {
      const float iters_sum_arr[8] = {iters_sum, iters_sum, iters_sum, iters_sum,
                                      iters_sum, iters_sum, iters_sum, iters_sum};
      iters_sum_slice = _mm256_loadu_ps(iters_sum_arr);
I think you can use `_mm256_broadcast_ss(&iters_sum)` instead of `_mm256_loadu_ps`, so you don't have to create the temporary array first. I tested this and it builds and runs, but you may want to double-check correctness.
    }
    const float n_iter_arr[8] = {n_iter, n_iter, n_iter, n_iter,
                                 n_iter, n_iter, n_iter, n_iter};
    __m256 n_iter_slice = _mm256_loadu_ps(n_iter_arr);
Same here: `_mm256_broadcast_ss(&n_iter)` would avoid the temporary array.
    continue; // Didn't receive this variable from any clients
    }
    // Multiply the weights by local iterations and update g_old_params[v_name].
    for (int i = 0; i < acc_params[v_name].size() / 8 * 8; i += 8) {
The variable `i` has already been used in the outer for loop; reusing it here shadows the outer index and makes the code harder to follow.
    _mm256_storeu_ps(g_old_params[v_name].data() + i, updated_old_params_v_name_slice);
    }
    // Tail case.
    for (int i = acc_params[v_name].size() / 8 * 8; i < acc_params[v_name].size(); i++) {
See comment above
This PR further optimizes the aggregation code by performing arithmetic operations with Intel's 256-bit AVX instructions. It also includes smaller optimizations: removing unnecessary copying of data, a logic change that avoids redundant loads and stores, and converting double types to floats. The code used to test performance is here: https://github.com/octaviansima/secure-aggregation/blob/perf-testing/server/tests/host_test.cpp
Note that the PR is large only because it includes the Intel intrinsics header files required for compilation.