[MXNET-259] Performance improvement of random.shuffle #10351

asitstands · 2018-03-31T09:30:07Z

Description

For multidimensional arrays on CPU, mx.random.shuffle implements Fisher-Yates algorithm which needs n number of swaps of two memory ranges where n is the length of the first axis. Previously the swap is performed by std::swap_ranges. This PR replaces it with a manual swap using std::memcpy. This brings a good performance gain. For example, shuffling of arrays with shape (1000, 10000) is almost ten times faster in linux/g++-6.4.1 and linux/clang++-6.0. (1D arrays on CPU and arrays on GPU are not in the scope of this PR.)

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

asitstands · 2018-03-31T09:40:11Z

@reminisce @marcoabreu

piiswrong · 2018-04-01T23:08:10Z

src/operator/random/shuffle_op.cc

  CHECK_GT(first_axis_len, 0U);
+  const size_t stride_bytes = sizeof(DType) * stride;
+  Tensor<cpu, 1, char> buf =
+    ctx.requested[1].get_space_typed<cpu, 1, char>(Shape1(stride_bytes), ctx.get_stream<cpu>());


I think you need to add Fresources to operator registration?

It is there because the GPU version already needed a temp space.

Replace std::swap_ranges with memcpy

5585387

asitstands requested a review from cjolivier01 as a code owner March 31, 2018 09:30

marcoabreu approved these changes Mar 31, 2018

View reviewed changes

marcoabreu requested a review from reminisce March 31, 2018 12:22

piiswrong reviewed Apr 1, 2018

View reviewed changes

piiswrong merged commit b37b3f5 into apache:master Apr 2, 2018

lanking520 pushed a commit to lanking520/incubator-mxnet that referenced this pull request Apr 2, 2018

Replace std::swap_ranges with memcpy (apache#10351)

499a186

asitstands deleted the shuffle_memcpy branch May 22, 2018 07:16

rahul003 pushed a commit to rahul003/mxnet that referenced this pull request Jun 4, 2018

Replace std::swap_ranges with memcpy (apache#10351)

f0e1c76

zheng-da pushed a commit to zheng-da/incubator-mxnet that referenced this pull request Jun 28, 2018

Replace std::swap_ranges with memcpy (apache#10351)

78a8f29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MXNET-259] Performance improvement of random.shuffle #10351

[MXNET-259] Performance improvement of random.shuffle #10351

Uh oh!

asitstands commented Mar 31, 2018 •

edited

Loading

Uh oh!

asitstands commented Mar 31, 2018

Uh oh!

piiswrong Apr 1, 2018

Uh oh!

asitstands Apr 2, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[MXNET-259] Performance improvement of random.shuffle #10351

[MXNET-259] Performance improvement of random.shuffle #10351

Uh oh!

Conversation

asitstands commented Mar 31, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Essentials

Uh oh!

asitstands commented Mar 31, 2018

Uh oh!

piiswrong Apr 1, 2018

Choose a reason for hiding this comment

Uh oh!

asitstands Apr 2, 2018

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

asitstands commented Mar 31, 2018 •

edited

Loading