Build libafcuda with dynamically loaded CUDA numeric binaries #3205
Merged
umar456 merged 2 commits into arrayfire:master (Mar 22, 2022)
Conversation
It could be done but would require a good bit of work. We have done this for other libraries but not for things like cublas and cufft.
63db467 to 68d3df8
syurkevi approved these changes on Mar 22, 2022

Dynamically link CUDA numeric libraries instead of always statically linking.
Description
ArrayFire's CUDA backend linked against the CUDA numeric libraries
statically before this change. This caused the libafcuda library to be
in the 1.1GB range for CUDA 11.5, even if you were targeting a single
compute capability. This is partially due to the fact that the linker
does not remove the device code for older architectures when linking.
One way around this would be to use nvprune at build time to strip the
architectures that are not targeted by the selected compute
capabilities. This approach is not yet implemented.
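As a rough illustration of the nvprune approach mentioned above (not part of this change; the library path, archive name, and target architecture below are assumptions), pruning a static CUDA library down to a single architecture before linking could look like:

```shell
# Hypothetical sketch: keep only sm_75 device code in the static cuBLAS
# archive, discarding the code for all other architectures. The input
# path and the chosen architecture will vary by installation and target.
nvprune --arch sm_75 \
    /usr/local/cuda/lib64/libcublas_static.a \
    -o libcublas_static_sm75.a
```

Linking against the pruned archive instead of the full one would avoid carrying device code for architectures the build does not target.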
This commit reverts back to dynamically linking the CUDA numeric
libraries by default. You can still select the old behavior by setting
the AF_WITH_STATIC_CUDA_NUMERIC_LIBS option in CMake.
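For example, restoring the old static-linking behavior at configure time could look like this (a sketch: the option name comes from this change, but the source and build directory layout is a placeholder):

```shell
# Configure ArrayFire with the CUDA numeric libraries linked statically
# again (the pre-change behavior); the default is dynamic linking.
cmake -S . -B build -DAF_WITH_STATIC_CUDA_NUMERIC_LIBS=ON
cmake --build build
```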
Changes to Users
The binary sizes are significantly smaller, but this requires the CUDA
libraries to be present in the library paths at runtime. This is only an
issue when building installers.
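One way to confirm how libafcuda was linked, assuming a Linux build (the library path below is a placeholder), is to inspect its dynamic dependencies:

```shell
# On a dynamically linked build, the CUDA numeric libraries appear as
# runtime dependencies; on a static build they do not.
ldd libafcuda.so | grep -E 'cublas|cufft|cusolver|cusparse'

# If any are reported as "not found", add the CUDA library directory to
# the loader search path, e.g.:
export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH
```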
Checklist
- [ ] Functions added to unified API
- [ ] Functions documented