Support for NVIDIA Nsight Systems Profiling of DJL Serving #2667
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This PR introduces support for performance profiling and debugging in DJL Serving using NVIDIA Nsight Systems. The changes enable detailed profiling and performance analysis to optimize the model server's operation.
Key Features
Debug Mode Activation
DEBUG_MODE
environment variable:Nsight Systems Integration
DEBUG_MODE
is enabled.NSIGHT_VERSION
environment variable.Profiling Configuration
NSYS_PROFILE_DELAY
: Time delay (in seconds) before starting profiling, to capture relevant operations.NSYS_PROFILE_DURATION
: Duration (in seconds) of profiling. It is recommended to keep this under 10 minutes to avoid large report sizes.NSYS_PROFILE_TRACE
: Specify APIs and libraries to trace during profiling (e.g.,cuda,nvtx,osrt,cudnn
).Report Handling
S3_DEBUG_PATH
environment variable is provided.Testing
Documentation