Skip to content
GitLab
Projects Groups Topics Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Sign in
  • gpu_offloading gpu_offloading
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributor statistics
    • Graph
    • Compare revisions
  • Issues 0
    • Issues 0
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Terraform modules
  • Monitor
    • Monitor
    • Metrics
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Okke van Eck
  • gpu_offloadinggpu_offloading
  • Wiki
  • 1.d. Profiling & Debugging

1.d. Profiling & Debugging · Changes

Page history
Update 1.d. Profiling & Debugging authored Nov 06, 2023 by Okke van Eck's avatar Okke van Eck
Hide whitespace changes
Inline Side-by-side
1.d.-Profiling-&-Debugging.md
View page @ 073fddb8
...@@ -141,6 +141,7 @@ There are also environment variables that can be set for extra information durin ...@@ -141,6 +141,7 @@ There are also environment variables that can be set for extra information durin
|:----------------------:|:---------------:|:-----------:|:------------------:| |:----------------------:|:---------------:|:-----------:|:------------------:|
| NCPUS | [N] | 1 | Sets the number of processes or threads used in parallel regions. *NOTE: Same as OMP_NUM_THREADS, kept for historical reasons.* | | NCPUS | [N] | 1 | Sets the number of processes or threads used in parallel regions. *NOTE: Same as OMP_NUM_THREADS, kept for historical reasons.* |
| NVCOMPILER_ACC_NOTIFY | <ul><li>`1`: Kernel launches only</li><li>`2`: Data transfers only</li><li>`3`: Kernel launches and data transfers</li><li>`4`: region entry/exits only</li><li>`5`: region entry/exits and kernel launches</li><li>`8`: wait operations, synchronizatoins</li><li>`16`: (de)allocation of device memory</li></ul> | 3 | Print information for GPU-related events. | | NVCOMPILER_ACC_NOTIFY | <ul><li>`1`: Kernel launches only</li><li>`2`: Data transfers only</li><li>`3`: Kernel launches and data transfers</li><li>`4`: region entry/exits only</li><li>`5`: region entry/exits and kernel launches</li><li>`8`: wait operations, synchronizatoins</li><li>`16`: (de)allocation of device memory</li></ul> | 3 | Print information for GPU-related events. |
| NV_ACC_DEBUG | [0\|1] | 1 | Info on devices, launches, function arguments. |
### Hardware independent ### Hardware independent
There are also hardware independent environment variables that gives us runtime information. There are also hardware independent environment variables that gives us runtime information.
......
Clone repository
  • 0. Sources
  • 0.a. Documentation & Manuals
  • 0.b. Slides
  • 0.c. Papers
  • 0.d. Misc
  • 1. Offloading process
  • 1.a. Supercomputers
  • 1.b. Compilers
  • 1.c. OpenMP directives
  • 1.d. Profiling & Debugging
  • 1.e Performance hazards
  • 2. Nemo standalone
  • 3. Tools
  • 3.a. Intel Offload Advisor & Intel VTune
  • 3.b. CPU and GPU performance overview
View All Pages