... | ... | @@ -39,5 +39,18 @@ On GPU: |
|
|
- WriteUnitStalled (percentage GPUTime the WriteUnit is stalled)
|
|
|
- LDSBankConflict (Percentage of GPUTime LDS is stalled by bank conflict)
|
|
|
|
|
|
## Application optimization workflow
|
|
|
The workflow for optimization code can be summarized as:
|
|
|
|
|
|
1. Understand requirements
|
|
|
2. Understand current performance
|
|
|
3. Can it be done? (modeling)
|
|
|
4. How can it be done? (some options)
|
|
|
5. Tuning
|
|
|
- Not there yet? Back to 2!
|
|
|
6. Analyze the result
|
|
|
|
|
|
|
|
|
|
|
|
# Subpages
|
|
|
- [Intel Advisor & Intel VTune](3.a.-Intel-Offload-Advisor-&-Intel-VTune) |
|
|
\ No newline at end of file |