Support for nVidia GPU's and CUDA, including PC sampling on GPU.
New fnbounds server that is faster and has a much smaller memory
footprint.
New format for hpcviewer databases. Note: the new viewer supports
reading old databases, but the current hpcprof requires the latest
hpcviewer.
Hpcstruct supports thread-level parallelism, use '-j ' to
run with multiple threads.
Important bug fixes to improve powerpc unwinding.
Bug fixes to better handle DOE applications: adagio, kull, pytorch
Updates to the manual and man pages.