Parallel Efficiency Tests
Huang, Hua edited this page Aug 14, 2020 · 5 revisions
Tests on this page were performed on a server with the following hardware and software configuration:
- 2 * Intel Xeon Gold 6226 CPU @ 2.7GHz (2 * 12 cores, 2 * 12 * 2 threads, hyperthreading disabled)
- 6 * 32 GB DDR4 memory
- Red Hat Enterprise Linux 7.6 (kernel 3.10.0-957.12.1.el7)
- Intel Parallel Studio Cluster version 2019.5
- ICC optimization flags: -O3 -xHost
- OpenMP environment variables:
  - OMP_NUM_THREADS=1, 2, 4, 8, 12, 24 (one value per run)
  - OMP_PLACES=cores
  - OMP_PROC_BIND=close
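The OpenMP settings above can be applied per run from a small driver script. The following is a minimal sketch, not part of H2Pack: the helper names `openmp_env` and `sweep_commands` are hypothetical, and the command to run is left to the caller because this page does not name the test executable.

```python
import os

# Thread counts listed in the configuration above.
THREAD_COUNTS = [1, 2, 4, 8, 12, 24]


def openmp_env(n_threads, base_env=None):
    """Return a copy of an environment with the OpenMP settings from this page.

    base_env defaults to the current process environment.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["OMP_NUM_THREADS"] = str(n_threads)
    env["OMP_PLACES"] = "cores"      # one place per physical core
    env["OMP_PROC_BIND"] = "close"   # keep threads on neighboring places
    return env


def sweep_commands(cmd):
    """Pair a command (list of argv strings) with one environment per thread count."""
    return [(cmd, openmp_env(n)) for n in THREAD_COUNTS]
```

Each `(cmd, env)` pair can then be passed to `subprocess.run(cmd, env=env, check=True)`, so that every thread count is run with identical placement and binding settings.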
Test point sets: 1,000,000 uniformly and randomly distributed points in a 3D scaled cube
Prescribed QR relative error tolerance: 1e-6
Reported timings: H2-build and H2-matvec
Test kernel: Coulomb
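Given build or matvec timings at each thread count, parallel efficiency is conventionally computed as T(1) / (p * T(p)), where T(p) is the wall time with p threads. The sketch below assumes that standard definition; the function names are hypothetical and the page itself does not state which formula was used.

```python
def speedup(timings):
    """Map thread count -> T(1) / T(p).

    timings: dict of thread count -> wall time in seconds; must contain key 1.
    """
    t1 = timings[1]
    return {p: t1 / tp for p, tp in sorted(timings.items())}


def parallel_efficiency(timings):
    """Map thread count -> T(1) / (p * T(p)); 1.0 means ideal scaling."""
    t1 = timings[1]
    return {p: t1 / (p * tp) for p, tp in sorted(timings.items())}
```

As a sanity check with synthetic inputs (not measurements from this page), timings that halve each time the thread count doubles, such as `{1: 8.0, 2: 4.0, 4: 2.0}`, give an efficiency of 1.0 at every thread count.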