Skip to content

Parallel Efficiency Tests

Huang, Hua edited this page Aug 14, 2020 · 5 revisions

Tests in this page were performed on a server with the following hardware and software configurations:

  • 2 * Intel Xeon Gold 6226 CPU @ 2.7GHz (2 * 12 cores, 2 * 12 * 2 threads, hyperthreading disabled)
  • 6 * 32 GB DDR4 memory
  • Red Had Enterprise Linux 7.6 (kernel 3.10.0-957.12.1.el7)
  • Intel Parallel Studio Cluster version 2019.5
  • ICC optimization flags: -O3 -xHost
  • OpenMP environment variables
    • OMP_NUM_THREADS=1, 2, 4, 8, 12, 24
    • OMP_PLACES=cores
    • OMP_PROC_BIND=close

Test point sets: 1,000,000 uniformly and randomly distributed points in a 3D scaled cube

Prescribed QR relative error tolerance: 1e-6

Report H2-build and H2-matvec timings

Test kernel: Coulomb

Clone this wiki locally