HPCG-Benchmark
version=3.1
Release date=March 28, 2019
Machine Summary=
Machine Summary::Distributed Processes=36864
Machine Summary::Threads per processes=1
Global Problem Dimensions=
Global Problem Dimensions::Global nx=1792
Global Problem Dimensions::Global ny=6912
Global Problem Dimensions::Global nz=13536
Processor Dimensions=
Processor Dimensions::npx=32
Processor Dimensions::npy=32
Processor Dimensions::npz=36
Local Domain Dimensions=
Local Domain Dimensions::nx=56
Local Domain Dimensions::ny=216
Local Domain Dimensions::Lower ipz=0
Local Domain Dimensions::Upper ipz=35
Local Domain Dimensions::nz=376
########## Problem Summary  ##########=
Setup Information=
Setup Information::Setup Time=6.94243
Linear System Information=
Linear System Information::Number of Equations=167661010944
Linear System Information::Number of Nonzero Terms=4524503896696
Multigrid Information=
Multigrid Information::Number of coarse grid levels=3
Multigrid Information::Coarse Grids=
Multigrid Information::Coarse Grids::Grid Level=1
Multigrid Information::Coarse Grids::Number of Equations=20957626368
Multigrid Information::Coarse Grids::Number of Nonzero Terms=565270128952
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=2
Multigrid Information::Coarse Grids::Number of Equations=2619703296
Multigrid Information::Coarse Grids::Number of Nonzero Terms=70585576600
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=3
Multigrid Information::Coarse Grids::Number of Equations=327462912
Multigrid Information::Coarse Grids::Number of Nonzero Terms=8804912200
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
########## Memory Use Summary  ##########=
Memory Use Information=
Memory Use Information::Total memory used for data (Gbytes)=187347
Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=67431.2
Memory Use Information::Bytes per equation (Total memory / Number of Equations)=1117.42
Memory Use Information::Memory used for linear system and CG (Gbytes)=105527
Memory Use Information::Coarse Grids=
Memory Use Information::Coarse Grids::Grid Level=1
Memory Use Information::Coarse Grids::Memory used=12612.6
Memory Use Information::Coarse Grids::Grid Level=2
Memory Use Information::Coarse Grids::Memory used=1578.75
Memory Use Information::Coarse Grids::Grid Level=3
Memory Use Information::Coarse Grids::Memory used=197.908
########## V&V Testing Summary  ##########=
Spectral Convergence Tests=
Spectral Convergence Tests::Result=PASSED
Spectral Convergence Tests::Unpreconditioned=
Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11
Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12
Spectral Convergence Tests::Preconditioned=
Spectral Convergence Tests::Preconditioned::Maximum iteration count=2
Spectral Convergence Tests::Preconditioned::Expected iteration count=2
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon=
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=1.34923e-15
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=0
########## Iterations Summary  ##########=
Iteration Count Information=
Iteration Count Information::Result=PASSED
Iteration Count Information::Reference CG iterations per set=50
Iteration Count Information::Optimized CG iterations per set=50
Iteration Count Information::Total number of reference iterations=19900
Iteration Count Information::Total number of optimized iterations=19900
########## Reproducibility Summary  ##########=
Reproducibility Information=
Reproducibility Information::Result=PASSED
Reproducibility Information::Scaled residual mean=0.00504257
Reproducibility Information::Scaled residual variance=0
########## Performance Summary (times in sec) ##########=
Benchmark Time Summary=
Benchmark Time Summary::Optimization phase=2.55509
Benchmark Time Summary::DDOT=91.3061
Benchmark Time Summary::WAXPBY=37.7825
Benchmark Time Summary::SpMV=430.484
Benchmark Time Summary::MG=1481.02
Benchmark Time Summary::Total=2041.73
Floating Point Operations Summary=
Floating Point Operations Summary::Raw DDOT=2.01522e+16
Floating Point Operations Summary::Raw WAXPBY=2.01522e+16
Floating Point Operations Summary::Raw SpMV=1.83677e+17
Floating Point Operations Summary::Raw MG=1.02761e+18
Floating Point Operations Summary::Total=1.25159e+18
Floating Point Operations Summary::Total with convergence overhead=1.25159e+18
GB/s Summary=
GB/s Summary::Raw Read B/W=3.77561e+06
GB/s Summary::Raw Write B/W=872490
GB/s Summary::Raw Total B/W=4.6481e+06
GB/s Summary::Total with convergence and optimization phase overhead=3.92199e+06
GFLOP/s Summary=
GFLOP/s Summary::Raw DDOT=220710
GFLOP/s Summary::Raw WAXPBY=533374
GFLOP/s Summary::Raw SpMV=426675
GFLOP/s Summary::Raw MG=693853
GFLOP/s Summary::Raw Total=613006
GFLOP/s Summary::Total with convergence overhead=613006
GFLOP/s Summary::Total with convergence and optimization phase overhead=517244
User Optimization Overheads=
User Optimization Overheads::Optimization phase time (sec)=2.55509
User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=0.252254
DDOT Timing Variations=
DDOT Timing Variations::Min DDOT MPI_Allreduce time=26.215
DDOT Timing Variations::Max DDOT MPI_Allreduce time=271.178
DDOT Timing Variations::Avg DDOT MPI_Allreduce time=104.683
Final Summary=
Final Summary::HPCG result is VALID with a GFLOP/s rating of=517244
Final Summary::HPCG 2.4 rating for historical reasons is=583922
Final Summary::Please upload results from the YAML file contents to=http://hpcg-benchmark.org