HPCG-Benchmark version=3.1 Release date=March 28, 2019 Machine Summary= Machine Summary::Distributed Processes=256 Machine Summary::Threads per processes=8 Global Problem Dimensions= Global Problem Dimensions::Global nx=4096 Global Problem Dimensions::Global ny=4096 Global Problem Dimensions::Global nz=1152 Processor Dimensions= Processor Dimensions::npx=8 Processor Dimensions::npy=8 Processor Dimensions::npz=4 Local Domain Dimensions= Local Domain Dimensions::nx=512 Local Domain Dimensions::ny=512 Local Domain Dimensions::Lower ipz=0 Local Domain Dimensions::Upper ipz=3 Local Domain Dimensions::nz=288 ########## Problem Summary ##########= Setup Information= Setup Information::Setup Time=0.554265 Linear System Information= Linear System Information::Number of Equations=19327352832 Linear System Information::Number of Nonzero Terms=521366779384 Multigrid Information= Multigrid Information::Number of coarse grid levels=3 Multigrid Information::Coarse Grids= Multigrid Information::Coarse Grids::Grid Level=1 Multigrid Information::Coarse Grids::Number of Equations=2415919104 Multigrid Information::Coarse Grids::Number of Nonzero Terms=65111907064 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=2 Multigrid Information::Coarse Grids::Number of Equations=301989888 Multigrid Information::Coarse Grids::Number of Nonzero Terms=8124263800 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=3 Multigrid Information::Coarse Grids::Number of Equations=37748736 Multigrid Information::Coarse Grids::Number of Nonzero Terms=1011857080 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 ########## Memory Use Summary ##########= Memory Use Information= Memory Use Information::Total memory used for data (Gbytes)=13816.1 Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=0 Memory Use Information::Bytes per equation (Total memory / Number of Equations)=714.847 Memory Use Information::Memory used for linear system and CG (Gbytes)=12159.2 Memory Use Information::Coarse Grids= Memory Use Information::Coarse Grids::Grid Level=1 Memory Use Information::Coarse Grids::Memory used=1452.54 Memory Use Information::Coarse Grids::Grid Level=2 Memory Use Information::Coarse Grids::Memory used=181.64 Memory Use Information::Coarse Grids::Grid Level=3 Memory Use Information::Coarse Grids::Memory used=22.7234 ########## V&V Testing Summary ##########= Spectral Convergence Tests= Spectral Convergence Tests::Result=PASSED Spectral Convergence Tests::Unpreconditioned= Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11 Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12 Spectral Convergence Tests::Preconditioned= Spectral Convergence Tests::Preconditioned::Maximum iteration count=2 Spectral Convergence Tests::Preconditioned::Expected iteration count=2 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon= Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=0 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=0 ########## Iterations Summary ##########= Iteration Count Information= Iteration Count Information::Result=PASSED Iteration Count Information::Reference CG iterations per set=50 Iteration Count Information::Optimized CG iterations per set=53 Iteration Count Information::Total number of reference iterations=750 Iteration Count Information::Total number of optimized iterations=795 ########## Reproducibility Summary ##########= Reproducibility Information= Reproducibility Information::Result=PASSED Reproducibility Information::Scaled residual mean=0.00500188 Reproducibility Information::Scaled residual variance=0 ########## Performance Summary (times in sec) ##########= Benchmark Time Summary= Benchmark Time Summary::Optimization phase=0.486096 Benchmark Time Summary::DDOT=4.16728 Benchmark Time Summary::WAXPBY=3.2593 Benchmark Time Summary::SpMV=11.6154 Benchmark Time Summary::MG=42.6899 Benchmark Time Summary::Total=61.7339 Floating Point Operations Summary= Floating Point Operations Summary::Raw DDOT=9.27713e+13 Floating Point Operations Summary::Raw WAXPBY=9.27713e+13 Floating Point Operations Summary::Raw SpMV=8.44614e+14 Floating Point Operations Summary::Raw MG=4.73031e+15 Floating Point Operations Summary::Total=5.76047e+15 Floating Point Operations Summary::Total with convergence overhead=5.4344e+15 GB/s Summary= GB/s Summary::Raw Read B/W=574725 GB/s Summary::Raw Write B/W=132830 GB/s Summary::Raw Total B/W=707555 GB/s Summary::Total with convergence and optimization phase overhead=651047 GFLOP/s Summary= GFLOP/s Summary::Raw DDOT=22261.8 GFLOP/s Summary::Raw WAXPBY=28463.6 GFLOP/s Summary::Raw SpMV=72714.7 GFLOP/s Summary::Raw MG=110806 GFLOP/s Summary::Raw Total=93311.3 GFLOP/s Summary::Total with convergence overhead=88029.5 GFLOP/s Summary::Total with convergence and optimization phase overhead=85859.1 User Optimization Overheads= User Optimization Overheads::Optimization phase time (sec)=0.486096 User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=0.036714 DDOT Timing Variations= DDOT Timing Variations::Min DDOT MPI_Allreduce time=0.206468 DDOT Timing Variations::Max DDOT MPI_Allreduce time=2.77871 DDOT Timing Variations::Avg DDOT MPI_Allreduce time=1.07706 Final Summary= Final Summary::HPCG result is VALID with a GFLOP/s rating of=85859.1 Final Summary::HPCG 2.4 rating for historical reasons is=87001.9 Final Summary::Results are valid but execution time (sec) is=61.7339 Final Summary::Official results execution time (sec) must be at least=1800