HPCG-Benchmark
version=3.1
Release date=March 28, 2019
Machine Summary=
Machine Summary::Distributed Processes=34560
Machine Summary::Threads per processes=1
Global Problem Dimensions=
Global Problem Dimensions::Global nx=1792
Global Problem Dimensions::Global ny=7776
Global Problem Dimensions::Global nz=11280
Processor Dimensions=
Processor Dimensions::npx=32
Processor Dimensions::npy=36
Processor Dimensions::npz=30
Local Domain Dimensions=
Local Domain Dimensions::nx=56
Local Domain Dimensions::ny=216
Local Domain Dimensions::Lower ipz=0
Local Domain Dimensions::Upper ipz=29
Local Domain Dimensions::nz=376
########## Problem Summary ##########=
Setup Information=
Setup Information::Setup Time=2.97322
Linear System Information=
Linear System Information::Number of Equations=157182197760
Linear System Information::Number of Nonzero Terms=4241726080312
Multigrid Information=
Multigrid Information::Number of coarse grid levels=3
Multigrid Information::Coarse Grids=
Multigrid Information::Coarse Grids::Grid Level=1
Multigrid Information::Coarse Grids::Number of Equations=19647774720
Multigrid Information::Coarse Grids::Number of Nonzero Terms=529941665176
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=2
Multigrid Information::Coarse Grids::Number of Equations=2455971840
Multigrid Information::Coarse Grids::Number of Nonzero Terms=66174207880
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=3
Multigrid Information::Coarse Grids::Number of Equations=306996480
Multigrid Information::Coarse Grids::Number of Nonzero Terms=8254662640
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
########## Memory Use Summary ##########=
Memory Use Information=
Memory Use Information::Total memory used for data (Gbytes)=177075
Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=64653.5
Memory Use Information::Bytes per equation (Total memory / Number of Equations)=1126.56
Memory Use Information::Memory used for linear system and CG (Gbytes)=98931.2
Memory Use Information::Coarse Grids=
Memory Use Information::Coarse Grids::Grid Level=1
Memory Use Information::Coarse Grids::Memory used=11824.3
Memory Use Information::Coarse Grids::Grid Level=2
Memory Use Information::Coarse Grids::Memory used=1480.08
Memory Use Information::Coarse Grids::Grid Level=3
Memory Use Information::Coarse Grids::Memory used=185.538
########## V&V Testing Summary ##########=
Spectral Convergence Tests=
Spectral Convergence Tests::Result=PASSED
Spectral Convergence Tests::Unpreconditioned=
Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11
Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12
Spectral Convergence Tests::Preconditioned=
Spectral Convergence Tests::Preconditioned::Maximum iteration count=2
Spectral Convergence Tests::Preconditioned::Expected iteration count=2
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon=
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=1.2281e-15
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=4.9124e-15
########## Iterations Summary ##########=
Iteration Count Information=
Iteration Count Information::Result=PASSED
Iteration Count Information::Reference CG iterations per set=50
Iteration Count Information::Optimized CG iterations per set=51
Iteration Count Information::Total number of reference iterations=18050
Iteration Count Information::Total number of optimized iterations=18411
########## Reproducibility Summary ##########=
Reproducibility Information=
Reproducibility Information::Result=PASSED
Reproducibility Information::Scaled residual mean=0.00500602
Reproducibility Information::Scaled residual variance=0
########## Performance Summary (times in sec) ##########=
Benchmark Time Summary=
Benchmark Time Summary::Optimization phase=1.42848
Benchmark Time Summary::DDOT=71.2121
Benchmark Time Summary::WAXPBY=33.6008
Benchmark Time Summary::SpMV=388.911
Benchmark Time Summary::MG=1357.54
Benchmark Time Summary::Total=1852.26
Floating Point Operations Summary=
Floating Point Operations Summary::Raw DDOT=1.74768e+16
Floating Point Operations Summary::Raw WAXPBY=1.74768e+16
Floating Point Operations Summary::Raw SpMV=1.59251e+17
Floating Point Operations Summary::Raw MG=8.91303e+17
Floating Point Operations Summary::Total=1.08551e+18
Floating Point Operations Summary::Total with convergence overhead=1.06422e+18
GB/s Summary=
GB/s Summary::Raw Read B/W=3.60955e+06
GB/s Summary::Raw Write B/W=834158
GB/s Summary::Raw Total B/W=4.44371e+06
GB/s Summary::Total with convergence and optimization phase overhead=4.01237e+06
GFLOP/s Summary=
GFLOP/s Summary::Raw DDOT=245418
GFLOP/s Summary::Raw WAXPBY=520129
GFLOP/s Summary::Raw SpMV=409480
GFLOP/s Summary::Raw MG=656557
GFLOP/s Summary::Raw Total=586046
GFLOP/s Summary::Total with convergence overhead=574555
GFLOP/s Summary::Total with convergence and optimization phase overhead=529159
User Optimization Overheads=
User Optimization Overheads::Optimization phase time (sec)=1.42848
User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=0.143119
DDOT Timing Variations=
DDOT Timing Variations::Min DDOT MPI_Allreduce time=23.1163
DDOT Timing Variations::Max DDOT MPI_Allreduce time=154.61
DDOT Timing Variations::Avg DDOT MPI_Allreduce time=53.9899
Final Summary=
Final Summary::HPCG result is VALID with a GFLOP/s rating of=529159
Final Summary::HPCG 2.4 rating for historical reasons is=558992
Final Summary::Please upload results from the YAML file contents to=http://hpcg-benchmark.org