n128-202560p-2t version=V3.1 Release date=March 28, 2019 Machine Summary= Machine Summary::Distributed Processes=202560 Machine Summary::Threads per processes=2 Global Problem Dimensions= Global Problem Dimensions::Global nx=4096 Global Problem Dimensions::Global ny=3840 Global Problem Dimensions::Global nz=27008 Processor Dimensions= Processor Dimensions::npx=32 Processor Dimensions::npy=30 Processor Dimensions::npz=211 Local Domain Dimensions= Local Domain Dimensions::nx=128 Local Domain Dimensions::ny=128 Local Domain Dimensions::nz=128 ########## Problem Summary ##########= Setup Information= Setup Information::Setup Time=1.28108 Linear System Information= Linear System Information::Number of Equations=424799109120 Linear System Information::Number of Nonzero Terms=11465435211256 Multigrid Information= Multigrid Information::Number of coarse grid levels=3 Multigrid Information::Coarse Grids= Multigrid Information::Coarse Grids::Grid Level=1 Multigrid Information::Coarse Grids::Number of Equations=53099888640 Multigrid Information::Coarse Grids::Number of Nonzero Terms=1432661914360 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=2 Multigrid Information::Coarse Grids::Number of Equations=6637486080 Multigrid Information::Coarse Grids::Number of Nonzero Terms=178953406840 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=3 Multigrid Information::Coarse Grids::Number of Equations=829685760 Multigrid Information::Coarse Grids::Number of Nonzero Terms=22336862392 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 ########## Memory Use Summary ##########= Memory Use Information= Memory Use Information::Total memory used for data (Gbytes)=303813 Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=0 Memory Use Information::Bytes per equation (Total memory / Number of Equations)=715.193 Memory Use Information::Memory used for linear system and CG (Gbytes)=267359 Memory Use Information::Coarse Grids= Memory Use Information::Coarse Grids::Grid Level=1 Memory Use Information::Coarse Grids::Memory used=31953.5 Memory Use Information::Coarse Grids::Grid Level=2 Memory Use Information::Coarse Grids::Memory used=3999.4 Memory Use Information::Coarse Grids::Grid Level=3 Memory Use Information::Coarse Grids::Memory used=501.289 ########## V&V Testing Summary ##########= Spectral Convergence Tests= Spectral Convergence Tests::Result=PASSED Spectral Convergence Tests::Unpreconditioned= Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11 Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12 Spectral Convergence Tests::Preconditioned= Spectral Convergence Tests::Preconditioned::Maximum iteration count=2 Spectral Convergence Tests::Preconditioned::Expected iteration count=2 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon= Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=2.52196e-15 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=2.9591e-14 ########## Iterations Summary ##########= Iteration Count Information= Iteration Count Information::Result=PASSED Iteration Count Information::Reference CG iterations per set=50 Iteration Count Information::Optimized CG iterations per set=50 Iteration Count Information::Total number of reference iterations=3900 Iteration Count Information::Total number of optimized iterations=3900 ########## Reproducibility Summary ##########= Reproducibility Information= Reproducibility Information::Result=PASSED Reproducibility Information::Scaled residual mean=0.00505633 Reproducibility Information::Scaled residual variance=0 ########## Performance Summary (times in sec) ##########= Benchmark Time Summary= Benchmark Time Summary::Optimization phase=0.712522 Benchmark Time Summary::DDOT=9.14428 Benchmark Time Summary::WAXPBY=32.9321 Benchmark Time Summary::SpMV=308.397 Benchmark Time Summary::MG=1323.36 Benchmark Time Summary::ALL_reduce=134.289 Benchmark Time Summary::Total=1808.14 Floating Point Operations Summary= Floating Point Operations Summary::Raw DDOT=1.00066e+16 Floating Point Operations Summary::Raw WAXPBY=1.00066e+16 Floating Point Operations Summary::Raw SpMV=9.1219e+16 Floating Point Operations Summary::Raw MG=5.10353e+17 Floating Point Operations Summary::Total=6.21586e+17 Floating Point Operations Summary::Total with convergence overhead=6.21586e+17 GB/s Summary= GB/s Summary::Raw Read B/W=2.11734e+06 GB/s Summary::Raw Write B/W=489286 GB/s Summary::Raw Total B/W=2.60662e+06 GB/s Summary::Total with convergence and optimization phase overhead=2.5844e+06 GFLOP/s Summary= GFLOP/s Summary::Raw DDOT=1.0943e+06 GFLOP/s Summary::Raw WAXPBY=303854 GFLOP/s Summary::Raw SpMV=295785 GFLOP/s Summary::Raw MG=385649 GFLOP/s Summary::Raw Total=343771 GFLOP/s Summary::Total with convergence overhead=343771 GFLOP/s Summary::Total with convergence and optimization phase overhead=340840 User Optimization Overheads= User Optimization Overheads::Problem setup time (sec)=1.28108 User Optimization Overheads::Optimization phase time (sec)=0.712522 User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=1.35039 DDOT Timing Variations= DDOT Timing Variations::Min DDOT MPI_Allreduce time=73.0944 DDOT Timing Variations::Max DDOT MPI_Allreduce time=135.413 DDOT Timing Variations::Avg DDOT MPI_Allreduce time=87.8303 Final Summary = Final Summary ::HPCG result is VALID with a GFLOP/s rating of=340840 Final Summary :: HPCG 2.4 Rating (for historical value) is=342718 Final Summary ::Reference version of ComputeDotProduct used=Performance results are most likely suboptimal Final Summary ::Please upload results from the YAML file contents to=http://hpcg-benchmark.org