Strong scaling Test -- Kraken & Stampede

  • This run, use a short final time and consider the profiling time and IO time.
    1. stampede
Runtime http://www.pas.rochester.edu/~bliu/Stampede/strongScalingOnStampedeNew.png
Non-ghost zone portion http://www.pas.rochester.edu/~bliu/Stampede/strongScalingOnStampedeNoGhostUpdate.png
  1. kraken
Runtime http://www.pas.rochester.edu/~bliu/Kraken/strongScalingOnKraken.png
Non-ghost zone portion http://www.pas.rochester.edu/~bliu/Kraken/strongScalingOnKrakenOnlyNonGhost.png
  • Data
    1. stampede Non optimization
Cores Wall Time Non-ghost zone portion
128 549.82 57%
256 360.35 48%
512 235.72 41%

stampede O3

Cores Wall Time Non-ghost zone portion
128 54.0 57%
256 33.5 48%
512 20.4 41%
1024 13.31 35%
2048 10.63 29%
4096 8.66 23%
  1. Kraken
Cores Wall Time Non-ghost zone portion
120 118.18 56%
240 69.00 50%
480 40.99 42%
1008 28.24 35%
2016 16.88 29%
4996 11.25 22%
  • Configuration of Kraken & Stampede
Kraken Stampede
Computing Nodes 9408 6400
Core per Node 12 16
Processor 2.6 GHz AMD Opteron 2.7GHz Xeon E5-2680 (Coprocessors Xeon Phi SE10P 1.1 GHz)
Memory per Node 16 GB 32 GB
  • Standard output 1.128 cores
    Total Runtime =      550.3116700649261475 seconds.
     Info allocations    =   ------    280.6 mb
     message allocations =   ------     36.2 mb
     sweep allocations   =   ------     49.8 mb
     filling fractions   =   0.012  0.644  0.900  0.000
     Current efficiency  =  82%  16%  98% 
     Cell updates/second =        973      1721  57%
     Wall Time Remaining =   ------   
     AMR Speed-Up Factor =       0.1039E+04
    
  1. 256 cores
    Total Runtime =      360.3508758544921875 seconds.
     Info allocations    =   ------    200.6 mb
     message allocations =   ------     32.4 mb
     sweep allocations   =   ------     59.7 mb
     filling fractions   =   0.012  0.665  0.898  0.000
     Current efficiency  =  77%  21%  98% 
     Cell updates/second =        735      1525  48%
     Wall Time Remaining =   ------   
     AMR Speed-Up Factor =       0.7988E+03
    
  1. 512 cores
    Total Runtime =      549.8217809200286865 seconds.
     Info allocations    =   ------    280.6 mb
     message allocations =   ------     36.2 mb
     sweep allocations   =   ------     49.8 mb
     filling fractions   =   0.012  0.644  0.900  0.000
     Current efficiency  =  82%  16%  98% 
     Cell updates/second =        974      1722  57%
     Wall Time Remaining =   ------   
     AMR Speed-Up Factor =       0.1040E+04
    
  • Standard output on Kraken
    1. 120 cores
      Total Runtime =      118.1762299537658691 seconds.
       Info allocations    =   ------    257.6 mb
       message allocations =   ------     41.2 mb
       sweep allocations   =   ------     63.1 mb
       filling fractions   =   0.012  0.668  0.895  0.000
       Current efficiency  =  75% 
       Cell updates/second =       4837      8572  56%
       Wall Time Remaining =   ------   
       AMR Speed-Up Factor =       0.8210E+03
      
  1. 240 cores
    Total Runtime =       68.9995868206024170 seconds.
     Info allocations    =   ------    164.1 mb
     message allocations =   ------     32.8 mb
     sweep allocations   =   ------     58.4 mb
     filling fractions   =   0.012  0.654  0.897  0.000
     Current efficiency  =  70% 
     Cell updates/second =       4099      8226  50%
     Wall Time Remaining =   ------   
     AMR Speed-Up Factor =       0.7070E+03
    
  2. 480
    Total Runtime =       40.9853310585021973 seconds.
     Info allocations    =   ------    122.0 mb
     message allocations =   ------     24.2 mb
     sweep allocations   =   ------     30.1 mb
     filling fractions   =   0.011  0.706  0.901  0.000
     Current efficiency  =  68% 
     Cell updates/second =       3414      8082  42%
     Wall Time Remaining =   ------   
     AMR Speed-Up Factor =       0.5956E+03
    

Comments

No comments.