理化学研究所 で開発されている GENESIS (GENeralized-Ensemble SImulation System)は、タンパク質、核酸、脂質、糖鎖、またはそれらの複合体など 生体内高分子 を対象にした 分子動力学計算 を行うためのソフトウェアです。公式ホームページに掲載されているベンチマークテストを実行し、その標準出力に表示されている”total time”の値を結果及び、MDメインループ( dynamics )から割り出した ns/day により、性能を評価しています。
テストに使用しているインプットファイルの概要は以下の通りです。
DHFR
27,346 atoms
forcefield = AMBER
integrator = VRES
switchdist = 8 Å
cutoffdist = 8 Å
pairlistdist = 9.5 Å
nsteps = 6000
timestep = 3.5 fs
APOA1
92,224 atoms
forcefield = CHARMM
integrator = VRES
switchdist = 10.0 Å
cutoffdist = 12.0 Å
pairlistdist = 13.5 Å
nsteps = 6000
timestep = 3.5 fs
UUN
216,726 atoms
forcefield = CHARMM
integrator = VRES
switchdist = 10.0 Å
cutoffdist = 12.0 Å
pairlistdist = 13.5 Å
nsteps = 6000
timestep = 3.5 fs
実行結果
以下のデータの結果は、MDメインループ( dynamics )から、ns/day を割り出した値です。数字が大きいほど、性能が高いことを示しています。
specification | node | proc | thread | GPU | Ensemble: NVT | Ensemble: NPT | ||||
DHFR | APOA1 | UUN | DHFR | APOA1 | UUN | |||||
実施日:2024-01-18、GENESIS 2.1.2 単精度 GPU : (2) NVIDIA RTX 6000 Ada HPC-ProServer DPrT7960 CPU:(1) Intel Xeon w9-3495X 1.9-4.8G, 56C/112T Mem : 256GB (8) 32GB RDIMM, 4800MT/s OS : Ubuntu 22.04 CUDA : 12.3, Driver : 545.23.08 Compiler : Intel oneAPI 2023.2.0 |
1 | 8 | 32 | 1 | 131.9 | 50.1 | 31.4 | 127.1 | 48.8 | 28.9 |
1 | 16 | 32 | 1 | 75.0 | 34.5 | 30.1 | 73.8 | 33.7 | 25.6 | |
1 | 8 | 32 | 2 | 207.1 | 72.0 | 33.8 | 197.4 | 69.4 | 31.4 | |
1 | 16 | 32 | 2 | 139.9 | 55.7 | 34.7 | 135.4 | 53.9 | 32.0 | |
実施日:2023-05-09、GENESIS 2.1 単精度 GPU : (1) NVIDIA GeForce RTX 4090 HPC-ProServer DPeR7525 CPU:(2) AMD EPYC 7713 2.0GHz 64C/128T Mem : 1024GB (32) 32GB DDR4 3200MHz OS : RockyLinux 8.7 CUDA12.1 Driver: 530.30.02 Compiler: gcc 8.5 MPI : OpenMPI 4.1.5 Math LIB : OpenBlas 0.3.23 |
1 | 8 | 8 | 1 | 85.3 | 26.7 | 11.5 | 81.8 | 25.6 | 11.0 |
1 | 8 | 16 | 1 | 87.5 | 33.2 | 16.4 | 84.4 | 32.5 | 16.0 | |
1 | 8 | 32 | 1 | 95.5 | 38.1 | 22.3 | 93.1 | 37.6 | 21.9 | |
1 | 16 | 16 | 1 | 60.9 | 26.4 | 16.6 | 60.1 | 25.8 | 15.2 | |
1 | 16 | 32 | 1 | 63.1 | 28.7 | 21.7 | 61.4 | 27.9 | 20.5 | |
1 | 16 | 64 | 1 | 62.3 | 29.1 | 28.4 | 61.3 | 28.6 | 24.0 | |
1 | 32 | 32 | 1 | 33.3 | 19.8 | 19.3 | 33.0 | 19.4 | 14.9 | |
1 | 32 | 64 | 1 | 33.6 | 20.0 | 20.8 | 33.1 | 19.7 | 15.0 | |
1 | 32 | 128 | 1 | 32.1 | 18.6 | 18.7 | 31.9 | 18.3 | 14.3 | |
実施日:2023-03-13、GENESIS 2.1 単精度 GPU : (2) NVIDIA RTX A6000 HPC-ProServer DPeR750 CPU:(2) Intel Xeon Gold 6326 2.90GHz 16C/32T Mem : 256GB (16) 16GB DDR4 3200MHz OS : RockyLinux 8.7 CUDA12.1 Driver: 530.30.02 Compiler: ifort 2021.8.0 MPI : OpenMPI 4.1.4 Math LIB : MKL |
1 | 8 | 32 | 2 | 157.8 | 57.7 | 28.8 | 152.9 | 55.5 | 27.1 |
1 | 16 | 32 | 2 | 104.7 | 44.6 | 29.6 | 101.4 | 43.2 | 26.9 | |
2 | 16 | 64 | 2 | 180.7 | 79.0 | 47.7 | 176.8 | 77.0 | 44.1 | |
2 | 32 | 64 | 2 | 104.1 | 56.4 | 48.1 | 101.7 | 55.2 | 37.9 | |
実施日:2023-02-05、GENESIS 2.03 単精度 GPU : (2) NVIDIA RTX A4000 HPC-ProServer DPrT7865 CPU:(1) AMD Ryzen Threadripper PRO 5995WX 256 MB cache, 64C/128T, 2.7GHz‐4.5GHz Mem : 256GB (8) 32GB DDR4 3200MHz OS : Ubuntu 22.04 5.15.0-58-generic CUDA12.0 Driver: 525.85.12 Compiler: gfortran 11.3 MPI : OpenMPI 4.1.4 Math LIB : libblas, liblapack |
1 | 8 | 8 | 2 | 153.6 | 39.8 | 16.0 | 145.6 | 39.4 | 15.0 |
1 | 8 | 16 | 2 | 144.2 | 46.9 | 22.6 | 135.8 | 44.2 | 20.7 | |
1 | 8 | 32 | 2 | 150.5 | 49.0 | 22.8 | 148.0 | 48.3 | 20.3 | |
1 | 16 | 16 | 2 | 111.3 | 39.7 | 21.5 | 107.6 | 39.1 | 19.7 | |
1 | 16 | 32 | 2 | 106.2 | 41.4 | 25.4 | 103.6 | 39.7 | 22.6 | |
1 | 16 | 64 | 2 | 103.9 | 40.6 | 24.4 | 102.5 | 39.7 | 22.6 | |
1 | 32 | 32 | 2 | 58.8 | 29.4 | 25.3 | 58.0 | 29.0 | 19.0 | |
1 | 32 | 64 | 2 | 57.8 | 29.1 | 25.3 | 57.0 | 28.8 | 19.4 | |
実施日:2023-01-14、GENESIS 2.03 単精度 GPU : (2) NVIDIA RTX A6000 HPC-ProServer DPrR7920 CPU:(2) Intel Xeon Gold 6240R 24C/48T Mem:192GB((12)16GB) OS:CentOS7.9 CUDA12.0 Driver: 525.60.13 Compiler : Intel oneAPI Fortran 2023.0.0 MPI : Intel MPI 2023.0.0 Math LIB : Intel MKL 2023.0.0 |
1 | 8 | 8 | 2 | 95.0 | 23.9 | 9.4 | 88.8 | 23.0 | 8.9 |
1 | 8 | 16 | 2 | 127.6 | 35.5 | 16.1 | 120.3 | 34.1 | 15.1 | |
1 | 8 | 32 | 2 | 147.1 | 50.1 | 23.2 | 139.9 | 48.8 | 22.0 | |
1 | 16 | 16 | 2 | 93.3 | 32.7 | 15.5 | 90.5 | 31.5 | 14.7 | |
1 | 16 | 32 | 2 | 102.7 | 40.0 | 24.4 | 99.0 | 38.9 | 22.4 | |
1 | 32 | 32 | 2 | 56.0 | 28.9 | 23.9 | 54.8 | 28.3 | 18.0 | |
以下のデータの結果は、標準出力のtotal timeの値です。値が小さいほど性能が出ていることを示します。
specification | node | proc | thread | GPU | Ensemble: NVT | Ensemble: NPT | ||||
DHFR | APOA1 | UUN | DHFR | APOA1 | UUN | |||||
実施日:2023-02-05、GENESIS 2.03 単精度 GPU : (2) NVIDIA RTX A4000 HPC-ProServer DPrT7865 CPU:(1) AMD Ryzen Threadripper PRO 5995WX 256 MB cache, 64C/128T, 2.7GHz‐4.5GHz Mem : 256GB (8) 32GB DDR4 3200MHz OS : Ubuntu 22.04 5.15.0-58-generic CUDA12.0 Driver: 525.85.12 Compiler: gfortran 11.3 MPI : OpenMPI 4.1.4 Math LIB : libblas, liblapack |
1 | 8 | 8 | 2 | 12.863 | 47.907 | 120.298 | 13.522 | 48.397 | 127.544 |
1 | 8 | 16 | 2 | 13.615 | 40.738 | 85.556 | 14.333 | 43.16 | 93.209 | |
1 | 8 | 32 | 2 | 12.978 | 38.968 | 85.221 | 13.138 | 39.446 | 95.327 | |
1 | 16 | 16 | 2 | 17.483 | 47.794 | 89.211 | 18.195 | 48.492 | 97.108 | |
1 | 16 | 32 | 2 | 18.378 | 45.966 | 76.24 | 18.828 | 47.907 | 85.368 | |
1 | 16 | 64 | 2 | 18.762 | 46.699 | 79.415 | 18.962 | 47.87 | 85.192 | |
1 | 32 | 32 | 2 | 32.553 | 64.255 | 76.383 | 33.214 | 65.187 | 99.993 | |
1 | 32 | 64 | 2 | 33.135 | 65.102 | 76.751 | 33.666 | 65.61 | 98.678 | |
実施日:2023-01-14、GENESIS 2.03 単精度 GPU : (2) NVIDIA RTX A6000 HPC-ProServer DPrR7920 CPU:(2) Intel Xeon Gold 6240R 24C/48T Mem:192GB((12)16GB) OS:CentOS7.9 CUDA12.0 Driver: 525.60.13 Compiler : Intel oneAPI Fortran 2023.0.0 MPI : Intel MPI 2023.0.0 Math LIB : Intel MKL 2023.0.0 |
1 | 8 | 8 | 2 | 23.48 | 81.81 | 206.17 | 24.63 | 85.23 | 218.38 |
1 | 8 | 16 | 2 | 18.19 | 56.53 | 124.79 | 19.08 | 58.70 | 132.36 | |
1 | 8 | 32 | 2 | 16.24 | 41.16 | 88.85 | 16.78 | 42.24 | 93.18 | |
1 | 16 | 16 | 2 | 23.94 | 61.23 | 127.49 | 24.36 | 63.45 | 134.58 | |
1 | 16 | 32 | 2 | 22.04 | 50.70 | 84.11 | 22.48 | 51.86 | 90.89 | |
1 | 32 | 32 | 2 | 37.77 | 68.82 | 85.40 | 38.41 | 70.32 | 110.23 | |
実施日:2022-08-22、GENESIS 2.0 単精度 GPU : (2) NVIDIA RTX A4000 HPC-ProServer DPrR7920 CPU:(2) Intel Xeon Silver 4110 2.1G 8C/16T Mem:256GB((16)16GB) OS:RockyLinux8.6 CUDA11.7 Driver: 515.65.01 Compiler : Intel oneAPI Fortran 2021.6.0 MPI : Intel MPI 2021.6.0 Math LIB : Intel MKL 2022.1.0 |
1 | 8 | 8 | 1 | 34.40 | 120.66 | 272.60 | 35.86 | 121.23 | 292.34 |
1 | 8 | 8 | 2 | 31.72 | 113.72 | 270.00 | 33.01 | 114.03 | 291.47 | |
1 | 8 | 16 | 2 | 23.55 | 74.75 | 166.73 | 24.84 | 79.86 | 178.78 | |
1 | 16 | 16 | 2 | 27.99 | 78.87 | 173.31 | 29.00 | 81.73 | 183.10 | |
実施日:2022-02-28、GENESIS 2.0β3 mixed precision GPU : (4) NVIDIA A40 NVLink3 HPC-ProServer DPeR750XA CPU:(2) Intel Xeon Gold 6326 2.90G 16C/32T Mem:512GB((16)32GB) OS:Ubuntu 20.04 CUDA11.6 Driver: 510.47.03 NVLINK3 Compiler : GCC 9.3.0 MPI : OpenMPI 4.1.2 Math LIB : libblas3, liblapack3 |
1 | 8 | 8 | 1 | 24.41 | 73.98 | 174.56 | 25.47 | 77.33 | 188.96 |
1 | 8 | 8 | 2 | 17.64 | 65.67 | 173.16 | 18.57 | 69.42 | 188.37 | |
1 | 8 | 8 | 4 | 15.99 | 65.31 | 173.56 | 16.87 | 68.47 | 187.86 | |
1 | 8 | 16 | 1 | 21.83 | 57.35 | 111.26 | 22.71 | 59.22 | 121.93 | |
1 | 8 | 16 | 2 | 14.82 | 44.52 | 107.38 | 15.56 | 46.52 | 115.9 | |
1 | 8 | 16 | 4 | 12.26 | 40.96 | 106.66 | 12.91 | 43.17 | 117.42 | |
1 | 8 | 32 | 1 | 20.58 | 50.04 | 83.96 | 21.32 | 51.6 | 91.8 | |
1 | 8 | 32 | 2 | 13.32 | 34.96 | 75.64 | 13.9 | 36.16 | 82.31 | |
1 | 8 | 32 | 4 | 10.41 | 30.38 | 74.53 | 10.92 | 31.42 | 80.27 | |
1 | 16 | 16 | 1 | 37.15 | 82.58 | 118.69 | 38.09 | 84.88 | 132.05 | |
1 | 16 | 16 | 2 | 21.95 | 55.03 | 112.56 | 22.69 | 57.25 | 122.11 | |
1 | 16 | 16 | 4 | 15.9 | 45.91 | 112.91 | 16.43 | 47.66 | 122.08 | |
1 | 16 | 32 | 1 | 36.4 | 75.54 | 88.46 | 37.2 | 76.75 | 101.49 | |
1 | 16 | 32 | 2 | 20.84 | 46.69 | 75.68 | 21.37 | 47.94 | 83.58 | |
1 | 16 | 32 | 4 | 14.38 | 34.32 | 73.66 | 14.82 | 35.43 | 79.8 | |
1 | 32 | 32 | 1 | 74.07 | 123.6 | 122.25 | 75.12 | 126.03 | 164.14 | |
1 | 32 | 32 | 2 | 40.06 | 69.83 | 83.26 | 40.75 | 71.91 | 104.78 | |
1 | 32 | 32 | 4 | 24.4 | 46.75 | 74.04 | 24.92 | 48.17 | 85.45 | |
実施日:2022-02-28、GENESIS 2.0β3 単精度 GPU : (4) NVIDIA A40 NVLink3 HPC-ProServer DPeR750XA CPU:(2) Intel Xeon Gold 6326 2.90G 16C/32T Mem:512GB((16)32GB) OS:Ubuntu 20.04 CUDA11.6 Driver: 510.47.03 NVLINK3 Compiler : GCC 9.3.0 MPI : OpenMPI 4.1.2 Math LIB : libblas3, liblapack3 |
1 | 8 | 8 | 1 | 23.54 | 70.36 | 163.67 | 24.48 | 73.22 | 178.17 |
1 | 8 | 8 | 2 | 16.78 | 62.75 | 164.07 | 17.71 | 65.78 | 177.43 | |
1 | 8 | 8 | 4 | 15.11 | 61.65 | 163.47 | 15.93 | 64.86 | 177.4 | |
1 | 8 | 16 | 1 | 21.18 | 55.39 | 105.51 | 22.14 | 56.86 | 115.6 | |
1 | 8 | 16 | 2 | 14.17 | 42.29 | 101.13 | 14.83 | 43.7 | 109.63 | |
1 | 8 | 16 | 4 | 11.66 | 39.03 | 101.47 | 12.34 | 40.67 | 109.91 | |
1 | 8 | 32 | 1 | 20.17 | 48.28 | 79.97 | 20.9 | 49.74 | 86.59 | |
1 | 8 | 32 | 2 | 12.86 | 33.19 | 71.43 | 13.36 | 34.07 | 77.62 | |
1 | 8 | 32 | 4 | 9.88 | 28.44 | 70.39 | 10.37 | 29.51 | 75.29 | |
1 | 16 | 16 | 1 | 36.55 | 80.07 | 112.64 | 37.48 | 82.18 | 126.58 | |
1 | 16 | 16 | 2 | 21.4 | 52.41 | 106.09 | 22.05 | 54.55 | 114.4 | |
1 | 16 | 16 | 4 | 15.29 | 43.32 | 106.39 | 15.81 | 45.06 | 115.17 | |
1 | 16 | 32 | 1 | 35.83 | 73.7 | 84.09 | 36.69 | 75.16 | 97.31 | |
1 | 16 | 32 | 2 | 20.29 | 44.87 | 71.41 | 20.86 | 46.12 | 79.47 | |
1 | 16 | 32 | 4 | 13.78 | 32.47 | 69.22 | 14.35 | 33.45 | 75.14 | |
1 | 32 | 32 | 1 | 73.62 | 122.27 | 118.04 | 74.55 | 124.35 | 161.15 | |
1 | 32 | 32 | 2 | 39.6 | 68.21 | 79.11 | 40.25 | 69.69 | 100.1 | |
1 | 32 | 32 | 4 | 23.97 | 44.72 | 69.5 | 24.45 | 46.01 | 80.54 | |
実施日:2022-02-13、GENESIS 2.0β3 mixed precision GPU : (2) GeForce RTX 3090 HPC-ProServer DPeT640 CPU : (2) Intel Xeon 6240R 2.4GHz 24C/48T Mem : Total 192GB (12) 16GB DDR4 2933MHz OS : CensOS 7.9 Compiler : Intel oneAPI 2021.5.0 MPI : Intel MPI 2021.5.0 Math LIB : Intel MKL 2021.5.0 CUDA 11.5 |
1 | 8 | 8 | 2 | 27.09 | 90.84 | 211.84 | 28.39 | 96.73 | 230.92 |
1 | 8 | 16 | 2 | 23.26 | 64.28 | 131.05 | 24.05 | 66.18 | 140.84 | |
1 | 8 | 32 | 2 | 21.45 | 53.36 | 97.41 | 22.11 | 54.62 | 103.95 | |
実施日:2022-02-13、GENESIS 2.0β3 単精度 GPU : (2) GeForce RTX 3090 HPC-ProServer DPeT640 CPU : (2) Intel Xeon 6240R 2.4GHz 24C/48T Mem : Total 192GB (12) 16GB DDR4 2933MHz OS : CensOS 7.9 Compiler : Intel oneAPI 2021.5.0 MPI : Intel MPI 2021.5.0 Math LIB : Intel MKL 2021.5.0 CUDA 11.5 |
1 | 8 | 8 | 2 | 21.88 | 80.94 | 203.25 | 22.84 | 84.85 | 220.41 |
1 | 8 | 16 | 2 | 17.17 | 54.93 | 123.3 | 17.91 | 56.97 | 132.19 | |
1 | 8 | 32 | 2 | 14.99 | 39.43 | 87.17 | 15.75 | 40.73 | 92.27 | |
実施日:2022-02-11、GENESIS 2.0β3 GPU : なし, 単精度 HPC-ProServer DPeR6525 CPU : (2) AMD EPYC 7713 2.0GHz 64C/128T Mem : Total 512GB (16) 32GB DDR4 3200MHz OS : Rocky Linux 8.5 Compiler : GCC 8.5 MPI : OpenMPI 4.1.2 Math LIB : AOCL 3.1.0 |
1 | 8 | 8 | 0 | 78.22 | 589.79 | 1322.93 | 79.12 | 586.85 | 1363.95 |
1 | 8 | 16 | 0 | 46.84 | 314.03 | 709.41 | 48.25 | 311.17 | 725.95 | |
1 | 16 | 16 | 0 | 44.82 | 310.87 | 682.32 | 45.5 | 311.57 | 696.96 | |
1 | 16 | 32 | 0 | 28.25 | 165.53 | 376.71 | 29.31 | 163.7 | 385.4 | |
1 | 32 | 32 | 0 | 25.74 | 163.5 | 364.31 | 26.48 | 162.35 | 388.21 | |
1 | 32 | 64 | 0 | 17.92 | 94.82 | 206.33 | 18.09 | 95.42 | 217.57 | |
1 | 64 | 64 | 0 | 16.54 | 95.48 | 201.69 | 17.07 | 94.52 | 224.73 | |
1 | 64 | 128 | 0 | 14.19 | 66.52 | 140.68 | 15.08 | 66.04 | 156.61 | |
実施日:2022-02-11、GENESIS 2.0β3 GPU : なし, 倍精度 HPC-ProServer DPeR6525 CPU : (2) AMD EPYC 7713 2.0GHz 64C/128T Mem : Total 512GB (16) 32GB DDR4 3200MHz OS : Rocky Linux 8.5 Compiler : GCC 8.5 MPI : OpenMPI 4.1.2 , Math LIB : AOCL 3.1.0 |
1 | 8 | 8 | 0 | 83.53 | 628.41 | 1369.99 | 85.3 | 627.82 | 1416.06 |
1 | 8 | 16 | 0 | 52.44 | 341.25 | 745.83 | 53.45 | 340.79 | 767.05 | |
1 | 16 | 16 | 0 | 49.88 | 324.9 | 724.25 | 52.39 | 328.82 | 744.25 | |
1 | 16 | 32 | 0 | 33.05 | 179.15 | 395.32 | 33.27 | 181.05 | 428.03 | |
1 | 32 | 32 | 0 | 29.66 | 176.1 | 385.07 | 30.11 | 177.2 | 412.63 | |
1 | 32 | 64 | 0 | 21.47 | 109.26 | 223.3 | 22.18 | 108.28 | 239.7 | |
1 | 64 | 64 | 0 | 19.24 | 109.62 | 218.03 | 19.81 | 109.5 | 246.5 | |
1 | 64 | 128 | 0 | 16.66 | 80.22 | 156.31 | 17.52 | 82.68 | 175.43 | |