SIMULIA
My Support | About SIMULIACareersContact Us | Sitemap | Dassault Systèmes
   
 

Abaqus Version 6.6 Performance Data

The Abaqus benchmark problems are intended to provide an estimate of the performance that can be expected when running representative Abaqus jobs on different computer platforms. For Version 6.6, Abaqus has updated the benchmark problems so that they all now exhibit performance characteristics of real-world applications.

The benchmark problems listed here are available upon request. If you are a customer, see Answer 2342 for instructions on obtaining the input files associated with these benchmark problems. If you are a hardware vendor and would like to submit performance data please contact .

NOTE: The Abaqus benchmark problems may change between releases. Therefore the timing data presented on these pages should not be directly compared with benchmark data obtained using other versions of Abaqus.

Contents

Abaqus/Standard Benchmark Problems
S1: Plate with gravity load
S2: Flywheel with centrifugal load
S3: Impeller frequencies
S4: Cylinder head bolt-up
S5: Stent expansion
S6: Tire footprint
Abaqus/Explicit Benchmark Problems
E1: Car crash
E2: Cell phone drop
E3: Sheet forming
E4: Projectile penetration
E5: Blast loaded plate
E6: Concentric spheres
Abaqus Performance Data
Abaqus/Standard Performance Data
Windows/x86-32
Windows/x86-64
Linux/x86-32
HP-UX/Itanium
IBM AIX/Power
Linux/Itanium
Linux/x86-64
Abaqus/Explicit Performance Data
Windows/x86-32
Linux/x86-32
HP-UX/Itanium
IBM AIX/Power
Linux/Itanium
Linux/x86-64


Abaqus/Standard Benchmark Problems

The problems described below provide an estimate of the performance that can be expected when running Abaqus/Standard on different computers. The jobs are representative of typical Abaqus/Standard applications including linear statics, nonlinear statics, and natural frequency extraction.

S1: Plate with gravity load

This benchmark is a linear static analysis of a plate with gravity loading. The plate is meshed with second-order shell elements of type S8R5 and uses a linear elastic material model. Edges of the plate are fixed. There is no contact.

S1
Input file name: s1.inp
Increments: 1
Iterations: 1
Degrees of freedom: 1,085,406
Floating point operations: 1.89E+011
Minimum memory requirement: 587 MB
Memory to minimize I/O: 2 GB
Disk space requirement: 2 GB

S2: Flywheel with centrifugal load

This benchmark is a mildly nonlinear static analysis of a flywheel with centrifugal loading. The flywheel is meshed using first-order hexahedral elements of type C3D8R and uses an isotropic hardening Mises plasticity material model. There is no contact. The nonlinearity in this problem arises from localized yielding in the vicinity of the bolt holes.

Two versions of this benchmark are provided. Both versions are identical except that one uses the direct sparse solver and the other uses the iterative solver.


S2a: Direct solver version
Input file name: s2a.inp
Increments: 6
Iterations: 12
Degree of freedom: 474,744
Floating point operations: 1.86E+012
Minimum memory requirement: 733 MB
Memory to minimize I/O: 849 MB
Disk space requirement: 4.55 GB

S2b: Iterative solver version
Input file name: s2b.inp
Increments: 6
Iterations: 11
Degrees of freedom: 474,744
Floating point operations: 8.34E+010
Minimum memory requirement: 2.8 GB
Memory to minimize I/O: NA
Disk space requirement: 387 MB

S3: Impeller frequencies

This benchmark extracts the natural frequencies and mode shapes of a turbine impeller. The impeller is meshed with second-order tetrahedral elements of type C3D10 and uses a linear elastic material model. Frequencies in the range from 100 Hz. to 20,000 Hz. are requested.

Three versions of this benchmark are provided: a 360,000 DOF version that uses the Lanczos eigensolver, a 1,100,000 DOF version that uses the Lanczos eigensolver, and a 1,100,000 DOF version that uses the AMS eigensolver.


S3a: 360,000 DOF Lanczos eigensolver version
Input file name: s3a.inp
Degrees of freedom: 362,178
Floating point operations: 3.42E+11
Minimum memory requirement: 384 MB
Memory to minimize I/O: 953 MB
Disk space requirement: 4.0 GB

S3b: 1,100,000 DOF Lanczos eigensolver version
Input file name: s3b.inp
Degrees of freedom: 1,112,703
Floating point operations: 3.03E+12
Minimum memory requirement: 1.33 GB
Memory to minimize I/O: 3.04 GB
Disk space requirement: 23.36 GB

S3c: 1,100,000 DOF AMS eigensolver version
Input file name: s3c.inp
Degrees of freedom: 1,112,703
Floating point operations: 3.03E+12
Minimum memory requirement: 1.33 GB
Memory to minimize I/O: 3.04 GB
Disk space requirement: 19.3 GB

S4: Cylinder head bolt-up

This benchmark is a mildly nonlinear static analysis that simulates bolting a cylinder head onto an engine block. The cylinder head and engine block are meshed with tetrahedral elements of types C3D4 or C3D10M, the bolts are meshed using hexahedral elements of type C3D8I, and the gasket is meshed with special-purpose gasket elements of type GK3D8. Linear elastic material behavior is used for the block, head, and bolts while a nonlinear pressure-overclosure relationship with plasticity is used to model the gasket. Contact is defined between the bolts and head, the gasket and head, and the gasket and block. The nonlinearity in this problem arises both from changes in the contact conditions and yielding of the gasket material as the bolts are tightened.

Three versions of this benchmark are provided: a 700,000 DOF version that is suitable for use with the direct sparse solver on 32-bit systems, a 5,000,000 DOF version that is suitable for use with the direct sparse solver on 64-bit systems, and a 5,000,000 DOF version that is suitable for use with the iterative solver on 64-bit systems.


S4a: 700,000 DOF direct solver version
Input file name: s4a.inp
Increments: 1
Iterations: 5
Degrees of freedom: 720,059
Floating point operations: 5.77E+11
Minimum memory requirement: 895 MB
Memory to minimize I/O: 3 GB
Disk space requirement: 3 GB

S4b: 5,000,000 DOF direct solver version
Input file name: s4b.inp
Increments: 1
Iterations: 5
Degrees of freedom: 5,236,958
Floating point operations: 1.14E+13
Minimum memory requirement: 4 GB
Memory to minimize I/O: 20 GB
Disk space requirement: 23 GB

S4c: 5,000,000 DOF iterative solver version
Input file name: s4c.inp
Increments: 1
Iterations: 3
Degrees of freedom: 5,248,154
Floating point operations: 3.74E+11
Minimum memory requirement: 16 GB
Memory to minimize I/O: NA
Disk space requirement: 3.3 GB

S5: Stent expansion

This benchmark is a strongly nonlinear static analysis that simulates the expansion of a medical stent device. The stent is meshed with hexahedral elements of type C3D8 and uses a linear elastic material model. The expansion tool is modeled using surface elements of type SFM3DR. Contact is defined between the stent and expansion tool. Radial displacements are applied to the expansion tool which in turn cause the stent to expand. The nonlinearity in this problem arises from large displacements and sliding contact.

Note: SIMULIA would like to acknowledge Nitinol Devices and Components for providing the original finite element model of the stent. The stent model used in this benchmark is not representative of current stent designs.


S5
Input file name: s5.inp
Increments: 21
Iterations: 91
Degrees of freedom: 181,692
Floating point operations: 1.80E+009
Minimum memory requirement: NA
Memory to minimize I/O: NA
Disk space requirement: NA

S6: Tire footprint

This benchmark is a strongly nonlinear static analysis that determines the footprint of an automobile tire. The tire is meshed with hexahedral elements of type C3D8, C3D6H, and C3D8H. Linear elastic and hyperelastic material models are used. Belts inside the tire are modeled using rebar layers and embedded elements. The rim and road surface are modeled as rigid bodies. Contact is defined between the tire and wheel and the tire and road surface. The analysis sequence consists of three steps. During the first step the tire is mounted to the wheel, during the second step the tire is inflated, and then during the third step a vertical load is applied to the wheel. The nonlinearity in the problem arises from large displacements, sliding contact, and hyperelastic material behavior.

S6
Input file name: s6.inp
Increments: 41
Iterations: 177
Degrees of freedom: 729,264
Floating point operations: NA
Minimum memory requirement: 397 MB
Memory to minimize I/O: 940 MB
Disk space requirement: NA


Abaqus/Explicit Benchmark Problems

The problems described below provide an estimate of the performance that can be expected when running Abaqus/Explicit on different computers. The jobs are representative of typical Abaqus/Explicit applications including high-speed dynamic impact events and quasi-static events with complicated contact conditions. The number of increments listed in the tables below are approximate and can vary somewhat depending on the hardware platform and the number of parallel domains.

E1: Car crash

This benchmark consists of passenger car impacting a rigid wall. The car is meshed primarily with shell elements of type S3RS and S4RS with isotropic hardening Mises plasticity material behavior. The various compenents of the car are connected using multi-point constraints and connector elements. Many of the suspension and drivetrain components are modeled as rigid bodies. The car, road surface, and wall are placed into a single general contact domain and the car is given an initial velocity of 25 mph.

E1
Input file name: e1.inp
Increments: 62,934
Number of elements: 274,632
Inital stable time increment: 9.535E-07
Final kinetic energy: 2.100E+06
Memory requirement: 1200 MB

E2: Cell phone drop

This benchmark consists of a simplified model of a cell phone impacting a fixed rigid floor. The cell phone components are meshed using a variety of element types including C3D8R, C3D10M, and S4R. The material behavior is modeled using linear elasticity, isotropic hardening Mises plasticity, and hyperelasticity. The components are assembled using surface-based mesh ties and placed into a general contact domain that also includes the floor. The initial velocity and orientation of the cell phone is defined such that a severe oblique impact occurs.

E2
Input file name: e2.inp
Increments: 87,369
Number of elements: 45,785
Inital stable time increment: 3.431E-08
Final kinetic energy: 6.043E+02
Memory requirement: 300 MB

E3: Sheet forming

This benchmark consists of forming a sheet metal part by the deep drawing process. The deformable sheet metal blank is meshed with shell elements of type S4R and uses an isotropic hardening Mises plasticity material model. The tools are meshed using surface elements of type SFM3D4R which are declared rigid. General contact is defined between the blank and tools. The analysis sequence consists of two steps. During the first step the blank is clamped between the binder and die and then during the second step the punch is displaced to form the part. Since the process is essentially quasi-static the computations are performed over a sufficiently long time period to render inertial effects negligible. The performance of this analysis is a direct measure of the performance of the three-dimensional general contact algorithm.

E3
Input file name: e3.inp
Increments: 31,177
Number of elements: 34,540 (deformable only)
Inital stable time increment: 7.151E-07
Final kinetic energy: 1.391E+03
Memory requirement: 550 MB

E4: Projectile penetration

This benchmark consists of a projectile penetrating a steel plate at an oblique angle. Both the projectile and plate are meshed using hexahedral elements of type C3D8R and use a rate-dependent isotropic hardening Mises plasticity material model with failure. The projectile and plate are placed into a general contact domain with surface erosion. The edges of the plate are held fixed and the initial velocity of the projectile is specified so that the projectile passes completely through the plate.

E4
Input file name: e4.inp
Increments: 12,433
Number of elements: 237,100
Inital stable time increment: 4.957E-09
Final kinetic energy: 1.469E+04
Memory requirement: 1400 MB

E5: Blast loaded plate

This benchmark consists of a stiffened steel plate subjected to a high intensity blast load. The plate is meshed using shell elements of type S4R and uses an isotropic hardening Mises plasticity material model. There is no contact.

E5
Input file name: e5.inp
Increments: 81,716
Number of elements: 50,000
Inital stable time increment: 6.122E-07
Final kinetic energy: 1.050E+01
Memory requirement: 150 MB

E6: Concentric spheres

This benchmark consists of a large number of concentric spheres with clearance between each sphere. The spheres are meshed using hexahedral elements of type C3D8R and use an isotropic hardening Mises plasticity material model. All of the spheres are placed into a single general contact domain and the outer sphere is violently shaken which results in complex contact interactions between the contained spheres.

E6
Input file name: e6.inp
Increments: 23,291
Number of elements: 244,124
Inital stable time increment: 2.116E-07
Final kinetic energy: 2.034E+06
Memory requirement: 1000 MB

Abaqus Performance Data

All times are given in seconds and include the time required for the main analysis executables (standard.exe and explicit.exe), the analysis input file processor (pre.exe) and the Abaqus/Explicit packager (package.exe).

Sequential Execution

The times listed for sequential execution are elapsed time (wall-clock time) when the problem is running stand-alone on the computer. For Abaqus/Standard, the elapsed time may vary for the same computer depending on the amount of memory that is assigned to the Abaqus/Standard job and the type of disks that are used. For Abaqus/Explicit, the memory configuration on the machine should not significantly affect the elapsed time provided that sufficient memory available. All Abaqus/Explicit timing data has been obtained using the single precision executable.

Unless otherwise noted, all of the parallel runs have been made using the default parallel settings in Abaqus.

The performance data reported here is intended to be used as a guideline. The times may change due to modifications within Abaqus and should not be used to compare platforms unless the same version of Abaqus has been used on both platforms. The times may also depend on the actual configuration of the computer. If a detailed comparison between computers is important, then the benchmark problems should be rerun using the same version of Abaqus and the actual configuration of the computer that is of interest.

Simultaneous Execution

The times listed for simultaneous execution are intended to represent situations where the machine may be heavily loaded with jobs as might occur in a multi-user environment. These times are obtained by running multiple copies of the same job simultaneously.

For Abaqus/Standard, the set of jobs used for simultaneous execution comprise s1, s2a, and s3a. For Abaqus/Explicit, the set of jobs used for simultaneous execution comprise e3, e4, and e5. The Total Time per Set is the sum of the times required to run multiple versions of every job in a set simultaneously. For example, the Total Time per Set running 3 jobs simultaneously would be obtained by adding the times required to run 3 s1 jobs at the same time, followed by 3 s2a jobs at the same time, and so on until all the jobs in the set have been run. The data for "1 Simultaneous" jobs is populated from the timings generated by the sequential runs. Each job is run using only a single processor.

The column headed "Average Time per Set" is the Total Time per Set divided by the number of simultaneous jobs. These times provide a basis for estimating the relative performance of computer systems heavily loaded with Abaqus jobs.


Abaqus/Standard Performance Data

Windows/x86-32

NOTE:  Parallel element operations are not supported on the Windows/x86-32 platform

Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
standard_memory:  2000 MB standard_memory_policy:  MAXIMUM
Computer system:  HP XW6200 Operating system:  Windows XP Professional SP2
Processor:  Intel Xeon 3600 MHz. Cores/processor:  1
Nodes:  1 Processors/node:  2
MPI library:  NA Interconnect:  NA
Memory/node:  4 GB I/O system:  73 GB
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 211 6511   1511     1694     4098 18703
2 202 4406   6853     1479     4231  

Windows/x86-64

NOTE:  Parallel element operations are not used here as this machine does not have Windows Compute Cluster Server which is necessary to support parallel element operations

Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-3
standard_memory:  6500 MB standard_memory_policy:  MODERATE
Computer system:  HP XW9300 Operating system:  Windows Server 2003 Standard x64 Edition
Processor:  AMD Opteron Dual Core 270, 2700 MHz. Cores/processor:  2
Nodes:  1 Processors/node:  2
MPI library:  NA Interconnect:  NA
Memory/node:  8 GB I/O system:  256 GB
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 200 8554 4144       1761     4626 16808
2 130 4532 2642       1444     4565 13468

Linux/x86-32

Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
standard_memory:  2000 MB standard_memory_policy:  MAXIMUM
Computer system:  HP XW6000 Operating system:  SuSE Linux 9.1
Processor:  Intel Xeon, 3200 MHz. Cores/processor:  1
Nodes:  1 Processors/node:  2
MPI library:  HP-MPI Interconnect:  NA
Memory/node:  2 GB I/O system:  73 GB
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 157 6271   1617 15819   1676     3236 15457
2 136 4174 3420 4225 17161   1187     1971 11007

HP-UX/Itanium

Benchmark Details
Submitted by:  HP Abaqus version:  6.6-EF1
standard_memory:  24576 MB standard_memory_policy:  MODERATE
Computer system:  HP Integrity RX8640 Operating system:  HP-UX 11.23
Processor:  Itanium2 DC 1.6 GHz (Montecito) Cores/socket:  2
Nodes:  1 Sockets/node:  16
MPI library:  HP-MPI 2.2 Interconnect:  HP zx2 chipset
Memory/node:  128 GB I/O system:  24 SCI 73GM 15K RPM disks in 2 MSA30's
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 139 3576 2836 780 3888 2854 1223 12587 7286 3814 13518
2 101 1901 1489       830 7117 4655 2140 8106
4 78 1038 793       587 4242 2804 1354 4942
8 67 608 529       471 2835 2018 973 3443
16 66 413 389       443 2203 1718 808 3458
32   355 362       498 2158 1727 821  
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 4495 4495
2 4591 2296
4 4553 1138
8 4796 600
16 5863 366

IBM AIX/Power

Benchmark Details
Submitted by:  IBM Abaqus version:  6.6-2
standard_memory:  TBD standard_memory_policy:  MODERATE
Computer system:  IBM P570 Operating system:  AIX 5.3
Processor:  Power5+ 2.2 GHz Cores/processor:   
Nodes:  1 Cores/node:  16
MPI library:  POE Interconnect:  NA
Memory/node:  128 GB I/O system:  TBD
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 112 3428 2429 518 2986   956 11212 5730 2695 9543
2 86 1853 1283 613 3224   689 6026 3410 1591 5786
4 70 1065 720 619 3857   508 3432 2372 1070 3630
8 30     706 3487   450 2578 1950 896 3263
16 67 516 374 1223     420 1616      

Linux/Itanium

Benchmark Details
Submitted by:  HP Abaqus version:  6.6-1
standard_memory:  10240 mb standard_memory_policy:  MODERATE
Computer system:  HP Integrity RX3600 Operating system:  RHEL4 U4
Processor:  Itanium2 DC 1.6 GHz (Montecito) Cores/processor:  2
Nodes:  1 Sockets/node:  2
MPI library:  HP-MPI 2.2.5 Interconnect:  HP zx2 chipset
Memory/node:  32 GB I/O system:  20 SCSI 73GB 15K RPM disks
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 139 3569 3623 717 3639 3640 1187 12726 8475 3678 13401
2 101 1883 1967       810 7174 5223 2066 7855
4 78 1091 1241       587 4409 3585 1285 4836
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 4425 4425
2 4485 2243
4 4664 1166
 
Benchmark Details
Submitted by:  HP Abaqus version:  6.6-EF1
standard_memory:  24576 MB standard_memory_policy:  MODERATE
Computer system:  HP Integrity RX8640 Operating system:  Red Hat Enterprise Linux 4 Update 4
Processor:  Itanium2 DC 1.6 GHz (Montecito) Cores/socket:  2
Nodes:  1 Sockets/node:  16
MPI library:  HP-MPI 2.2.5 Interconnect:  HP zx2 chipset
Memory/node:  128 GB I/O system:  24 SCI 73GM 15K RPM disks in 2 MSA30's
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 145 3632 3808 762 3851 4116 1230 13054 9690 3697 13639
2 115 1969 2037       887 7518 5390 2113 8267
4 88 1086 1063       619 4558 3416 1292 5142
8 77 684 627       516 3125 2489 930 3724
16 77 530 430       493 2632 2118 761 3226
32   479 392       560 2407 2094 698  
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 4539 4539
2 4585 2293
4 4614 1154
8 4725 591
16 5084 318
 
Benchmark Details
Submitted by:  SGI Abaqus version:  6.6-1
standard_memory:  standard_memory_policy:  MODERATE
Computer system:  SGI Altix 4700 Operating system:  Propack 4 SP 3
Processor:  Itanium Montecito, 1.6 GHz Cores/socket:  2
Nodes:  32 blade nodes Sockets/node:  64
MPI library:  ProPack MPT Interconnect:  NA
Memory/node:  8 GB (256 GB total addressable) I/O system:  xfs
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 155 3615 3768 787 3607   1215 12900 9392 3713 13673
2 116 1909 2029 906 3705   840 7388 5774 2234 7813
4 93 1110 1202 713 3611   601 4568 4230 1292 4767
8 94 758 683       517 3290 2837 1008 3835
16 86 591 413       486 2963 1964 885 3570
 
Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
standard_memory:  32000 mb standard_memory_policy:  MODERATE
Computer system:  SGI Altix Operating system:  SUSE Linux Enterprise Server 9
Processor:  Itanium 2, 1.6 GHz Cores/processor:  1
Nodes:  1 Processors/node:  32
MPI library:  ProPack MPT Interconnect:  NA
Memory/node:  128 GB I/O system:  xfs
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 148 3686 3725 804 3996 3318 1268 13190 9979 3889 14574
2 107 1966 3235 966 4090 3825 873 7399 5868 2229 8492
4 85 1262 1591 772 3994 4385 658 5432 5092 1404 6534
8 82 1091 613   3301 2965 543 4228 2436 1154 4987
16 78 797 522       559 4368 1940 974 4720

Linux/x86-64

Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
standard_memory:  4000 MB standard_memory_policy:  MAXIMUM
Computer system:  HP XW6200 Operating system:  Red Hat Enterprise Linux AS 3.0
Processor:  Intel EM64T, 3600 MHz. Cores/processor:  1
Nodes:  1 Processors/node:  2
MPI library:  HP-MPI Interconnect:  NA
Memory/node:  4 GB I/O system:  Two discs, 72 GB, 72 GB
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 125 4790 3168     3587 1160     2821 13103
2 118 3010 2333 2742   3119 988     2037 8781
 
Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
standard_memory:  7500 MB standard_memory_policy:  MODERATE
Computer system:  IBM x336 Cluster Operating system:  Red Hat Enterprise Linux AS 3.0
Processor:  Intel EM64T, 2800 MHz Cores/processor:  1
Nodes:  8 Processors/node:  2
MPI library:  HP-MPI Interconnect:  Topspin Infiniband
Memory/node:  8 GB I/O system:  ext3 with journaling
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 126 4134 2802 771 10240   1120 19687   2800 13132
2 91 2282 2320 1016 23787   746 21586   2022 8331
4 165 2160 1229       748 10305     10288
8 89 1649 3110       700   2471   7099
 
Benchmark Details
Submitted by:  HP Abaqus version:  6.6-1 for 1,2, and 4 cores/6.6-EF1 for 8 and 16 cores
standard_memory:  7000 mb standard_memory_policy:  MODERATE
Computer system:  HP XC4000 (DL145 G2) Operating system:  HP XC V3.1 BL5 (RHEL 4)
Processor:  AMD Opteron, 2.6 GHz Cores/processor:  2
Nodes:  16 Sockets/node:  2
MPI library:  HP-MPI 2.2.0.2 Interconnect:  Infiniband SDR PCI-X/PCI Express (Voltaire)
Memory/node:  8 GB I/O system:  HP SFS 16 Object Storage Targets, Voltaire Infiniband
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 137 6002 3497 857 10617 5027 1247 23025   2330 12297
2 98 3142 1766 1034 11832 3411 793 14256   1509 7483
4 75 1759 1147 1075 11064 3116 533 10002   1067 4728
8 69 1235 636       524 7044     6574
16 59 722 362       388 3543     3988
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 6996 6996
2 7906 3953
4 8682 2171
 
Benchmark Details
Submitted by:  HP Abaqus version:  6.6-EF1
standard_memory:  7000 mb standard_memory_policy:  MODERATE
Computer system:  HP XC3000 (DL140 G3) Operating system:  RHEL 4
Processor:  Intel Xeon DC 3.0 GHz Cores/processor:  2
Nodes:  16 Sockets/node:  2
MPI library:  HP-MPI 2.2.5 Interconnect:  Infiniband SDR PCI-X/PCI Express (Voltaire)
Memory/node:  8 GB I/O system:  2 SAS disks striped
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 86 3249 2576 552 5176 2534 775 13121   1741 8204
2 61 1756 1468       511 8524   1237 5014
4 52 1132 1370       372 6610   1054 4251
8 49 900 764       376 4767     5136
16 46 477 444       271 2390     3283
 
Benchmark Details
Submitted by:  HP Abaqus version:  6.6-1
standard_memory:  12288 mb standard_memory_policy:  MODERATE
Computer system:  HP XC3000 (DL140 G3) Operating system:  Red Hat Enterprise Linux 4
Processor:  Intel Xeon DC, 3.0 GHz Cores/processor:  2
Nodes:  1 Sockets/node:  2
MPI library:  HP-MPI 2.2.5 Interconnect:   
Memory/node:  16 GB I/O system:  5 15K RPM SCSI disks in an MSA30
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 81 3210 2440 541 3091 2333 756 11695 5897 1715 7953
2 63 1742 1394 647 4112 1655 500 6954 4780 1119 4901
4 49 1093 1292 576 15763 1574 362 4805 4189 815 3335
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 3832 3832
2 3894 1947
4 5010 1253
 
Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
standard_memory:  7000 mb standard_memory_policy:  MODERATE
Computer system:  HP XC Opteron Cluster Operating system:  Red Hat Enterprise Linux 4
Processor:  AMD Opteron, 2200 MHz Cores/processor:  2
Nodes:  16 Sockets/node:  2
MPI library:  HP-MPI Interconnect:  Voltaire Infiniband
Memory/node:  8 GB I/O system:  ext2 filesystem, 72 GB SATA drive
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 161 7150 3833 1000 11415 5583 1413 26996   2594 13542
2 108 3619 1858 1200 31674 3827 900 17879   1620 7969
4 91 2072 1136 2208 85293 3194 763 14226   1076 4888
8 83 1884 672       605 9561     7800
 
Benchmark Details
Submitted by:  Intel Abaqus version:  6.6-1
standard_memory:  12800 mb standard_memory_policy:  MAXIMUM
Computer system:  Supermicro SuperServer 7045B-8R+ Operating system:  Red Hat Enterprise Linux 4
Processor:  Intel Xeon Processor 5160, 3.0 GHz Cores/socket:  2
Nodes:  1 Sockets/node:  2
MPI library:  HP-MPI Interconnect:  NA
Memory/node:  16 GB, 1333 MHz fully buffered dimms I/O system:  ext2 filesystem, four 73GB 15k RPM SCSI, RAID0
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 73 3198 2523 523 3037 2293 750 11734   1705 8370
2 51 1723 1828 670 5003 1616 480 6860 5435 1069 5055
4 37 1005 1354 563 11227 1231 342 4468 5061 768 3430
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 3794 3794
2 3870 1935
4 5163 1291
 
Benchmark Details
Submitted by:  SGI Abaqus version:  6.6-1
standard_memory:  5000 mb standard_memory_policy:  MAXIMUM
Computer system:  SGI Altix XE Operating system:  Propack 5, SP 2
Processor:  3.0 GHz/1.3 GHz Front Side Bus Cores/socket:  2
Nodes:  1 Sockets/node:  2
MPI library:  HP-MPI Interconnect:  NA
Memory/node:  16 GB I/O system:  4 SAS disks
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 79 3165 2457 514 3001   750 10891   1726 8222
2 54 1705 1480 638 5305   479 6122   1133 5095
4 42 1023 1321 554     335 3864   823 3443
 
Benchmark Details
Submitted by:  NetEffect Abaqus version:  6.6-EF1
standard_memory:  7000 mb standard_memory_policy:  MODERATE
Computer system:  HP DL145 G2 Operating system:  RHEL 4
Processor:  AMD Opteron, 2.6 GHz Cores/socket:  2
Nodes:  8 Sockets/node:  2
MPI library:  HP-MPI 2.2.0.2 Interconnect:  NetEffect NE010e, 10 Gb Ethernet
Memory/node:  8 GB I/O system:  72 GB SATA, ext2 filesystem
Sequential Execution
CPUs S1 S2a S2b S3a S3b S3c S4a S4b S4c S5 S6
1 143 7261 3132       1538 25502   24222 12341
2 95 2847 1847       775 13326   1580 8866
4 70 1493 1105       507 6790 14570 1077 5110
8 57 804 619       391 3098 2253 692 4413
16 54 473 367       343 2077 2194 647 3523
24 53 368 335       334 1738 2013 631 3379
32 53 429 252       343 1926 1991 637 4046

Abaqus/Explicit Performance Data

Windows/x86-32

Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
Computer system:  HP XW6200 Operating system:  Windows XP Professional SP2
Processor:  Intel Xeon 3600 MHz. Cores/processor:  1
Nodes:  1 Processors/node:  2
MPI library:  NA Interconnect:  NA
Memory/node:  4 GB I/O system:  73 GB
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 39806 24908 16211 8292 11176 22234
2 26038 13974 10520   6329 14224

Linux/x86-32

Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
Computer system:  HP XW6000 Operating system:  SuSE Linux 9.1
Processor:  Intel Xeon, 3200 MHz. Cores/processor:  1
Nodes:  1 Processors/node:  2
MPI library:  HP-MPI Interconnect:  NA
Memory/node:  2 GB I/O system:  73 GB
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 47482 30480 21586 9303 17293 24553
2 33901 17062 15160 6481 9989 17163

HP-UX/Itanium

Benchmark Details
Submitted by:  HP Abaqus version:  6.6-1
Computer system:  HP Integrity RX3600 Operating system:  HP-UX 11.23
Processor:  Itanium2 DC 1.6 GHz (Montecito) Cores/processor:  2
Nodes:  1 Sockets/node:  2
MPI library:  HP-MPI 2.2 Interconnect:  HP zx2 chipset
Memory/node:  16 GB I/O system:  20 DS2405 disks
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1   16142 15088 6780 6521 18002
2 19437 8620 8059 3892 3512 10561
4 12573 4967 4428 2422 2042 6546
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 28389 28389
2 29568 14784
4 32571 8143
 
Benchmark Details
Submitted by:  HP Abaqus version:  6.6-EF1
Computer system:  HP Integrity RX8640 Operating system:  HP-UX 11.23
Processor:  Itanium2 DC 1.6 GHz (Montecito) Cores/processor:  2
Nodes:  1 Sockets/node:  2
MPI library:  HP-MPI 2.2 Interconnect:  HP zx2 chipset
Memory/node:  128 GB I/O system:  20 SCSI 73GB 15K RPM disks in 2 MSA30's, VxFS 5.0
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 36172 17113 16623 7075 6953 19604
2 20404 8781 8311 3786 3539 11006
4 11952 4635 4077 1991 1793 6163
8 6439 2506 2271 1178 839 3507
16 3857 1375 1267 763 414 2047
32 2723 912 861 609 278 1438
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 30651 30651
2 30737 15639
4 30640 7660
8 31463 3933
16 33263 2079
32 39465 1233

IBM AIX/Power

Benchmark Details
Submitted by:  IBM Abaqus version:  6.6-2
Computer system:  IBM P570 Operating system:  AIX 5.3
Processor:  Power5+ 2.2 GHz Cores/processor:   
Nodes:  1 Cores/node:  16
MPI library:  POE Interconnect:  NA
Memory/node:  128 GB I/O system:  TBD
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 32157 21950 13057 8276 8475 17147
2 17385 11061 6370 4433 4143 9366
4 9757 5361 3033 2317 2084 5216
8 5231 2757 1556      
16 3204 1614 980 876 603 1780

Linux/Itanium

Benchmark Details
Submitted by:  HP Abaqus version:  6.6-1
Computer system:  HP Integrity RX8640 Operating system:  Red Hat Enterprise Linux 4 Update 4
Processor:  Itanium2 DC 1.6 GHz (Montecito) Cores/processor:  2
Nodes:  1 Sockets/node:  16
MPI library:  HP-MPI 2.2.5 Interconnect:  HP zx2 chipset
Memory/node:  128 GB I/O system:  24 SCSI 73GB 15K RPM disks in 2 MSA30's
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 48155 19454 23712 9081 8384 21964
2 26151 10191 11498 4953 4328 12361
4 15135 5396 5269 2522 2133 6972
8 8065 2865 2748 1518 945 3936
16 4663 1532 1406 906 438 2265
32 3034 916 893 681 260 1502
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 41177 41177
2 41179 20590
4 45675 11419
8 42503 5313
16 45277 2830
32 61886 1934
 
Benchmark Details
Submitted by:  SGI Abaqus version:  6.6-1
Computer system:  SGI Altix 4700 Operating system:  Propack 4 SP 3
Processor:  Itanium Montecito, 1.6 GHz Cores/socket:  2
Nodes:  32 blade nodes Sockets/node:  64
MPI library:  ProPack MPT Interconnect:  NA
Memory/node:  8 GB (256 GB total addressable) I/O system:  xfs
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 44760 18663 21301 8772 8123 20643
2 25268 9993 10687 4880 4322 12103
4 15493 5580 5175 2748 2248 7250
8 8296 2731 2623 1614 826 3966
16 4667 1306 1267 914 421 2236
32 2668 850 763 607 263 1375
 
Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
Computer system:  SGI Altix Operating system:  SUSE Linux Enterprise Server 9
Processor:  Itanium 2, 1.6 GHz Cores/processor:  1
Nodes:  1 Processors/node:  32
MPI library:  ProPack MPT Interconnect:  NA
Memory/node:  128 GB I/O system:  xfs
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
2 41607 11801 10390 7562 8469 11724
4 18480 9458 5933 3935 4128 10063
8 12068 5116 3415 2693 1569 6167

Linux/x86-64

Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
Computer system:  HP XW6200 Operating system:  Red Hat Enterprise Linux AS 3.0
Processor:  Intel EM64T, 3600 MHz. Cores/processor:  1
Nodes:  1 Processors/node:  2
MPI library:  TBD Interconnect:  TBD
Memory/node:  4 GB I/O system:  Two discs, 72 GB, 72 GB
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 36958 22483 15192 7620 9341 19465
2 24110 12538 9869 4673 5512 12694
 
Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
Computer system:  IBM x336 Cluster Operating system:  Red Hat Enterprise Linux AS 3.0
Processor:  Intel EM64T, 2800 MHz. Cores/processor:  1
Nodes:  8 Processors/node:  2
MPI library:  HP-MPI Interconnect:  Topspin Infiniband
Memory/node:  8 GB I/O system:  ext3 with journaling
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 37527 22915 15246 7831 9621 19654
2 24686 12730 9809 4745 5582 12742
4 13708 6380 5137 2627 2893 7207
8 7502 3486 2725 1593 1527 4239
 
Benchmark Details
Submitted by:  HP Abaqus version:  6.6-1
Computer system:  HP XC4000 (DL145 G2) Operating system:  HP XC V3.1 BL5 (RHEL 4)
Processor:  AMD Opteron, 2.6 GHz Cores/processor:  2
Nodes:  16 Sockets/node:  2
MPI library:  HP-MPI 2.2.0.2 Interconnect:  Infiniband SDR PCI-X/PCI Express (Voltaire)
Memory/node:  8 GB I/O system:  HP SFS 16 Object Storage Targets, Voltaire Infiniband
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 33248 21495 13805 6938 8752 17853
2 18026 10686 7086 3882 4516 9869
4 11569 5853 4321 2243 2495 6105
8 6153 3087 2505 1367 1305 3544
16 3548 1758 1391 768 729 2033
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 29495 29495
2 29685 14843
4 33326 8332
 
Benchmark Details
Submitted by:  HP Abaqus version:  6.6-1
Computer system:  HP XC3000 (DL140 G3) Operating system:  Red Hat Enterprise Linux 4
Processor:  Intel Xeon DC, 3.0 GHz Cores/processor:  2
Nodes:  16 Sockets/node:  2
MPI library:  HP-MPI 2.2.5 Interconnect:  Infiniband SDR PCI-X/PCI Express (Voltaire)
Memory/node:  8 GB I/O system:  2 SAS disks striped
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 25371 14248 10090 5191 6211 13282
2 14298 7366 5274 3304 3334 7530
4 10869 4588 3916 1964 2089 5490
8 5819 2412 2031 1157 1054 3076
16 3156 1350 1056 642 543 1749
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 21492 21492
2 21673 10837
4 29470 7368
 
Benchmark Details
Submitted by:  SIMULIA Abaqus version:  6.6-1
Computer system:  HP XC Opteron Cluster Operating system:  Red Hat Enterprise Linux 4
Processor:  AMD Opteron, 2200 MHz Cores/processor:  2
Nodes:  16 Sockets/node:  2
MPI library:  HP-MPI Interconnect:  Voltaire Infiniband
Memory/node:  8 GB I/O system:  ext2 filesystem, 72 GB SATA drive
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 42663 26441 17878 8029 10276 21652
2 20353 12320 10217 4966 5316 12782
4 15616 6627 6094 2633 2854 8478
8 7476 3752 3802 1783 1460 4551
16 5076 2321 1946 1051 985 2855
 
Benchmark Details
Submitted by:  Intel Abaqus version:  6.6-1
Computer system:  Supermicro SuperServer 7045B-8R+ Operating system:  Red Hat Enterprise Linux 4
Processor:  Intel Xeon Processor 5160, 3.0 GHz Cores/sockets:  2
Nodes:  1 Sockets/node:  2
MPI library:  HP-MPI Interconnect:  NA
Memory/node:  16 GB, 1333 MHz fully buffered dimms I/O system:  ext2 filesystem, four 73GB 15k RPM SCSI, RAID0
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 25076 14616 10563 5225 6272 13242
2 14878 8044 6316 3310 3483 8048
4 10696 4646 3881 1997 2126 5460
Simultaneous Execution
Number of Simultaneous Jobs Total Time per Set Average Time per Set
1 22060 22060
2 24418 12209
4 30585 7646
 
Benchmark Details
Submitted by:  NetEffect Abaqus version:  6.6-EF1
Computer system:  HP DL145 G2 Operating system:  RHEL 4
Processor:  AMD Opteron, 2.6 GHz Cores/sockets:  2
Nodes:  8 Sockets/node:  2
MPI library:  HP-MPI 2.2.0.2 Interconnect:  NetEffect NE010e, 10 Gb Ethernet
Memory/node:  16 GB I/O system:  72 GB SATA, ext2 filesystem
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 33310 21866 13677 7082 8871 17879
2 18014 10925 7500 3875 4493 10014
4 11473 5851 3861 2204 2490 5936
8 5595 3000 2308 1274 1058 3363
16 3571 1741 1404 1008 719 2071
24 2861 1320 1006 626 523 1584
32 2130 1085 870 532 459 1286
 
Benchmark Details
Submitted by:  SGI Abaqus version:  6.6-1
Computer system:  SGI Altix XE Operating system:  ProPack 5, SP 2
Processor:  3.0 GHz/1.3 GHz Front Side Bus Cores/sockets:  2
Nodes:  4 Sockets/node:  2
MPI library:  HP-MPI Interconnect:  Voltaire DDR IB
Memory/node:  16 GB I/O system:  4 SAS disks
Sequential Execution
CPUs E1 E2 E3 E4 E5 E6
1 24874 14458 10611 5178 6208 13140
2 14195 7426 5665 2901 3361 7566
4 10324 4537 3878 1892 2026 5238
8 5419 2409 1977 1102 1067 3012
16 3074 1444 1061 630 551 1839

 

 

 
© Dassault Systèmes, 2004, 2012 - All Rights Reserved
Terms of Use | Trademarks | Privacy Statement