TAGGED: computer-memory, dat-files, hpc-cluster, transient-structural
-
-
February 6, 2023 at 8:04 pm
helen.durand
SubscriberHello, I am running transient structural simulations on a HPC cluster (using input .dat files) and have been running into 'out of memory' issues. I am aware that the mesh can be coarsened to alliviate this issue (especially away from areas of interest), but I was wondering if there are other techniques to address this problem.
I am also implementing APDL command blocks in the simulation. Could reducing the calulations in these codes (by rewriting the code in a more optimal way, for example) help?
Could adjusting 'Analysis Setings' > 'Output Controls' or 'Analysis Setings' > 'Analysis Data Management' help?
Other ideas or feedback would be apprciated. Thank you!
-
February 7, 2023 at 2:43 pm
Mike Rife
Ansys EmployeeHi Helen
Can you post the specific warning/error message(s) that you have been receiving? Also is there a message about pivoting having been activated?
Mike
-
February 7, 2023 at 2:57 pm
helen.durand
SubscriberHere is the error. I do not think I have gotten any errors about pivoting.
/XXX/el7/pre-compiled/ansys/2020r1/v201/ansys/bin/ansysdis201: line 77: 2367 Killed /XXXel7/pre-compiled/ansys/2020r1/v201/ansys/bin/linx64/ansys.e -dis -mpi INTELMPI -j “file” -s read -b -i “./v19_1_struct.dat” -o “./outputfile.out”
srun: error: YYY8: task 0: Out Of Memory
slurmstepd: error: Detected 484 oom-kill event(s) in StepId=11323733.0. Some of your processes may have been killed by the cgroup out-of-memory handler.
[mpiexec@YYY8] HYDT_bscu_wait_for_completion (../../tools/bootstrap/utils/bscu_wait.c:151): one of the processes terminated badly; aborting
[mpiexec@YYY8] HYDT_bsci_wait_for_completion (../../tools/bootstrap/src/bsci_wait.c:36): launcher returned error waiting for completion
[mpiexec@YYY8] HYD_pmci_wait_for_completion (../../pm/pmiserv/pmiserv_pmci.c:540): launcher returned error waiting for completion
[mpiexec@YYY8] main (../../ui/mpich/mpiexec.c:1149): process manager error waiting for completion
slurmstepd: error: Detected 484 oom-kill event(s) in StepId=11323733.batch. Some of your processes may have been killed by the cgroup out-of-memory handler.Also, these are the warnings and errors from the Solver Output. I do not suspect these are the cause of any issues.
/COM,ANSYS RELEASE 2020 R1 BUILD 20.1 UP20191203 13:33:04
*** WARNING *** CP = 180.521 TIME= 13:36:34
No *DO trips needed, enter *ENDDO .*** WARNING *** CP = 700635.438 TIME= 05:21:47
Element shape checking is currently inactive. Issue SHPP,ON or
SHPP,WARN to reactivate, if desired. -
February 7, 2023 at 3:01 pm
Mike Rife
Ansys EmployeeHi Helen
Well the Cgroup is outside of Ansys' control. Are there any messages in the mapdl output file regarding an increase in database size? At what step was the job when it was killed? I.E. had it reached the point of solving? Mike
-
- You must be logged in to reply to this topic.

Boost Ansys Fluent Simulations with AWS
Computational Fluid Dynamics (CFD) helps engineers design products in which the flow of fluid components is a significant challenge. These different use cases often require large complex models to solve on a traditional workstation. Click here to join this event to learn how to leverage Ansys Fluids on the cloud, thanks to Ansys Gateway powered by AWS.

Earth Rescue – An Ansys Online Series
The climate crisis is here. But so is the human ingenuity to fight it. Earth Rescue reveals what visionary companies are doing today to engineer radical new ideas in the fight against climate change. Click here to watch the first episode.

Ansys Blog
Subscribe to the Ansys Blog to get great new content about the power of simulation delivered right to your email on a weekly basis. With content from Ansys experts, partners and customers you will learn about product development advances, thought leadership and trends and tips to better use Ansys tools. Sign up here.
- Solver Pivot Warning in Beam Element Model
- Saving & sharing of Working project files in .wbpz format
- Understanding Force Convergence Solution Output
- An Unknown error occurred during solution. Check the Solver Output…..
- What is the difference between bonded contact region and fixed joint
- User manual
- The solver engine was unable to converge on a solution for the nonlinear problem as constrained.
- whether have the difference between using contact and target bodies
- material damping and modal analysis
- Colors and Mesh Display
-
5282
-
3299
-
2469
-
1308
-
1006
© 2023 Copyright ANSYS, Inc. All rights reserved.