TAGGED: hpc, restart-points, transient-structural
November 21, 2020 at 3:43 am
helen.durand
Subscriber
Hello,
I was running a transient structural simulation on an HPC system. After a couple of days of running properly, and after about 20% of the simulation had completed, the simulation stopped. Neither the solver output report nor the .err file contains any error messages, and I am struggling to determine why the simulation stopped. Looking at the outline in Workbench (see image below), there is a symbol next to 'Solution' that is related to solution restarts (https://ansyshelp.ansys.com/account/secured?returnurl=/Views/Secured/corp/v201/en/wb_sim/ds_solution_restarts.html%23ds_restarts_managing).
What would cause the simulation to stop running and automatically create a solution restart? Did something go wrong? Can I just continue the solution from this restart point?
December 2, 2020 at 3:58 pm
Mike Rife
Ansys Employee
Hi. How was the model launched on the HPC system? Is RSM configured to submit to the cluster? If so, do you still have the RSM log, and are there any clues in it?
If you submitted the job manually, was it a direct submission, i.e. did you issue the MAPDL command to start the batch solve? Or did you submit the job via a job scheduler? If a job scheduler was used, are there any logs from it?
Review the solve output file; a clue may not appear as an error or warning message.
If a job scheduler was used, either by direct submission or from Mechanical via RSM, does the job scheduler queue have a time limit?
Is the HPC system running Linux? If so, ask the cluster admin about any cgroup rules in place, or any OS rules on hardware usage. A cgroup rule on, say, RAM usage (e.g. don't let a compute node use more than 95% of RAM) can lead the OS to kill a process that is using a lot of RAM. So the OS may have killed the job. That may or may not be evident in the solve output file, the job scheduler log, or the RSM log.
You can restart the solution from time 0.11 at least. But you may want to find out about any time and/or hardware usage rules first, so you don't run into this again in a few days.
Mike
The topic ‘Automatic creation of restart point on HPC system – did something go wrong?’ is closed to new replies.
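The log review suggested in the reply above can be partly automated. The following is a minimal Python sketch, not an Ansys or scheduler tool: it scans whatever text logs you still have (solve output, scheduler log, RSM log) for lines that hint at a time-limit cancellation or an OS memory kill. The search patterns are assumptions based on messages commonly seen on Linux clusters; the thread does not say which scheduler is in use, so adjust them to what your system actually writes. The function names `scan_text` and `scan_logs` are made up for this example.

```python
import re
import sys
from pathlib import Path

# Assumed clue patterns (hypothetical; tune them for your scheduler and OS).
CLUES = {
    "time limit": re.compile(r"time limit|walltime", re.IGNORECASE),
    "memory kill": re.compile(r"out of memory|oom[-_ ]?kill", re.IGNORECASE),
    "signal": re.compile(r"\bkilled\b|sigterm|sigkill|signal (9|15)\b", re.IGNORECASE),
}


def scan_text(text):
    """Return (line_number, label, line) for every line matching a clue pattern."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        for label, pattern in CLUES.items():
            if pattern.search(line):
                hits.append((number, label, line.strip()))
    return hits


def scan_logs(paths):
    """Scan each existing file in `paths`; return {path: hits}."""
    results = {}
    for path in paths:
        p = Path(path)
        if p.is_file():
            # errors="replace" keeps the scan going on odd bytes in a log.
            results[str(p)] = scan_text(p.read_text(errors="replace"))
    return results


if __name__ == "__main__":
    # Log file paths are passed on the command line.
    for name, hits in scan_logs(sys.argv[1:]).items():
        for number, label, line in hits:
            print(f"{name}:{number}: [{label}] {line}")
```

Saved as, say, `scan_logs.py` (a hypothetical name), it could be run as `python scan_logs.py solve.out scheduler.log`, where both file names are placeholders for your own logs. An empty result does not rule out a kill by the scheduler or OS; as the reply notes, the cause may not show up in any of these files, so checking the queue time limit and cgroup rules with the cluster admin is still worthwhile.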