June 18, 2021 at 8:19 pmhelen.durandSubscriber
We are running ANSYS Structural simulations on an HPC cluster with a user interface. We are sometimes able to complete the simulation with multiple cores, and sometimes we are not. Specifically:
- For a 32-core job, we can run it successfully if we ask for 16 cores in the distributed option.
- If we ask for 7 cores or 32 cores in the distributed option for a 32-core job, the simulation becomes stuck at 10% completion of “Building the Mathematical Model” and never continues to progress.
- If we ask for 27 cores in the distributed option for a 32-core job, the simulation completes several load steps and then suddenly shuts down Structural with no warning or listed errors.
Here are our questions:
June 21, 2021 at 9:45 pmMike RifeAnsys EmployeeWhat OS is being used by the cluster, and which job scheduler? Often these types of issues are OS dependent. Also can you please define the term 'job'. Usually when we say a solve is run on say 7 of 32 physical cpu cores that the compute node has, that is a 7 core job.
- Why can we sometimes complete the simulation in a parallel fashion, and sometimes not? There are no security programs on the HPC cluster.
- What should be the maximum number of cores we can request with the distributed option for a job with 32 cores? Should we be able to request all 32, or do some need to be available for some “background” tasks of ANSYS?
- Why does the program sometimes crash and sometimes instead become stuck at 10% completion of “Building the Mathematical Model”?
- Is there any rule about the number of cores we can ask for in the distributed option compared to the total number of cores in the job (like does it need to be some divisor of the total number of cores)? If yes, why?
The 'rules of thumb' for the number of cpu cores to use can depend on the type of physics being solved, the hardware being used, any possible bottleneck of the solution, etc. For example usually we want to run the solution 'in-core' meaning the FEA matrices and vectors are kept wholly in RAM. But to do so may require running on more cpu cores than the FEM warrants in order to check out enough compute nodes to get the needed RAM.
Viewing 1 reply thread
Ansys Innovation Space
- You must be logged in to reply to this topic.
Simulation World 2022
Check out more than 70 different sessions now available on demand. Get inspired as you hear from visionary companies, leading researchers and educators from around the globe on a variety of topics from life-saving improvements in healthcare, to bold new realities of space travel. Take a leap of certainty and check out a session today here.
Earth Rescue – An Ansys Online Series
The climate crisis is here. But so is the human ingenuity to fight it. Earth Rescue reveals what visionary companies are doing today to engineer radical new ideas in the fight against climate change. Click here to watch the first episode.
Subscribe to the Ansys Blog to get great new content about the power of simulation delivered right to your email on a weekly basis. With content from Ansys experts, partners and customers you will learn about product development advances, thought leadership and trends and tips to better use Ansys tools. Sign up here.Trending discussions
- How to calculate the residual stress on a coating by Vickers indentation?
- An Unknown error occurred during solution. Check the Solver Output…..
- Saving & sharing of Working project files in .wbpz format
- Solver Pivot Warning in Beam Element Model
- Understanding Force Convergence Solution Output
- whether have the difference between using contact and target bodies
- Colors and Mesh Display
- The solver engine was unable to converge on a solution for the nonlinear problem as constrained.
- Massive amount of memory (RAM) required for solve
- What is the difference between bonded contact region and fixed joint
Top Rated Tags
© 2022 Copyright ANSYS, Inc. All rights reserved.Ansys does not support the usage of unauthorized Ansys software. Please visit www.ansys.com to obtain an official distribution.