Fluids

Module file error

    • vrp
      Subscriber
      I am trying to run Fluent on a cluster (CentOS), but I am getting a modulefile error. I have 128 HPC licenses and am trying to use 64 cores.
      Below is the error from the log file:

      ModuleCmd_Load.c(213):ERROR:105:Unable to locate a modulefile for 'Ansys/ansys-2020.R2'
      ModuleCmd_Load.c(213):ERROR:105:Unable to locate a modulefile for 'Ansys/ansys-2020.R2'
      /home/vrp1/.bashrc:line 4:module:command not found

      And here is the .trn:
      /home/ansys_inc/v202/fluent/fluent20.2.0/bin/fluent -r20.2.0 3ddp -pmpi-auto-selected -host -t64 -mpi=intel -cnf=/var/spool/torque/aux/6592.umkc-hpc -path/home/ansys_inc/v202/fluent -ssh -cx node001:46141:38757
      Starting /home/ansys_inc/v202/fluent/fluent20.2.0/lnamd64/3ddp_host/fluent.20.2.0 host -cx node001:46141:38757 (list (rpsetvar (QUOTE parallel/function) fluent 3ddp -flux -node -r20.2.0 -t64 -pmpi-auto-selected -mpi=intel -cnf=/var/spool/torque/aux/6592.umkc-hpc -ssh) (rpsetvar (QUOTE parallel/rhost) ) (rpsetvar (QUOTE parallel/ruser) ) (rpsetvar (QUOTE parallel/nprocs_string) 64) (rpsetvar (QUOTE parallel/auto-spawn?) #t) (rpsetvar (QUOTE parallel/trace-level) 0) (rpsetvar (QUOTE parallel/remote-shell) 1) (rpsetvar (QUOTE parallel/path) /home/ansys_inc/v202/fluent) (rpsetvar (QUOTE parallel/hostsfile) /var/spool/torque/aux/6592.umkc-hpc) )

             Welcome to ANSYS Fluent 2020 R2

             Copyright 1987-2020 ANSYS, Inc. All Rights Reserved.
             Unauthorized use, distribution or duplication is prohibited.
             This product is subject to U.S. laws governing export and re-export.
             For full Legal Notice, see documentation.

      Build Time:May 29 2020 10:12:52 EDT Build Id:10176

         --------------------------------------------------------------
         This is an academic version of ANSYS FLUENT. Usage of this product
         license is limited to the terms and conditions specified in your ANSYS
         license form, additional terms section.
         --------------------------------------------------------------
      Host spawning Node 0 on machine node001 (unix).
      /home/ansys_inc/v202/fluent/fluent20.2.0/bin/fluent -r20.2.0 3ddp -flux -node -t64 -pmpi-auto-selected -mpi=intel -cnf=/var/spool/torque/aux/6592.umkc-hpc -ssh -mport 172.16.0.1:172.16.0.1:39254:0
      Starting /home/ansys_inc/v202/fluent/fluent20.2.0/multiport/mpi/lnamd64/intel/bin/mpirun -f /tmp/fluent-appfile.vrp1.15903 --rsh=ssh -genv I_MPI_FALLBACK_DEVICE enable -genv FLUENT_ARCH lnamd64 -genv I_MPI_DEBUG 0 -genv I_MPI_PIN disable -genv I_MPI_ADJUST_REDUCE 2 -genv I_MPI_ADJUST_ALLREDUCE 2 -genv I_MPI_ADJUST_BCAST 1 -genv I_MPI_ADJUST_BARRIER 2 -genv I_MPI_ADJUST_ALLGATHER 2 -genv I_MPI_ADJUST_GATHER 2 -genv I_MPI_ADJUST_ALLTOALL 1 -genv I_MPI_ADJUST_SCATTER 2 -genv I_MPI_PLATFORM auto -genv PYTHONHOME /home/ansys_inc/v202/fluent/fluent20.2.0/../../commonfiles/CPython/3_7/linx64/Release/python -genv FLUENT_PROD_DIR /home/ansys_inc/v202/fluent/fluent20.2.0 -genv KMP_AFFINITY disabled -genv TMI_CONFIG /home/ansys_inc/v202/fluent/fluent20.2.0/multiport/mpi/lnamd64/intel/etc/tmi.conf -machinefile /tmp/fluent-appfile.vrp1.15903 -np 64 /home/ansys_inc/v202/fluent/fluent20.2.0/lnamd64/3ddp_node/fluent_mpi.20.2.0 node -mpiw intel -pic mpi-auto-selected -mport 172.16.0.1:172.16.0.1:39254:0
      /home/vrp1/.bashrc:line 4:module:command not found
      node006:SCM:56a5:ccf5bc40:61 us(61 us): open_hca:device mlx4_0 not found
      node006:SCM:56ab:c335fc40:58 us(58 us): open_hca:device mlx4_0 not found
      node006:SCM:5696:e36ffc40:62 us(62 us):node006:SCM:56a2:74819c40:63 us(63 us): open_hca:device mlx4_0 not found
       open_hca:device mlx4_0 not found
      node006:SCM:56ab:c335fc40:61 us(61 us):node006:SCM:56a5:ccf5bc40:60 us(60 us): open_hca:device mlx4_0 not found
       open_hca:device mlx4_0 not found
      node006:SCM:56a2:74819c40:60 us(60 us): open_hca:device mlx4_0 not found
      node006:SCM:5696:e36ffc40:58 us(58 us): open_hca:device mlx4_0 not found
      node006:SCM:569e:994b7c40:61 us(61 us): open_hca:device mlx4_0 not found
      node006:CMA:56a5:ccf5bc40:38 us(38 us): open_hca:getaddr_netdev ERROR:No such device. Is ib0 configured?
      node006:CMA:56ab:c335fc40:37 us(37 us): open_hca:getaddr_netdev ERROR:No such device. Is ib0 configured?
      node006:SCM:569e:994b7c40:61 us(61 us): open_hca:device mlx4_0 not found
      node006:CMA:56a2:74819c40:40 us(40 us):node006:CMA:5696:e36ffc40:37 us(37 us): open_hca:getaddr_netdev ERROR:No such device. Is ib0 configured?
       open_hca:getaddr_netdev ERROR:No such device. Is ib0 configured?
      node006:CMA:56a5:ccf5bc40:38 us(38 us): open_hca:getaddr_netdev ERROR:No such device. Is ib1 configured?
      node006:CMA:56ab:c335fc40:36 us(36 us): open_hca:getaddr_netdev ERROR:No such device. Is ib1 configured?
      node006:CMA:569e:994b7c40:39 us(39 us): open_hca:getaddr_netdev ERROR:No such device. Is ib0 configured?
      node006:CMA:5696:e36ffc40:38 us(38 us):node006:CMA:56a2:74819c40:37 us(37 us): open_hca:getaddr_netdev ERROR:No such device. Is ib1 configured?
       open_hca:getaddr_netdev ERROR:No such device. Is ib1 configured?
      node006:SCM:56a5:ccf5bc40:60 us(60 us):node006:SCM:56ab:c335fc40:57 us(57 us): open_hca:device mthca0 not found
       open_hca:device mthca0 not found
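
The "module: command not found" message above comes from line 4 of ~/.bashrc, which runs in shells where the `module` function has not been defined (for example, the non-interactive SSH shells that Fluent spawns on each node). A minimal sketch of a guard for that line follows, assuming the modulefile name from the log. The function name `load_ansys_module` is hypothetical. This only avoids the "command not found" noise; it does not address the "Unable to locate a modulefile" error or the open_hca errors.

```shell
# Hypothetical replacement for the `module load` line in ~/.bashrc.
load_ansys_module() {
    # `module` is usually a shell function set up by the Environment Modules
    # init script; only call it when this shell actually defines it.
    if command -v module >/dev/null 2>&1; then
        module load "$1"
    fi
}

# Modulefile name copied from the error log; adjust to what the cluster provides.
load_ansys_module Ansys/ansys-2020.R2
```
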
    • Karthik R
      Administrator
      Hello,
      Could you please try using Ethernet? This looks like an InfiniBand issue to me. Use -pethernet in your command line.
      Thank you.
      Karthik
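
As a rough illustration of the suggestion above, the sketch below assembles a launch line with the interconnect flag set to Ethernet. The flags `3ddp`, `-t64`, `-mpi=intel`, `-cnf=` and `-ssh` are copied from the transcript in the question; the helper name `build_fluent_cmd`, the `hosts.txt` fallback, and the assumption that `fluent` is on the PATH are illustrative only. The script prints the command rather than running it.

```shell
# Sketch: build (but do not run) a Fluent command line for a given interconnect.
build_fluent_cmd() {
    local ncores="$1" hostfile="$2" interconnect="$3"
    printf 'fluent 3ddp -t%s -mpi=intel -p%s -cnf=%s -ssh' \
        "$ncores" "$interconnect" "$hostfile"
}

# Under Torque, PBS_NODEFILE points at the per-job host list;
# hosts.txt is a hypothetical fallback for running outside a job.
echo "$(build_fluent_cmd 64 "${PBS_NODEFILE:-hosts.txt}" ethernet)"
```
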
    • vrp
      Subscriber
      Hi Karthik,
      Thank you so much for your response. I changed the flag from -pinfiniband to -pethernet and it solved my issue.
    • Karthik R
      Administrator
      Perfect! Let me mark this as answered.
      Good luck!
      Thanks.
      Karthik
    • vrp
      Subscriber
      Hi Karthik,
      I noticed that changing the flag from -pinfiniband to -pethernet solved my issue, but my simulation seems to have slowed down. Do you have any other solution?
      Below is the error log:
      ModuleCmd_Load.c(213):ERROR:105: Unable to locate a modulefile for 'Ansys/ansys-2020.R2'
      /home/vrp1/.bashrc: line 4: module: command not found
      Fatal error in MPI_Init: Other MPI error, error stack:
      MPIR_Init_thread(805).................: fail failed
      MPID_Init(1859).......................: channel initialization failed
      MPIDI_CH3_Init(147)...................: fail failed
      dapl_rc_setup_all_connections_20(1394): generic failure with errno = 872598799
      getConnInfoKVS(956)...................: PMI_KVS_Get failed
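
The open_hca messages earlier in the thread report that node006 could not find devices such as mlx4_0 or interfaces ib0/ib1. One way to see whether a node exposes any InfiniBand device at all is to look at the Linux sysfs directory /sys/class/infiniband. The sketch below does only that, assuming a Linux node; the function name `pick_interconnect` is hypothetical, and a device being listed does not by itself mean its port is active or that Fluent will run faster over it.

```shell
# Sketch: report which interconnect looks usable on the node this runs on.
pick_interconnect() {
    # Optional argument overrides the sysfs directory (useful for testing).
    local ib_dir="${1:-/sys/class/infiniband}"
    # A non-empty directory means the kernel has registered at least one IB device.
    if [ -d "$ib_dir" ] && [ -n "$(ls -A "$ib_dir" 2>/dev/null)" ]; then
        echo infiniband
    else
        echo ethernet
    fi
}

pick_interconnect
```

Running this on each host in the job's node list (for example over ssh) would show whether only some nodes, such as node006 in the log, lack an InfiniBand device.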