RADIOSS Run on Cluster
Hi,
I am trying to run RADIOSS on my group's local cluster. I know for a fact my simulation runs, however it is getting blocked from running. My system administrator thinks that this is because the cluster queuing is running the starter file, but then when the starter initiates the engine file, it is blocked by the queuing system. Is there a way to tell RADIOSS to split the starter and engine jobs to run separately? I realise this is quite the arbitrary request, so if you have any questions let me know. I'm not really sure where to start with this, and am looking for someone with experience setting up RADIOSS on a cluster.
Thanks,
Ayman
Answers
-
Check these forum posts on similar topics
Also, check Radioss online help documentation for more details regarding Radioss parallelization
0 -
Hi Ayman,
It is hard to tell the issue without more details. Is this Linux or Windows? queuing software? submission command used?
The easiest method way to submit a Radioss job is to use the radioss script provided in the installation. This script will run the starter and then the engine and should look like one job to your queuing system.
Linux:
<install_dir>/altair/scripts/radioss 'filename' -option argument
To run from a Windows DOS prompt, type the following:<install_dir>\hwsolvers\scripts\radioss.bat 'filename' -option argument
An example on Linux which uses some additional option would be:
/altair/hw/2018/altair/scripts/radioss -v 2019 phone_0000.rad -mpi i -np 24 -nt 1 -hostfile /var/spool/PBS/aux/92305.admin -mpiargs -genv KMP_AFFINITY=scatter -genv I_MPI_PIN_DOMAIN=auto -genv I_MPI_ADJUST_BCAST=1 -genv I_MPI_ADJUST_REDUCE=2 -genv I_MPI_MPIRUN_CLEANUP=1 -genv KMP_STACKSIZE=400m -genv I_MPI_FABRICS=shm:dapl -noh3d
Thanks,
Andy
0 -
EDIT: The problems below the line have been solved. It was some stuff wrong with my .sh, which I would be happy to elaborate on if anyone wants to know, but it is really specific to my set-up.
_________________________________________________________________________________________________________________________________________________
Thank you both for your replies. I am running it on a local cluster group with 24 cores, which runs on Linux, but I am using MobaXterm through MS Windows to do everything. I have attached this .zip file with everything I'm using to run it. For some reason, my cluster is refusing to accept the job, and has the error:
error: executing task of job 9832 failed: execution daemon on host 'c30m8.local' didn't accept task
error: executing task of job 9832 failed: execution daemon on host 'c27m8.local' didn't accept task
error: executing task of job 9832 failed: execution daemon on host 'c6m8.local' didn't accept task
My system administrator ran it from his end and says everything is fine, and it was. I saw it running and it matched exactly with the results I am using on my PC, however when I run it through my own account, I get those errors.
Any advice?
0