RADIOSS Run on Cluster

Crashphys
Crashphys Altair Community Member
edited October 2020 in Community Q&A

Hi,

 

I am trying to run RADIOSS on my group's local cluster. I know for a fact my simulation runs, however it is getting blocked from running. My system administrator thinks that this is because the cluster queuing is running the starter file, but then when the starter initiates the engine file, it is blocked by the queuing system. Is there a way to tell RADIOSS to split the starter and engine jobs to run separately? I realise this is quite the arbitrary request, so if you have any questions let me know. I'm not really sure where to start with this, and am looking for someone with experience setting up RADIOSS on a cluster.

 

Thanks,

Ayman

Tagged:

Answers

  • Pranav Hari
    Pranav Hari Altair Community Member
    edited October 2020

    Hi aymanalsukhon

     

    Check these forum posts on similar topics

     

    https://community.altair.com/community?id=community_question&sys_id=bb560cf61b2bd0908017dc61ec4bcbfd

     

    https://community.altair.com/community?id=community_question&sys_id=734644b61b2bd0908017dc61ec4bcb09

     

    Also, check Radioss online help documentation for more details regarding Radioss parallelization 

  • Andy_20955
    Andy_20955 New Altair Community Member
    edited May 2019

    Hi Ayman,

    It is hard to tell the issue without more details.  Is this Linux or Windows? queuing  software?  submission command used?

     

    The easiest method way to submit a Radioss job is to use the radioss script provided in the installation.  This script will run the starter and then the engine and should look like one job to your queuing system.

    Linux:

     <install_dir>/altair/scripts/radioss  'filename' -option argument
    To run from a Windows DOS prompt, type the following:
     <install_dir>\hwsolvers\scripts\radioss.bat 'filename' -option argument

    An example on Linux which uses some additional option would be:

     

    /altair/hw/2018/altair/scripts/radioss -v 2019 phone_0000.rad -mpi i -np 24 -nt 1 -hostfile /var/spool/PBS/aux/92305.admin -mpiargs -genv KMP_AFFINITY=scatter -genv I_MPI_PIN_DOMAIN=auto -genv I_MPI_ADJUST_BCAST=1 -genv I_MPI_ADJUST_REDUCE=2 -genv I_MPI_MPIRUN_CLEANUP=1 -genv KMP_STACKSIZE=400m -genv I_MPI_FABRICS=shm:dapl -noh3d
     

    Thanks,

    Andy

     

     

  • Crashphys
    Crashphys Altair Community Member
    edited May 2019

    EDIT: The problems below the line have been solved. It was some stuff wrong with my .sh, which I would be happy to elaborate on if anyone wants to know, but it is really specific to my set-up.

     

    _________________________________________________________________________________________________________________________________________________

    Thank you both for your replies. I am running it on a local cluster group with 24 cores, which runs on Linux, but I am using MobaXterm through MS Windows to do everything. I have attached this .zip file with everything I'm using to run it. For some reason, my cluster is refusing to accept the job, and has the error:

     

    error: executing task of job 9832 failed: execution daemon on host 'c30m8.local' didn't accept task
    error: executing task of job 9832 failed: execution daemon on host 'c27m8.local' didn't accept task
    error: executing task of job 9832 failed: execution daemon on host 'c6m8.local' didn't accept task
     

    My system administrator ran it from his end and says everything is fine, and it was. I saw it running and it matched exactly with the results I am using on my PC, however when I run it through my own account, I get those errors. 

     

    Any advice?

    Unable to find an attachment - read this blog