Skip to content
GitLab
Projects Groups Topics Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Sign in
  • autosubmit autosubmit
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributor statistics
    • Graph
    • Compare revisions
  • Issues 338
    • Issues 338
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 21
    • Merge requests 21
  • Deployments
    • Deployments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Commits
  • Issue Boards
Collapse sidebar
  • Earth SciencesEarth Sciences
  • autosubmitautosubmit
  • Issues
  • #194
Closed
Open
Issue created Sep 22, 2016 by Jordi Cuadrado Borbonés@jcuadradDeveloper

Job not re-submitted after error in AS3.7.3

Hi, While executing an experiment with AS3.7.3 I received the ``[Autosubmit] The job a007_20130101_000_2_SIM status has changed to FAILED```notification (job failed because of a design error in the job, when multiple jobs access and copy the same template file concurrently).

I expected autosubmit to re-submit the job once but it didn't happen.

The configuration was:
RETRIALS = 1 (in /esnas/autosubmit-nmmb/a007/conf/autosubmit_a007.conf) [SIM] FILE = templates/05_sim.sh DEPENDENCIES = PREPROC_VAR RUNNING = chunk PROCESSORS = 264 WALLCLOCK = 01:10 NOTIFY_ON = FAILED (in /esnas/autosubmit-nmmb/a007/conf/jobs_a007.conf) So RETRIALS was not overwritten at JOB level.

At the end of all other jobs, this was the trace in the LOGS folder:

bsc32220@login2:/gpfs/scratch/bsc32/bsc32220/a007/LOG_a007> ll *2_SIM*
-rw-r--r-- 1 bsc32220 bsc32   104 Sep 22 10:48 a007_20130101_000_2_SIM_2266823.err
-rw-r--r-- 1 bsc32220 bsc32  3899 Sep 22 10:48 a007_20130101_000_2_SIM_2266823.out
-rw-r--r-- 1 bsc32220 bsc32 14650 Sep 22 10:46 a007_20130101_000_2_SIM.cmd
-rw-r--r-- 1 bsc32220 bsc32    11 Sep 22 10:48 a007_20130101_000_2_SIM_STAT  

Confirming that it was submitted just once.
Thanks!

Assignee
Assign to
Time tracking