1. 20 May, 2005 4 commits
    • Christopher J. Morrone's avatar
      Fix for bug reported by Jim Garlick: · db7002d1
      Christopher J. Morrone authored
        "srun output overflow ("Need to rewind" in srun/_do_output_line)"
      
      When srun's stdout is consuming data slowly, srun can receive notice that
      the job has terminated before the output stream has been fully written.
      
      The IO thread will receives a SIGHUP to kick it out of its blocking poll.
      However in the slow stdout situation the SIGHUP can interrupt the
      fflush.  When the fflush is interrupted, it appears to clear the stream
      buffer even though the data wasn't written out to the file descriptor,
      and we see data loss on stdout.
      
      To avoid this situation, this change makes signals to the IO thread
      go over a pipe rather than sending a signal.  Also, some extra return
      code checking is done in io.c:_do_output_line().
      db7002d1
    • Christopher J. Morrone's avatar
      A poll of a open file descriptor for /dev/null always returns a POLLERR · e2f39fe7
      Christopher J. Morrone authored
      under AIX.  This work-around bypasses the eio handling of /dev/null and
      instead directly makes pin[0] a file handle for /dev/null.
      
      The only problem with this fix is that it is not obvious elsewhere in the
      code that pin is not a pipe when using "srun --batch" and stdin is
      /dev/null.
      e2f39fe7
    • Moe Jette's avatar
      Fix slurmd shutdown signal synchronization bug (not consistently · f1f8eee2
      Moe Jette authored
          terminating).
      f1f8eee2
    • Moe Jette's avatar
      Combine two echo commands into one to provide consistency on different · dfefc447
      Moe Jette authored
      systems (different mechanisms to prevent line-feed).
      dfefc447
  2. 19 May, 2005 13 commits
  3. 18 May, 2005 8 commits
  4. 17 May, 2005 4 commits
  5. 16 May, 2005 9 commits
  6. 13 May, 2005 2 commits