Job Output
- At the end of the job's PBS output file, three blocks of information are appended:
- First, the information logged by the server during the last 2 days.
- Second, the information logged by the cpuacct cgroup subsystem for the master and slave nodes. Pay attention to the memory and CPU time used on each node.
- Third, a summary of the resources requested and used. See Accounting for the definition of the metrics.
- Exit codes:
- Exit Code = 0: Job execution was successful.
- Exit Code < 0: This is a special PBS return value indicating that the job could not be executed. See the PBS documentation for more details and contact support.
- Exit Code greater than 0 and below 128 (or 256): This is the exit value of the top process, typically the shell. This may be the exit value of the last command executed in the shell.
- Exit Code >= 128 (or 256): This means the job was killed by a signal. The signal number is the exit code modulo 128 (or 256). For example, an exit status of 143 indicates that the job was killed with SIGTERM (143 - 128 = 15). See the kill(1) man page for signal definitions.
- Do not send the output of your software to the PBS output file. The PBS output file is kept in the /var/spool/PBS/spool directory of the job's master node and is copied back, at the end of the job, to the directory from which the user launched the job.
- Example:
    ----------------- PBS server and MOM logs -----------------
    ------------------frontal2.cenaero.be------------------
    Job: 915424.frontal2
    03/06/2017 19:55:08 S enqueuing into main, state 1 hop 1
    ------------------node0851------------------
    Job: 915424.frontal2
    03/06/2017 19:55:09 M running prologue
    ------------------node0852------------------
    03/06/2017 19:55:09 M JOIN_JOB as node 1
    ------------------------------- Job Information -------------------------------
    Job Owner : coulon@frontal3
    Resources Requested
    Number of Cores per Job - NCPUS_PJOB : 48
    Resources Used
    Total Memory used - MEM : 64963112kb
    For metrics definition, please refer to https://tier1.cenaero.be/en/faq-page
    -------------------------------------------------------------------------------
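The exit code rules listed earlier on this page can be sketched as a small Python helper. This is only an illustration, not a tool provided by PBS or by the cluster: the function name `describe_exit_code` and its `base` parameter (128 or 256, depending on which threshold applies) are hypothetical, and the signal names come from the Python `signal` module on the machine where the script runs.

```python
import signal


def describe_exit_code(code: int, base: int = 128) -> str:
    """Classify a PBS job exit code according to the rules on this page.

    `base` is the signal threshold (128 or 256); it is an assumption of
    this sketch and must match the convention used by your PBS setup.
    """
    if code == 0:
        return "success"
    if code < 0:
        # Special PBS return value: the job could not be executed.
        return "PBS could not execute the job"
    if code < base:
        # Exit value of the top process, typically the shell.
        return f"exit value {code} of the top process"
    # At or above the threshold: the job was killed by a signal.
    signum = code % base
    try:
        name = signal.Signals(signum).name
    except ValueError:
        name = "unknown signal"
    return f"killed by signal {signum} ({name})"


if __name__ == "__main__":
    for example in (0, -1, 2, 143):
        print(example, "->", describe_exit_code(example))
```

For instance, `describe_exit_code(143)` reports signal 15, which matches the SIGTERM case described in the exit code list.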