Log formats
From CEDPS
Contents |
Goals
This page collects the formats and semantics of selected log formats, with a strong emphasis on HPC "Grid" formats. It also shows the translation to the NetLogger log format, where this exists.
Since log formats change, each entry is formatted to indicate its history. Detailed examples are also provided. See the other log entries or the template at the bottom as a guide for new entries.
Logs
Condor/Pegasus Jobstate Log
Thanks to Karan Vahi from the ISI Pegasus team for this content.
- Date entered: 2009-04-21
- Date modified: 2009-04-21
- Software: Condor-G
- Software version: unknown
- Description: The jobstate.log file records the various states that a Condor job goes through during workflow execution.
- Log entries:
Below is a snippet from the jobstate.log for a single job executed via Condor-G.
1239666049 create_dir_blackdiamond_0_isi_viz SUBMIT 3758.0 isi_viz - 1239666059 create_dir_blackdiamond_0_isi_viz EXECUTE 3758.0 isi_viz - 1239666059 create_dir_blackdiamond_0_isi_viz GLOBUS_SUBMIT 3758.0 isi_viz - 1239666059 create_dir_blackdiamond_0_isi_viz GRID_SUBMIT 3758.0 isi_viz - 1239666064 create_dir_blackdiamond_0_isi_viz JOB_TERMINATED 3758.0 isi_viz - 1239666064 create_dir_blackdiamond_0_isi_viz POST_SCRIPT_STARTED - isi_viz - 1239666069 create_dir_blackdiamond_0_isi_viz POST_SCRIPT_TERMINATED 3758.0 isi_viz - 1239666069 create_dir_blackdiamond_0_isi_viz POST_SCRIPT_SUCCESS - isi_viz -
Each entry in jobstate.log has the following
- the ISO timestamp for the time at which the particular event happened
- the name of the job.
- the event recorded by DAGMan for the job.
- the condor id of the job is the queue on the submit node
- the pegasus site to which the job is mapped
The lifecycle for the job when executed as part of the workflow are as follows
- SUBMIT
- EXECUTE
- GLOBUS_SUBMIT
- GRID_SUBMIT
- JOB_TERMINATED
- POST_SCRIPT_STARTED
- POST_SCRIPT_TERMINATED
- POST_SCRIPT_SUCCESS
NetLogger-parsed version of example above
nl_parser -m jobstate < DATA
ts=2009-04-13T23:40:49.000000Z event=pegasus.jobstate.submit level=Info site.id=isi_viz condor.id=3758.0 comp.id=create_dir_blackdiamond_0_isi_viz ts=2009-04-13T23:40:59.000000Z event=pegasus.jobstate.execute level=Info site.id=isi_viz condor.id=3758.0 comp.id=create_dir_blackdiamond_0_isi_viz ts=2009-04-13T23:40:59.000000Z event=pegasus.jobstate.globus_submit level=Info site.id=isi_viz condor.id=3758.0 comp.id=create_dir_blackdiamond_0_isi_viz ts=2009-04-13T23:40:59.000000Z event=pegasus.jobstate.grid_submit level=Info site.id=isi_viz condor.id=3758.0 comp.id=create_dir_blackdiamond_0_isi_viz ts=2009-04-13T23:41:04.000000Z event=pegasus.jobstate.job_terminated level=Info site.id=isi_viz condor.id=3758.0 comp.id=create_dir_blackdiamond_0_isi_viz ts=2009-04-13T23:41:04.000000Z event=pegasus.jobstate.postscript.start level=Info site.id=isi_viz condor.id=- comp.id=create_dir_blackdiamond_0_isi_viz ts=2009-04-13T23:41:09.000000Z event=pegasus.jobstate.postscript.end level=Info status=0 site.id=isi_viz dur=5.000000 condor.id=3758.0 comp.id=create_dir_blackdiamond_0_isi_viz
Globus-Condor
- Date entered: 2009-04-07
- Date modified: 2009-04-07
- Software: Globus Toolkit
- Software version:
- Description: Job events for Condor-G
- Log entries:
<c> 1. <a n="MyType"><s>JobTerminatedEvent</s></a> 2. <a n="EventTypeNumber"><i>5</i></a> 3. <a n="MyType"><s>JobTerminatedEvent</s></a> 4. <a n="EventTime"><s>2008-02-06T16:45:10</s></a> 5. <a n="Cluster"><i>5</i></a> 6. <a n="Proc"><i>0</i></a> 7. <a n="Subproc"><i>0</i></a> <a n="TerminatedNormally"><b v="t"/></a> <a n="ReturnValue"><i>0</i></a> <a n="RunLocalUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a> <a n="RunRemoteUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a> <a n="TotalLocalUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a> <a n="TotalRemoteUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a> <a n="SentBytes"><r>0.000000000000000E+00</r></a> <a n="ReceivedBytes"><r>0.000000000000000E+00</r></a> <a n="TotalSentBytes"><r>0.000000000000000E+00</r></a> <a n="TotalReceivedBytes"><r>0.000000000000000E+00</r></a> </c>
Items 1, 2, 5, 6, and 7 are in all stanzas
- Name of event type
- Number of event type
- sic. Bug
- Timestamp
5. - 7. Condor jobid = "%03d.%03d.%03d" % (Cluster, Proc, Subproc)
- NetLogger log entries:
Submit event
<c>
<a n="MyType"><s>SubmitEvent</s></a>
<a n="EventTypeNumber"><i>0</i></a>
<a n="MyType"><s>SubmitEvent</s></a>
<a n="EventTime"><s>2008-02-06T16:14:24</s></a>
<a n="Cluster"><i>2</i></a>
<a n="Proc"><i>0</i></a>
<a n="Subproc"><i>0</i></a>
<a n="SubmitHost"><s><131.243.2.182:40152></s></a>
</c>
ts=2008-02-06T16:14:24-08:00 event=globus.condor.submit level=Info job.id=002.000.000 host=131.243.2.182 port=40152
Execute event
<c>
<a n="MyType"><s>ExecuteEvent</s></a>
<a n="EventTypeNumber"><i>1</i></a>
<a n="MyType"><s>ExecuteEvent</s></a>
<a n="EventTime"><s>2008-02-06T16:14:27</s></a>
<a n="Cluster"><i>2</i></a>
<a n="Proc"><i>0</i></a>
<a n="Subproc"><i>0</i></a>
<a n="ExecuteHost"><s><131.243.2.130:47659></s></a>
</c>
ts=2008-02-06T16:14:27-08:00 event=globus.condor.execute level=Info job.id=002.000.000 host=131.243.2.130 port=47659
Job terminated event
<c>
<a n="MyType"><s>JobTerminatedEvent</s></a>
<a n="EventTypeNumber"><i>5</i></a>
<a n="MyType"><s>JobTerminatedEvent</s></a>
<a n="EventTime"><s>2008-02-06T16:14:27</s></a>
<a n="Cluster"><i>2</i></a>
<a n="Proc"><i>0</i></a>
<a n="Subproc"><i>0</i></a>
<a n="TerminatedNormally"><b v="t"/></a>
<a n="ReturnValue"><i>0</i></a>
<a n="RunLocalUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
<a n="RunRemoteUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
<a n="TotalLocalUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
<a n="TotalRemoteUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
<a n="SentBytes"><r>0.000000000000000E+00</r></a>
<a n="ReceivedBytes"><r>0.000000000000000E+00</r></a>
<a n="TotalSentBytes"><r>0.000000000000000E+00</r></a>
<a n="TotalReceivedBytes"><r>0.000000000000000E+00</r></a>
</c>
ts=2008-02-06T16:14:27-08:00 event=globus.condor.jobTerminated level=Info totalLocalUsage.usr.0=0 runRemoteUsage.sys.0=0 totalReceivedBytes=0.000000 totalRemoteUsage.sys.0=0 terminatedNormally=0 runLocalUsage.sys.0=0 runLocalUsage.usr.0=0 returnValue=0 receivedBytes=0.000000 job.id=002.000.000 totalRemoteUsage.usr.0=0 totalLocalUsage.sys.0=0 runRemoteUsage.usr.0=0 sentBytes=0.000000 totalSentBytes=0.000000
Template
Sample log
- Date entered: YYYY-MM-DD -- date this format was first entered
- Date modified: YYYY-MM-DD -- date this format was last modified
- Software: name of software
- Software version: version(s) of software, if known
- Description: General description of log contents
- Log entries:
1. example of a log entry 2. another example unnumbered-lines can be 3. perhaps together these are a coherent interspersed 4. slice of a log file
- Description of first log entry
- More description
- Every numbered
- line should be described.
- NetLogger log entries: NetLogger translation examples
Description of thing being translated
Raw log
NetLogger log
