Understanding event data collection: Part 1

Editor's note: This is the first of a two-part series on the importance of effective event data collection for reliability analysis. The second part will appear in the August issue. Collecting event data is a minimum requirement to measure the effectiveness of your current asset management strategies.


Collecting event data is a minimum requirement to measure the effectiveness of your current asset management strategies. Without this information one tends to allocate resources on the "problem of the day." This means you probably do not have a systematic approach to removing defects from your operation. Having a complete set of event data for all asset events provides a much clearer understanding of how to get the greatest return from your human and capital resources.

Sources of event data

Event data can come from a variety of sources such as a maintenance management system, predictive and inspection systems, as well as production systems. For this discussion, we will focus our attention primarily on collecting event data related to equipment that resides in a computerized maintenance management system (CMMS).

It is important to understand the reasoning behind the data collection effort before getting into the details of how it is actually accomplished. The collection of event data has a double benefit. The primary benefit of comprehensive event data is to alert process owners as to whether their asset strategies are effective. Once ineffective strategies are identified, the same event data are used to drill down and determine the causes of the ineffective strategies.

What is reliability event data?

Reliability event data can comprise many items within many contexts. For purposes of this discussion criteria of reliability event data include:

  • Work events that occur on equipment

  • Type of work performed

  • Conditions found at the time of work

  • Technical findings after work is completed

  • Dates/time associated with the work

  • Cost associated with performing the work.

    • When an event occurs on a piece of equipment, it is critical to record what type of event actually occurred. For example, was it a failure, repair, or preventive maintenance (PM)? What was the condition of the equipment at the time of the event? Some of the most critical information to be included in the recording of any event are the date and time stamps related to the event and the costs associated with that event — for example, labor, material, contractor, and production losses (see "Recommended event data to collect").

      After the work is completed, it is necessary to record the technical finding such as the failed item, failure mode, cause, and several other data elements discussed in further detail in Part 2 of this article in the August issue.

      A company's balanced scorecard is composed of standardized, enterprise-wide performance measurements related to production assets. A balanced scorecard provides a holistic view of key performance indicators (KPIs) spanning multiple plants and allows management to make strategic, fact-based decisions with greater confidence.

      Author Information
      Ken Latino is a senior consultant in Meridium, Inc. Asset Performance Management Consulting Group, a root cause analysis practitioner, frequent speaker on the topic at conferences, and an educator. He is author of several root cause analysis courses, books, and trade magazine articles. Ken designed a software program that assists analysts in conducting a disciplined root cause analysis. He can be reached at 540-344-9205 ext. 1176 or KLatino@meridium.com .

      Recommended event data to collect

      These lists include the data and supplemental descriptions that are recommended to be collected on a given event. These data will be used as the basis for compiling a balanced scorecard as well the information required to find the underlying causes.


      Event ID — the unique identifier for each failure event

      CMMS ID — useful if you are using a CMMS as the base data collection system for failure events

      Functional location — typically a "smart" ID that represents what function takes place at a given location (for example, pump 01-G-0001 must move liquid X from point A to point B)

      Functional location hierarchy — functional hierarchy to roll up metrics at various levels:

      Level 1

      Level 2 Level n (System)

      Level 3

      Equipment ID — usually a randomly generated ID that reflects the asset that is in service at the functional location. The reason for a separate equipment ID and functional Location is because assets can move from place to place and functional locations

      Equipment name — name or description of equipment for identification purposes

      Equipment category (for example, rotating) — indicates the category of equipment the work was performed on — generally organized by discipline (rotating, fixed, electrical, instrument)

      Equipment class (for example, pump) — indicates the class of equipment on which the work was performed. Failure codes can depend on this value

      Equipment type (for example, centrifugal) — indicates the type of equipment on which the work was performed. Failure codes can depend on this value also


      Functional loss — indicates whether the equipment experienced a functional loss as part of this event. A functional loss can be defined as:

      Complete loss of function

      Partial loss of function

      Potential loss of function

      Functional failure (ISO failure mode) — basically the symptoms of a failure if one has occurred. Any physical asset is installed to fulfill a number of functions. The functional failure describes which function the asset no longer is able to fulfill

      Effect — the effect of the event on production, safety, environmental, or quality

      Maintainable item — the actual component identified as causing the asset to lose its ability to operate (for example, bearing)

      Condition — indicates the type of damage found to the maintainable item. In some cases this also tends to indicate a failure mechanism as well

      Cause — the general cause of the condition. This is not the root cause; it is recommended to use root cause failure analysis (RCFA) to assess root causes

      Maintenance action — corrective action performed to mitigate the damaged item

      Narrative — text description of work and suggestions for improvements


      Event date — the date that the event was first observed and documented

      Mechanically unavailable date/time — the date/time that the equipment is actually taken out of service either due to a failure or for the repair work

      Mechanically available date/time — the date/time that the equipment is available for service after the repair work is completed

      Mechanical downtime — difference between mechanically unavailable date and mechanically available date expressed in hours

      Maintenance start date/time — the date/time that the equipment is actually being worked on by maintenance

      Maintenance end date/time — the date/time that the equipment repair is completed

      Time to repair — the total maintenance time to repair the equipment


      Maintenance cost — the total maintenance expenditure to rectify the failure. This could be company or contractor cost. This cost could be organized into categories, such as material, labor, contractor, etc.

      Production cost

The Top Plant program honors outstanding manufacturing facilities in North America. View the 2017 Top Plant.
The Product of the Year program recognizes products newly released in the manufacturing industries.
Each year, a panel of Control Engineering and Plant Engineering editors and industry expert judges select the System Integrator of the Year Award winners in three categories.
SCCR, 2018 Maintenance study, and VFDs in a washdown environment.
Welding ergonomics, 2017 Salary Survey, and surge protection
2017 Top Plant winner, Best practices, Plant Engineering at 70, Top 10 stories of 2017
Product of the Year winners, Pattern recognition, Engineering analytics, Revitalize older pump installations
Control room technology innovation; Practical approaches to corrosion protection; Pipeline regulator revises quality programs
The cloud, mobility, and remote operations; SCADA and contextual mobility; Custom UPS empowering a secure pipeline
Setting internal automation standards
Knowing how and when to use parallel generators
PID controllers, Solar-powered SCADA, Using 80 GHz radar sensors

Annual Salary Survey

Before the calendar turned, 2016 already had the makings of a pivotal year for manufacturing, and for the world.

There were the big events for the year, including the United States as Partner Country at Hannover Messe in April and the 2016 International Manufacturing Technology Show in Chicago in September. There's also the matter of the U.S. presidential elections in November, which promise to shape policy in manufacturing for years to come.

But the year started with global economic turmoil, as a slowdown in Chinese manufacturing triggered a worldwide stock hiccup that sent values plummeting. The continued plunge in world oil prices has resulted in a slowdown in exploration and, by extension, the manufacture of exploration equipment.

Read more: 2015 Salary Survey

Maintenance and reliability tips and best practices from the maintenance and reliability coaches at Allied Reliability Group.
The One Voice for Manufacturing blog reports on federal public policy issues impacting the manufacturing sector. One Voice is a joint effort by the National Tooling and Machining...
The Society for Maintenance and Reliability Professionals an organization devoted...
Join this ongoing discussion of machine guarding topics, including solutions assessments, regulatory compliance, gap analysis...
IMS Research, recently acquired by IHS Inc., is a leading independent supplier of market research and consultancy to the global electronics industry.
Maintenance is not optional in manufacturing. It’s a profit center, driving productivity and uptime while reducing overall repair costs.
The Lachance on CMMS blog is about current maintenance topics. Blogger Paul Lachance is president and chief technology officer for Smartware Group.
The maintenance journey has been a long, slow trek for most manufacturers and has gone from preventive maintenance to predictive maintenance.
This digital report explains how plant engineers and subject matter experts (SME) need support for time series data and its many challenges.
This digital report will explore several aspects of how IIoT will transform manufacturing in the coming years.
Maintenance Manager; California Oils Corp.
Associate, Electrical Engineering; Wood Harbinger
Control Systems Engineer; Robert Bosch Corp.
This course focuses on climate analysis, appropriateness of cooling system selection, and combining cooling systems.
This course will help identify and reveal electrical hazards and identify the solutions to implementing and maintaining a safe work environment.
This course explains how maintaining power and communication systems through emergency power-generation systems is critical.
click me