Understanding event data collection: Part 2

Editor's note: This is the second of a two-part series. Part 1 appeared in the July 2004 issue. Most modern process and manufacturing plants use a computerized maintenance management system (CMMS) to help manage maintenance performed on plant assets. CMMS benefits include: A vital element of these systems is the ability to document history of events that occur within the maintenance work process.
By Ken Latino, Meridium, Inc., Roanoke, VA August 9, 2004

Most modern process and manufacturing plants use a computerized maintenance management system (CMMS) to help manage maintenance performed on plant assets. CMMS benefits include:

  • Baseline of asset information

  • Management of store room inventory

  • Maintenance work planning and scheduling

  • Maintenance history

  • Maintenance reporting.

    • A vital element of these systems is the ability to document history of events that occur within the maintenance work process. Unfortunately, many organizations that use a CMMS either do not use this capability or do not use it to its full capacity. It is important to instill the expectation in the workforce that all work history events will be recorded so that plant personnel can make informed decisions about where and when to use valuable plant resources.

      There should be a documented work process in place to guarantee that work history data are collected regularly. The maintenance work management process requires the collection of critical event data at various steps (Fig. 1).

      Step 1: Work request initiated

      Typically, the first step in the maintenance process is when a work request or notification gets initiated. This usually takes place when an operator, technician, or inspector identifies a problem that requires maintenance attention and initiates a work request to let the maintenance department know that attention is required. Several data elements must be identified and recorded at this point in the maintenance work process. These elements include:

    • Location of problem (functional location)

    • Equipment ID

    • Date the request was initiated

    • Malfunction start date

    • Failure finding codes (operator routine rounds, inspection, etc.)

    • Event type (failure, repair, PM).

      • Step 2: Work request review

        Once a valid work request is created and documented, it is reviewed by operations and maintenance for validation. If the request is not deemed valid, it is rejected and then terminated. If it is deemed valid, then a work order is generated. At this point additional data must be recorded on the work order, which includes:

      • Location of problem (functional location)

      • Equipment ID

      • Activity code

      • Work order creation date.

        • Once the order is created and documented, a decision is made to either plan or not plan the work order. After the order passes through planning (if required), it is released to maintenance. Maintenance performs the work and turns the equipment back over to operations upon completion.

          Step 3: Document the history

          With the repair complete and the knowledge of the work in hand, it is time to record that data. Depending on the CMMS, this is either done on the completed work order or the work request. This is by far the most detailed data recording that occurs, because what the problem was and what action was taken to correct the problem are known. The data elements required for this phase of the work process include:

        • Location ID

        • Equipment ID

        • Malfunction start date

        • Failure finding codes

        • Event type

        • Malfunction end date

        • Maintainable item

        • Damage codes

        • Primary cause

        • Primary activity.

          • Step 4: Closing the work order

            Once the event has been accurately recorded and all of the cost information has been entered and is deemed business complete, the work order can move to a final closing. When this phase is complete, most of the recommended data required for reliability analysis should be available.

            Using effective event recording codes

            Having a work process to collect event information is only the first step in gathering accurate event history. Without a standardized list of codes to use in your event recording, the data are almost impossible to use for analysis. There are various resources for event recording codes that range from company-specific codes to international industry standards — including the one provided by ISO 14224. This is a standard that was developed for the oil and gas industry and was based on work done by the Offshore Reliability Data group OREDA.

            Codes derived from ISO 14224 that are explained in tables within this article include activity, cause, detection method, effect, functional loss, condition, and failure modes.

            This standard focuses on equipment as well as failure and maintenance data. It describes details related to equipment classes, types, and boundaries. With respect to event recording, this standard defines codes, time stamps, and remarks. Equipment classes covered by ISO 14224 include:

          • Combustion engine

          • Compressor

          • Control logic unit

          • Electric generator

          • Electric motor

          • Fire and gas detector

          • Gas turbine

          • Heat exchanger

          • Process sensor

          • Pump

          • Turboexpander

          • Valve

          • Vessel.

            • Within these classes of equipment, there are specific codes that can be used to record equipment events. These codes include:

            • Method of detection

            • Functional loss

            • Failure mode

            • Maintainable item

            • Failure cause

            • Maintenance activity.

              • While these codes and equipment classes are an excellent start, there are additional equipment classes and code categories that are useful in fully documenting an equipment event. Therefore, additional equipment classes are included as supplements to the ISO 14224 standard. They include:

              • Agitator

              • Boiler

              • Fan-Blower

              • Fired Heater

              • Gas Turbine

              • General Equipment

              • Meter

              • Instrumentation

              • Piping

              • NPV (Tank)

              • Relief Device

              • Power Distribution

              • Steam Turbine.

                • Similarly, additional code categories supplement the code categories within ISO 14224:

                • Condition

                • Effect

                  • Conclusion

                    A key element of a successful asset performance process is the collection of event data required for analysis. This is especially true if you consider that without event data it is impossible to determine where your problems reside, what strategies are effective or ineffective, and where you need to focus your resources for the largest improvements.

                    Beyond the ability to measure performance, event data collection gives you the baseline data to perform detailed reliability analysis. These techniques are very powerful when coupled with accurate and complete event data and can drive proactive behavior within the organization. The combination of quality event data, comprehensive analysis, and disciplined follow-through can be the catalysts to meeting your plant’s strategic goals.

                    Explanation of activity codes

                    Code Description
                    ADJ Adjust
                    CHK Check
                    CMB Combination
                    INS Inspection
                    MOD Modify
                    OTH Other
                    OVH Overhaul
                    REP Repair
                    RFT Refit
                    RPL Replace
                    SVC Service
                    TST Test

                    Explanation of cause codes

                    Code Description
                    CAP Improper capacity
                    DOC Documentation error
                    DSN Improper design
                    EXP Expected wear and tear
                    FAB Fabrication error
                    INS Installation error
                    MGT Management error
                    MNT Maintenance error
                    MTL Improper material
                    OPS Operating error
                    SRV Off-design service
                    UNK Unknown

                    Explanation of detection method codes

                    Code Description
                    CAP Improper
                    CCM Continuous condition monitoring
                    CMB Combination
                    CMT Corrective maintenance
                    FTS Functional testing
                    INS Inspection
                    OBS Observation
                    OTH Other
                    PCM Periodic condition monitoring
                    PMT Preventive maintenance
                    PRD Production interference

                    Explanation of effect codes

                    Code Description
                    ENV Environmental
                    NON None
                    OPS Operations
                    SAF Safety

                    Explanation of functional loss codes

                    Code Description
                    COM Complete
                    NON None
                    PAR Partial
                    POT Potential

                    Explanation of condition codes

                    Code Description
                    BLP Blockage/plugged
                    BRK Breakage
                    CAV Cavitation
                    CTM Contamination
                    COR Corrosion
                    CRK Cracked
                    DIR Dirty
                    ERO Erosion
                    FVI Fatigue, vibration
                    IMB Imbalance
                    ALN Improper alignment
                    CLE Improper clearance
                    FIT Improper fit
                    LUB Improper lubrication
                    RUN Improper runout
                    MTL Incorrect material
                    LKG Leakage
                    LOS Looseness
                    SZD Seized
                    STK Sticking
                    VIB Vibration/noise
                    WER Wear

                    Explanation of failure mode codes

                    Code Description
                    STP Failed to stop on demand
                    BRD Breakdown
                    LOO Low output
                    ERO Erratic output
                    ELP External leakage process medium
                    VIB Vibration
                    NOI Noise
                    OTH Other (specify in text field)
                    FTR Failed to regulate
                    FTS Failed to start on demand
                    SPS Spurious stop/unexpected shutdown
                    HIO High output
                    ELU External leakage utility medium
                    INL Internal leakage
                    OHE Overheating
                    OWD Operation without demand
                    OOC Out of compliance
                    PDE Parameter deviation
                    PLU Plugged/choked
                    AIR Abnormal instrument reading
                    STD Structural deficiency
                    SER Minor-in-service problems
                    UNK Unknown
                    Author Information
                    Ken Latino is a senior consultant in Meridium, Inc., Asset Performance Management Consulting Group, a root cause analysis practitioner, frequent speaker on the topic at conferences, and an educator. He is author of several root cause analysis courses, books, and trade magazine articles. Ken designed a software program that assists analysts in conducting a disciplined root cause analysis. He can be reached at 540-344-9205 ext. 1176 or KLatino@meridium.com .