Understanding event data collection: Part 2
Editor's note: This is the second of a two-part series. Part 1 appeared in the July 2004 issue. Most modern process and manufacturing plants use a computerized maintenance management system (CMMS) to help manage maintenance performed on plant assets. CMMS benefits include: A vital element of these systems is the ability to document history of events that occur within the maintenance work process.
Most modern process and manufacturing plants use a computerized maintenance management system (CMMS) to help manage maintenance performed on plant assets. CMMS benefits include:
-
Baseline of asset information
-
Management of store room inventory
-
Maintenance work planning and scheduling
-
Maintenance history
-
Maintenance reporting.
-
A vital element of these systems is the ability to document history of events that occur within the maintenance work process. Unfortunately, many organizations that use a CMMS either do not use this capability or do not use it to its full capacity. It is important to instill the expectation in the workforce that all work history events will be recorded so that plant personnel can make informed decisions about where and when to use valuable plant resources.
There should be a documented work process in place to guarantee that work history data are collected regularly. The maintenance work management process requires the collection of critical event data at various steps (Fig. 1).
Step 1: Work request initiated
Typically, the first step in the maintenance process is when a work request or notification gets initiated. This usually takes place when an operator, technician, or inspector identifies a problem that requires maintenance attention and initiates a work request to let the maintenance department know that attention is required. Several data elements must be identified and recorded at this point in the maintenance work process. These elements include:
-
Location of problem (functional location)
-
Equipment ID
-
Date the request was initiated
-
Malfunction start date
-
Failure finding codes (operator routine rounds, inspection, etc.)
-
Event type (failure, repair, PM).
-
Step 2: Work request review
Once a valid work request is created and documented, it is reviewed by operations and maintenance for validation. If the request is not deemed valid, it is rejected and then terminated. If it is deemed valid, then a work order is generated. At this point additional data must be recorded on the work order, which includes:
-
Location of problem (functional location)
-
Equipment ID
-
Activity code
-
Work order creation date.
-
Once the order is created and documented, a decision is made to either plan or not plan the work order. After the order passes through planning (if required), it is released to maintenance. Maintenance performs the work and turns the equipment back over to operations upon completion.
Step 3: Document the history
With the repair complete and the knowledge of the work in hand, it is time to record that data. Depending on the CMMS, this is either done on the completed work order or the work request. This is by far the most detailed data recording that occurs, because what the problem was and what action was taken to correct the problem are known. The data elements required for this phase of the work process include:
-
Location ID
-
Equipment ID
-
Malfunction start date
-
Failure finding codes
-
Event type
-
Malfunction end date
-
Maintainable item
-
Damage codes
-
Primary cause
-
Primary activity.
-
Step 4: Closing the work order
Once the event has been accurately recorded and all of the cost information has been entered and is deemed business complete, the work order can move to a final closing. When this phase is complete, most of the recommended data required for reliability analysis should be available.
Using effective event recording codes
Having a work process to collect event information is only the first step in gathering accurate event history. Without a standardized list of codes to use in your event recording, the data are almost impossible to use for analysis. There are various resources for event recording codes that range from company-specific codes to international industry standards — including the one provided by ISO 14224. This is a standard that was developed for the oil and gas industry and was based on work done by the Offshore Reliability Data group OREDA.
Codes derived from ISO 14224 that are explained in tables within this article include activity, cause, detection method, effect, functional loss, condition, and failure modes.
This standard focuses on equipment as well as failure and maintenance data. It describes details related to equipment classes, types, and boundaries. With respect to event recording, this standard defines codes, time stamps, and remarks. Equipment classes covered by ISO 14224 include:
-
Combustion engine
-
Compressor
-
Control logic unit
-
Electric generator
-
Electric motor
-
Fire and gas detector
-
Gas turbine
-
Heat exchanger
-
Process sensor
-
Pump
-
Turboexpander
-
Valve
-
Vessel.
-
Within these classes of equipment, there are specific codes that can be used to record equipment events. These codes include:
-
Method of detection
-
Functional loss
-
Failure mode
-
Maintainable item
-
Failure cause
-
Maintenance activity.
-
While these codes and equipment classes are an excellent start, there are additional equipment classes and code categories that are useful in fully documenting an equipment event. Therefore, additional equipment classes are included as supplements to the ISO 14224 standard. They include:
-
Agitator
-
Boiler
-
Fan-Blower
-
Fired Heater
-
Gas Turbine
-
General Equipment
-
Meter
-
Instrumentation
-
Piping
-
NPV (Tank)
-
Relief Device
-
Power Distribution
-
Steam Turbine.
-
Similarly, additional code categories supplement the code categories within ISO 14224:
-
Condition
-
Effect
-
Conclusion
A key element of a successful asset performance process is the collection of event data required for analysis. This is especially true if you consider that without event data it is impossible to determine where your problems reside, what strategies are effective or ineffective, and where you need to focus your resources for the largest improvements.
Beyond the ability to measure performance, event data collection gives you the baseline data to perform detailed reliability analysis. These techniques are very powerful when coupled with accurate and complete event data and can drive proactive behavior within the organization. The combination of quality event data, comprehensive analysis, and disciplined follow-through can be the catalysts to meeting your plant’s strategic goals.
Explanation of activity codes
Code Description ADJ Adjust CHK Check CMB Combination INS Inspection MOD Modify OTH Other OVH Overhaul REP Repair RFT Refit RPL Replace SVC Service TST Test Explanation of cause codes
Code Description CAP Improper capacity DOC Documentation error DSN Improper design EXP Expected wear and tear FAB Fabrication error INS Installation error MGT Management error MNT Maintenance error MTL Improper material OPS Operating error SRV Off-design service UNK Unknown Explanation of detection method codes
Code Description CAP Improper CCM Continuous condition monitoring CMB Combination CMT Corrective maintenance FTS Functional testing INS Inspection OBS Observation OTH Other PCM Periodic condition monitoring PMT Preventive maintenance PRD Production interference Explanation of effect codes
Code Description ENV Environmental NON None OPS Operations SAF Safety Explanation of functional loss codes
Code Description COM Complete NON None PAR Partial POT Potential Explanation of condition codes
Code Description BLP Blockage/plugged BRK Breakage CAV Cavitation CTM Contamination COR Corrosion CRK Cracked DIR Dirty ERO Erosion FVI Fatigue, vibration IMB Imbalance ALN Improper alignment CLE Improper clearance FIT Improper fit LUB Improper lubrication RUN Improper runout MTL Incorrect material LKG Leakage LOS Looseness SZD Seized STK Sticking VIB Vibration/noise WER Wear Explanation of failure mode codes
Code Description STP Failed to stop on demand BRD Breakdown LOO Low output ERO Erratic output ELP External leakage process medium VIB Vibration NOI Noise OTH Other (specify in text field) FTR Failed to regulate FTS Failed to start on demand SPS Spurious stop/unexpected shutdown HIO High output ELU External leakage utility medium INL Internal leakage OHE Overheating OWD Operation without demand OOC Out of compliance PDE Parameter deviation PLU Plugged/choked AIR Abnormal instrument reading STD Structural deficiency SER Minor-in-service problems UNK Unknown Author Information Ken Latino is a senior consultant in Meridium, Inc., Asset Performance Management Consulting Group, a root cause analysis practitioner, frequent speaker on the topic at conferences, and an educator. He is author of several root cause analysis courses, books, and trade magazine articles. Ken designed a software program that assists analysts in conducting a disciplined root cause analysis. He can be reached at 540-344-9205 ext. 1176 or KLatino@meridium.com .
-
-
-
-
-
-
-
Do you have experience and expertise with the topics mentioned in this content? You should consider contributing to our WTWH Media editorial team and getting the recognition you and your company deserve. Click here to start this process.