Availability Analysis

Availability Analysis connects uptime, downtime, failures, repair time, planned downtime, operating requirements, and reliability improvement so teams can understand whether assets can support demand.

Back to BoK Index

MetricMeasurementDecision Support

Definition

Availability Analysis evaluates the proportion of time that equipment, systems, or processes are capable of performing required work when needed. It connects uptime, downtime, failure frequency, repair duration, planned downtime, maintenance strategy, and operational demand.

A common inherent availability approximation is MTBF divided by MTBF plus MTTR, where MTBF is mean time between failures and MTTR is mean time to repair. Operational availability may also include planned downtime, setup, waiting for parts, staffing, changeovers, sanitation, inspections, or other real-world conditions that affect usable time.

History

Availability analysis comes from reliability engineering, maintenance management, military logistics, industrial operations, and asset management. As equipment and systems became more complex, organizations needed ways to understand not only whether equipment could perform, but whether it would be ready when required.

In Lean, Six Sigma, and TPM environments, availability is a major driver of flow, capacity, OEE, delivery performance, and cost. It links reliability metrics such as MTBF and MTTR to operational consequences such as missed schedules, overtime, inventory buffers, downtime losses, and customer disruption.

When to Use

Use Availability Analysis when equipment or system readiness affects safety, throughput, delivery, customer service, cost, or quality. Good triggers include recurring downtime, low OEE, capacity shortages, unstable schedules, emergency maintenance, long repairs, spare-part delays, bottleneck equipment, utilities reliability, IT outages, and service system downtime.

Use it during asset selection, maintenance strategy design, capacity planning, TPM deployment, reliability improvement, spare-part planning, and bottleneck analysis. Define the availability type carefully before comparing numbers across assets or sites.

Step-by-Step

Define required function and demand window. State what the asset must do and when it must be available. Availability is meaningless without required operating context.
Define downtime rules. Decide what counts as downtime, planned downtime, unplanned downtime, standby time, changeover, waiting time, and excluded time.
Collect event data. Capture start and end times for failures, repairs, planned maintenance, changeovers, sanitation, part waits, adjustments, and other loss categories.
Calculate uptime and downtime. Summarize available time, unavailable time, operating time, and loss categories over the selected period.
Calculate MTBF and MTTR where appropriate. Use consistent failure definitions and repair completion criteria.
Compute availability. Use the formula that matches the question: inherent availability, achieved availability, operational availability, or OEE availability.
Segment by asset, failure mode, shift, product, and condition. Stratification helps identify whether availability losses are driven by a few repeat causes.
Prioritize improvement actions. Reduce failure frequency through reliability work, reduce repair time through maintainability and spare parts, or reduce planned downtime through better scheduling and setup methods.
Monitor sustainment. Track availability over time with clear definitions and connect it to OEE, throughput, and customer impact.

Examples

Bottleneck machine: A CNC cell has high demand but frequent spindle alarms. Availability Analysis shows that a few recurring alarm types drive most downtime, leading to focused maintenance and parameter review.
Packaging line: OEE shows low availability. Downtime stratification reveals long changeovers and sanitation waits rather than mechanical failure, so the team uses SMED and scheduling changes.
Utility system: Compressed air outages stop several lines. The team analyzes availability of compressors, dryers, and backup capacity to justify preventive maintenance and redundancy changes.
IT service desk: A production reporting system is unavailable during shift starts. Availability Analysis separates software outages from planned maintenance windows and network issues.
Maintenance spare parts: MTTR is high because repairs wait for stocked components. The team improves spare-part strategy for high-criticality failure modes.

Common Pitfalls

Using unclear definitions. Availability numbers cannot be compared if sites classify planned downtime, waiting time, or changeovers differently.
Mixing reliability and scheduling losses. Mechanical failures, setup time, sanitation, no operator, and no material may all reduce usable time but require different countermeasures.
Ignoring repair delay. MTTR can include troubleshooting, parts waiting, approval, and restart time depending on definition. Be explicit.
Counting only major outages. Many small stops can create large availability loss and should be captured if they affect demand.
Focusing only on uptime percentage. A high average availability can hide severe downtime during critical demand windows.
No link to business impact. Availability improvement should connect to throughput, service level, safety, cost, quality, or customer impact.
Blaming maintenance alone. Availability is affected by design, operation, cleaning, setup, materials, training, spare parts, and leadership priorities.