In the world of quality management and process improvement, the ability to measure accurately is fundamental to making informed decisions. Within the DMAIC (Define, Measure, Analyze, Improve, Control) framework of Lean Six Sigma, the Measure phase plays a critical role in establishing a reliable baseline for understanding current process performance. At the heart of this phase lies a crucial validation tool: the Gage Repeatability and Reproducibility (GR&R) study.
Understanding whether your measurement system is capable of detecting actual differences in parts or products is not just a technical requirement but a business imperative. A flawed measurement system can lead to incorrect decisions, wasted resources, and customer dissatisfaction. This comprehensive guide will walk you through everything you need to know about conducting effective GR&R studies, complete with practical examples and real-world applications. You might also enjoy reading about Value Stream Mapping: A Comprehensive Guide to Identifying Waste in Your Current Process.
Understanding the Foundation of Measurement Systems Analysis
Before diving into the specifics of GR&R studies, it is essential to understand why measurement systems analysis matters. Every measurement process contains variation, and this variation comes from two primary sources: actual product variation and measurement system variation. The goal of a GR&R study is to determine how much of the total observed variation comes from the measurement system itself. You might also enjoy reading about 5 Common Mistakes in the Measure Phase and How to Avoid Them for Lean Six Sigma Success.
A measurement system includes all the components involved in obtaining measurements: the gage or instrument, the operator or appraiser, the procedure or method, the environment, and any other factors that might influence the measurement result. When we conduct a GR&R study, we are essentially asking: “Can we trust our measurement system to give us accurate and consistent results?” You might also enjoy reading about Control Charts Basics: Understanding Variation in the Measure Phase of Lean Six Sigma.
The Two Critical Components: Repeatability and Reproducibility
Repeatability: Equipment Variation
Repeatability refers to the variation in measurements obtained when one operator uses the same gage to measure the same characteristic on the same part multiple times. This is sometimes called equipment variation because it represents the inherent precision of the measuring instrument itself. Think of it as the gage’s ability to produce consistent results under identical conditions.
For example, if a technician measures the diameter of a shaft five times without moving it, and obtains readings of 25.02mm, 25.03mm, 25.02mm, 25.04mm, and 25.02mm, the slight variations in these measurements represent repeatability error.
Reproducibility: Appraiser Variation
Reproducibility refers to the variation in the average measurements obtained when different operators use the same gage to measure the same characteristic on the same part. This is also known as appraiser variation because it captures the differences between operators. These differences might arise from variations in technique, interpretation of measurement scales, or even differences in how operators position parts for measurement.
Continuing with our shaft diameter example, if three different technicians each measure the same shaft and obtain average readings of 25.03mm, 25.07mm, and 25.02mm respectively, the variation between these averages represents reproducibility error.
Planning Your GR&R Study: Key Considerations
A successful GR&R study requires careful planning. Several factors need consideration before beginning the actual measurement process.
Selecting the Number of Operators
Typically, a GR&R study involves two to three operators, though three is generally recommended for a more robust analysis. These operators should be representative of those who regularly use the measurement system. Select operators with varying experience levels if possible, as this provides a more realistic assessment of reproducibility across your workforce.
Determining the Number of Parts
The number of parts selected for the study should represent the full range of process variation. Industry standards typically recommend using 10 parts, though studies can be conducted with as few as five parts when necessary. The key is that these parts should span the entire specification range or expected process variation.
Deciding on the Number of Trials
Each operator should measure each part multiple times, typically two to three trials. Three trials per part per operator is recommended for a comprehensive study, though two trials may be acceptable when time or resource constraints exist.
Conducting the GR&R Study: Step by Step Methodology
Step 1: Prepare the Parts and Equipment
Begin by selecting your parts and numbering them. The parts should be measured in random order to prevent operators from remembering previous measurements. Ensure that the measurement equipment is properly calibrated and that environmental conditions are stable throughout the study.
Step 2: Organize the Data Collection
Create a data collection sheet that includes spaces for recording the operator name, part number, trial number, and measurement value. Randomize the measurement sequence to minimize bias. It is critical that operators cannot see previous measurements or results from other operators during the study.
Step 3: Execute the Measurements
Have each operator measure all parts in random sequence for the first trial. Then repeat this process for subsequent trials, ensuring that operators remain blind to their previous measurements and the measurements of others.
Working Through a Practical Example: Manufacturing Case Study
Let us examine a real-world scenario to illustrate how a GR&R study works in practice. Imagine a precision manufacturing facility that produces metal brackets. The thickness of these brackets is a critical quality characteristic with a specification of 5.00mm plus or minus 0.10mm. The quality team needs to validate their digital caliper measurement system.
Study Parameters
The team decides to conduct a GR&R study with the following parameters: three operators (Andrea, Brian, and Carlos), ten parts representing the range of production variation, and three trials per part per operator. This results in 90 total measurements (3 operators × 10 parts × 3 trials).
Sample Data Set
Here is a portion of the collected data showing measurements for the first three parts:
Part 1 Measurements (in mm):
- Andrea: Trial 1: 4.95, Trial 2: 4.96, Trial 3: 4.95
- Brian: Trial 1: 4.97, Trial 2: 4.98, Trial 3: 4.97
- Carlos: Trial 1: 4.96, Trial 2: 4.95, Trial 3: 4.96
Part 2 Measurements (in mm):
- Andrea: Trial 1: 5.03, Trial 2: 5.04, Trial 3: 5.03
- Brian: Trial 1: 5.05, Trial 2: 5.06, Trial 3: 5.05
- Carlos: Trial 1: 5.04, Trial 2: 5.03, Trial 3: 5.04
Part 3 Measurements (in mm):
- Andrea: Trial 1: 5.01, Trial 2: 5.01, Trial 3: 5.02
- Brian: Trial 1: 5.03, Trial 2: 5.02, Trial 3: 5.03
- Carlos: Trial 1: 5.01, Trial 2: 5.02, Trial 3: 5.01
From this sample data, we can already observe some patterns. Andrea’s measurements appear consistently lower than Brian’s across all parts, suggesting potential reproducibility issues. However, each operator’s repeated measurements on the same part are relatively consistent, indicating acceptable repeatability.
Analyzing GR&R Results: Understanding the Numbers
Once data collection is complete, the next step involves statistical analysis. While specialized software packages can perform these calculations automatically, understanding the underlying concepts is crucial for proper interpretation.
Key Metrics and Acceptance Criteria
The primary output of a GR&R analysis is the percentage of total variation attributed to the measurement system. This is typically expressed as a percentage of tolerance or percentage of total variation.
Generally accepted criteria are:
- Under 10%: The measurement system is acceptable
- 10% to 30%: The measurement system may be acceptable depending on the application, cost of measurement device, cost of repair, or other factors
- Over 30%: The measurement system needs improvement
Range Method Calculations
The range method is one approach to calculating GR&R, though ANOVA (Analysis of Variance) methods are generally more comprehensive. Using our example data, we would calculate the average range for repeatability and reproducibility.
For repeatability, we calculate the range (maximum minus minimum) for each operator’s measurements on each part, then average these ranges across all parts and operators. For reproducibility, we look at the differences between operator averages.
Interpreting Component Contributions
Beyond the overall GR&R percentage, it is valuable to understand the relative contributions of repeatability and reproducibility. If repeatability is the dominant component, the issue lies primarily with the measurement equipment or method. If reproducibility dominates, the problem is more likely related to operator technique or training.
In our bracket thickness example, if analysis shows that Brian consistently measures 0.02mm higher than Andrea and Carlos, this points to a reproducibility issue. Perhaps Brian applies slightly more pressure when closing the caliper, or positions parts differently. This insight guides corrective action toward operator training rather than equipment replacement.
Common Pitfalls and How to Avoid Them
Insufficient Randomization
One of the most common mistakes in conducting GR&R studies is failing to properly randomize the measurement sequence. If operators can remember or predict previous measurements, the study loses validity. Use random number generators or pre-prepared random sequences to ensure true randomization.
Unrepresentative Part Selection
Selecting parts that do not represent the full range of process variation will produce misleading results. If all selected parts fall in the middle of the specification range, the study may indicate acceptable measurement system capability that does not hold true for parts near the specification limits. Always select parts spanning the expected variation range.
Environmental Variations
Changes in temperature, humidity, or other environmental factors during the study can introduce extraneous variation. Conduct studies under controlled conditions and complete them within a reasonable timeframe to minimize environmental impact.
Operator Awareness Bias
When operators know they are being evaluated, they may behave differently than during normal operations. While some awareness is unavoidable, emphasize that the study evaluates the measurement system, not individual performance. This helps operators maintain their typical measurement practices.
Taking Action on GR&R Results
When the Measurement System Fails
If your GR&R study reveals unacceptable measurement system variation, do not despair. This finding, while initially disappointing, prevents you from making decisions based on unreliable data. Consider these improvement approaches:
For High Repeatability Variation:
- Calibrate or maintain measurement equipment
- Replace worn or damaged gages
- Use measurement devices with finer resolution
- Improve fixturing to ensure consistent part positioning
- Control environmental factors affecting measurements
For High Reproducibility Variation:
- Develop clearer measurement procedures
- Provide comprehensive operator training
- Create visual aids or measurement standards
- Implement operator certification programs
- Consider automation to eliminate operator variation
Continuous Monitoring
A GR&R study is not a one-time event. Measurement systems should be periodically re-evaluated, especially after equipment maintenance, operator changes, or process modifications. Many organizations incorporate abbreviated GR&R studies into their regular quality system audits.
Advanced Considerations in Measurement Systems Analysis
Attribute Agreement Analysis
While GR&R studies apply to continuous measurement data, attribute agreement analysis serves a similar purpose for go/no-go or categorical data. If your measurement system involves visual inspection or binary classification, attribute agreement analysis evaluates operator consistency and accuracy against known standards.
Destructive Testing Scenarios
Traditional GR&R studies assume that parts can be measured multiple times without alteration. However, some measurements are destructive, such as tensile strength testing or contamination analysis. Modified approaches using nested designs or comparing measurements to reference standards address these special cases.
Automated Measurement Systems
As manufacturing increasingly incorporates automated measurement and inspection systems, GR&R principles still apply but may require adaptation. Automated systems eliminate operator reproducibility but may introduce other variation sources such as part loading variation or software algorithm inconsistencies.
The Business Impact of Measurement System Capability
Understanding measurement system capability extends beyond technical compliance. The business implications are significant and far-reaching.
Cost Implications
Poor measurement systems lead to accepting defective products or rejecting good products. The first scenario results in customer complaints, returns, and damaged reputation. The second creates unnecessary scrap and rework costs. A capable measurement system optimizes these costs by enabling accurate accept/reject decisions.
Process Improvement Accuracy
When implementing process improvements, you need reliable measurements to determine whether changes actually deliver benefits. If measurement system variation exceeds process improvement gains, you cannot confidently attribute changes to your improvement efforts. This undermines confidence in Lean Six Sigma initiatives and can lead to abandoning effective improvements or pursuing ineffective ones.
Supplier and Customer Relationships
Measurement system capability affects how you interact with suppliers and customers. If your measurement system differs significantly from theirs, disputes arise over product acceptance. Conducting joint GR&R studies or correlating measurement systems builds trust and prevents conflicts.
Integrating GR&R Studies into Your Quality Management System
For maximum benefit, GR&R studies should integrate seamlessly into your broader quality management system. This integration involves several elements.
Documentation and Procedures
Develop standard procedures for conducting GR&R studies that specify when studies are required, how to select operators and parts, data collection methods, analysis approaches, and acceptance criteria. Document all studies with sufficient detail to enable review and comparison over time.
Training Requirements
Ensure that personnel responsible for measurement systems understand GR&R principles and can conduct studies independently. This knowledge should extend beyond quality department personnel to include production supervisors, process engineers, and anyone making decisions based on measurement data.
Software Tools
While manual calculations are possible, specialized software significantly streamlines GR&R analysis. Statistical software packages like Minitab, JMP, or even properly configured spreadsheet tools automate calculations, generate control charts, and produce standardized reports. Invest in appropriate tools and train personnel in their use.
Real-World Success Stories
Consider a pharmaceutical packaging facility that conducted a GR&R study on their tablet weight measurement system. The study revealed 35% measurement system variation, primarily due to reproducibility issues. Investigation showed that different operators used different techniques for zeroing the balance and positioning tablets. After implementing standardized procedures and providing hands-on training, a follow-up study showed measurement system variation reduced to 8%. This improvement enabled the facility to tighten process controls, reduce raw material waste, and improve product consistency.
In another example, an automotive components manufacturer found that their coordinate measuring machine (CMM) showed acceptable repeatability but unacceptable reproducibility even though measurements were automated. The issue traced to inconsistent part fixturing methods among operators loading parts into the CMM. Redesigning the fixture to include positive locating features and providing operator training reduced reproducibility variation by 70%, enabling the manufacturer to detect smaller process shifts and prevent defects.
Moving Forward with Confidence
Conducting GR&R studies represents a fundamental discipline in the Measure phase of Lean Six Sigma. While the process may initially seem complex, the underlying principles are straightforward: ensure your measurement system can reliably detect true differences in








