|
Rate This Blog
![]() ![]() ![]() ![]() ![]() 1 rating(s)
Categories
• Exploratory Data Analysis
• Designed Experiments • Interrelationship Digraphs • Study Cause and Effect • Analyze Common Cause Variation • Process Improvement • Process Capability Indices • Rosen Yield Example • Hoerl-Snee Strategy • Is–Is Not Analysis • Cause and Effect Diagram • Pareto Chart • Flowchart • Special Cause • Basic Concepts • History • Six Sigma
Archives
Latest Entries
Loading...
Links
Loading...
|
Statistical Thinking to Improve Quality
This blog examines the use of data analyses and statistical tools in a framework of statistical thinking to improve quality. The following principles form the basis for statistical thinking:
* All work occurs in a system of interconnected processes, * Variation exists in all processes, and * Understanding and reducing variation are keys to success. Statistical thinking significantly improves the effectiveness of data analyses and statistical tools.
Exploratory Data Analysis
Saturday March 28, 2009
Posted by: Gordon Clark at 11:29AM CST on March 28, 2009
The primary purpose Exploratory Data Analysis (EDA) is to identify the key variables that affect the quality measures. Two principles, mentioned by De Mast and Trip (2007), are helpful in identifying these variables. They are:
Chang and Lu (1995) provide an example illustrating these principles. A steel sheet metal manufacturer had customers complaining about uneven thickness. The specification was 4.5 ± .5 mm. The production manager had data collected from 120 sheets giving the thickness measurements on the left, middle and right sides of the sheets. Employees selected five sheets at shift times of 0900, 1100, 1400 and 1700 over a period of five days. The histogram appearing below shows 13% of the sheet thickness measurements below the lower specification limit of 4.0 mm. Also, the mean is lower than 4.5 mm.
After discussions with shop-floor personnel, they stratified by position on the sheet and by time. Histograms for the two stratifications appear below. The stratification by position did not show distributions much different than the aggregate distribution. However, the stratification by time showed higher frequencies of thin measurements at 1100 and 1700. Twenty four of the 26 values in the histograms below 4 mm, 24 of them were at 1100 and 1700.
Discussions with shop-floor personnel identified mold wear out, build up of chips in a work holding device, and operator fatigue as possible causes. The corrective action was to take a 10 minute break at 1030 and 1630 each day and have maintenance performed during the breaks. The corrective action produced a substantial reduction in thin sheets. References
Sunday February 15, 2009
Posted by: Gordon Clark at 7:07PM CST on February 15, 2009
De Mast and Trip (2007) specify that the purpose of Exploratory Data Analysis (EDA) is to identify the dependent (Y) and independent (X) variables that may help understand or solve a quality problem. However, they point out that EDA can only identify variables that vary in the collected data set. If the EDA can not identify key variables affecting the system performance, available options include:
Option 1 Option 2 References
Tuesday February 10, 2009
Posted by: Gordon Clark at 1:29PM CST on February 10, 2009
De Mast and Trip (2007) list the following three steps in performing Exploratory Data Analysis.
The resin output variation example, 3/25/2008 posting illustrates these steps. The Ricoh team constructed a histogram of the output quantity (Display the data), noticed the bimodal nature of the output quantity (Identify salient features), and this bimodal distribution suggested that the output distributions from lines A and B were different (Interpret salient features). Histograms of line A and line B output confirmed this conclusion. Another salient feature of the histograms was the excessive variation in output quantity. This feature motivated establishment of lower and upper limits and a target value. References
Monday December 22, 2008
Posted by: Gordon Clark at 9:23AM CST on December 22, 2008
The postings on 3/21/2008, 3/25/2008, 3/28/2008 and 4/1/2008 present the Resin Output Variation Example to illustrate Statistical Thinking and the Hoerl-Snee Process Improvement Strategy. This example also makes extensive use of Exploratory Data Analysis. Ricoh’s Numazu plant made raw material used as ingredients for copy machine toner. The company had a team which monitored the process in order to achieve continual quality improvement.
The exploratory data analysis included examination of four different graphical displays. They are a run chart, histograms, a scatter plot, and several control charts. De Mast and Trip (2007) points out that Good (1983); Hoaglin, Mosteller et al (2000); and Bisgaard (1996) note that graphical presentations are preferred in Exploratory Data Analysis. They are more effective is showing an individual what he did not expect to see. References
Monday December 15, 2008
Posted by: Gordon Clark at 8:18PM CST on December 15, 2008
Bisgaard (2006) gives us an example where Exploratory Data Analysis leads us to narrow the scope of the quality improvement investigation. The example involves the production of small outboard motors by an assembly line. Monthly quality reports showed an unacceptable number of defective motors that caused costly rework. The Vice President of Manufacturing formed a team for the purpose of reducing the number of defects. The first task performed by the team was to flowchart the assembly line. This is consistent with the first step in the Hoerl-Snee Process Improvement Strategy (See the posting on 4/8/2008). After the base motors were painted and dried, the motors traveled on a ten station line for the purpose of installing accessory components. These accessory components included the carburetor, brackets, the propeller, and electrical systems. Next, the team examined tables specifying defects and their type. The team found the tables difficult to analyze. To assist the analysis the team constructed Pareto charts specifying the defects by type of defect. For example, missing fasteners, loose fasteners, and missing operations. These Pareto Charts did not suggest principal causes. The team decided to categorize the defects by the station on the line where the defect originated. For example, a loose fastener on the carburetor, the defect originated at station 3. An incorrectly mounted spark plug wire would have occurred at station 9. The Pareto Chart categorizing defects by station appears below.
The team focused on station 9. The workers on the line revealed that design of the motors had changed leaving station 9 with more work than the other stations. The team redesigned the assembly line reducing the work load at station 9. A number of other changes were made such as improved lighting. The result was a dramatic reduction in the occurrence of defects. De Mast and Trip (2007) claim that this example illustrates the use of exploratory data analysis change the focus of the problem from “too many defects” to “too many defects from station 9”. They state that the example illustrates the use of exploratory data analysis to identify a KPOV. My viewpoint is that this example illustrates the identification of a KPIV. That is, the assembly line station. Admittedly, the next phase of the improvement effort was clearly more focused on station 9. We must remember that quality improvement is often an iterative process. That is, successive Plan-Do-Check-Act (PDCA) cycles. Identifying a KPIV on a cycle may result in that KPIV being a KPOV on the next cycle. References
Wednesday November 26, 2008
Posted by: Gordon Clark at 11:47AM CST on November 26, 2008
The purpose of Exploratory Data Analysis (EDA) is to generate hypotheses or clues that guide us in improving quality or process performance. Breyfogle (2003, pgs. 10-11) views Six Sigma as a murder mystery where we use a structured approach to uncover clues that lead us to improve process outputs. These clues are Key Process Input Variables (KPIVS) and process improvement strategies. As an example, he considers the process of traveling to work where the Key Process Output Variable (KPOV) is the arrival time. Examples of KPIVs are the setting of our alarm clock and our departure time. An alternative process improvement strategy might be a different travel route that is less subject to variation during congested time periods. Then, the route selected is another KPIV, and the travel time along that route is a function of both the route and departure time. Exploratory Data Analysis helps us identify these KPIVs. De Mast and Trip (2007) state that the purpose of EDA from a quality improvement project viewpoint is to identify the dependent (Y) and independent (X) variables that may help understand or solve the quality problem. The dependent Y variables are KPOVs, and the independent X variables are KPIVs. Leitnaker (2000) gives an example of EDA to identify KPIVs. The example is a molding operation where:
A team studied a molding operation supplying plastic switches to industrial customers for use in assembled control pads. The operation has eight machines, each machine has two molds, and each mold has four cavities. To investigate the process capability, the team took a sample of size 5 from the output of one machine every 4 hours. The following control chart displays the results for a critical dimension. The process is in control, and the range chart supported this conclusion. But the variation is large. Next the team investigated the effect of the cavities and molds on the measured dimension. To do this, they sampled one part from each of the four cavities of the two molds on one machine. Breaking down the data by cavity and mold is an example of stratification. Control charts for the individual cavities and molds showed that all cavities and molds appear to be in control. However, mold 2 cavities have larger averages than mold 1 cavities, and the averages for the cavities increases with cavity number. The following figure clearly shows this pattern.
The figure leads us to identify mold and cavities numbers as KPIVs. The exploratory data analysis produced a clue which generated a search for the reasons that molds and cavities produced different average dimensions. The team can proceed to reduce the variability in the measured dimension by reducing the differences in averages for the molds and cavities.
References
Wednesday November 19, 2008
Posted by: Gordon Clark at 3:00PM CST on November 19, 2008
This posting describes the difference between Exploratory Data Analysis (EDA) and Confirmatory Data Analysis (CDA). Tukey (1977) distinguished between EDA and CDA. Confirmatory Data Analysis tests hypotheses and produces estimates with a specified precision. Regression analysis, Analysis of Variance, and Hypothesis Tests are examples of Confirmatory Data Analysis. Confirmatory Data Analysis requires hypotheses or assumptions to consider and evaluate. Exploratory Data Analysis makes few assumptions, and its purpose is to suggest hypotheses and assumptions. Consider the OEM manufacturer described in the posting on 1/30/2008. The company was experiencing customer complaints. A team wanted to identify and remove causes of these complaints. They asked customers for usage data so the team could calculate defect rates. This started an Exploratory Data Analysis. The team plotted a control chart, and these charts identified a high defect rate in October, 1991. The investigation established that a supplier used the wrong raw material. Discussions with the supplier and team members motivated further analysis of raw material, and its composition. This decision to analyze raw material completed the Exploratory Data Analysis. The Exploratory Data Analysis used both data analysis and process knowledge possessed by team members. The supplier and company conducted a series of designed experiments which identified an improved raw material composition. Using this composition, the defect rate improved from .023% to .004%. The experimental design and its analysis was Confirmatory Data Analysis. Note that the experimental design required a hypothesis generated by the Exploratory Data Analysis. Tukey states that EDA is detective work. He uses the criminal justice process as an analogue to illustrate the roles of EDA and CDA. A detective investigating a crime needs both tools and understanding. The detectives and other investigative units search for and produce evidence. The juries and judges evaluate the evidence’s strength. Exploratory Data Analysis uncovers statements or hypotheses for Confirmatory Data Analysis to consider. Experimental design and regression modeling are more effective if Exploratory Data Analysis uncovers precise statements or hypotheses. Admittedly, one can conduct experiments searching for hypotheses; however, our viewpoint is that preliminary Exploratory Data Analyses may reduce the costs of these experiments. Exploratory and Confirmatory Data Analyses can be thought of as part of statistical thinking. De Mast and Trip (2007) present principles for more effective EDA in quality improvement projects. We will examine results from their paper in future postings. Their paper won the Nelson award for the paper having the greatest immediate impact for practitioners published during 2007 in the Journal of Quality Technology. References
|
|