Panelist Outliers Analysis

Panelist Outliers Analysis

Purpose

To detect panellists that are possible outliers, highlighting those with extremely high or low values.

Data Format

  1. See the profiling dataset.

Options

  1. Type of Panel Mean: Adjusted or Arithmetic. Arithmetic means are the well known means, calculated by summing the observations then dividing by the number of observations. Adjusted means, commonly called Least-Squares means (LSmeans) or Estimated Marginal Means (EMMs), use a regression model to calculate the means adjusting for the balance of the data.
  2. Number of standard deviations: Assessors whose mean falls outside plus or minus this number of standard deviations from the panel mean are highlighted as possible outliers.
  3. Y-axis Scale: Automatic or Manual. Automatic chooses the plotting range from the range of each attribute. Manual uses the plotting range you specified.
  4. Y-axis min value: The smallest value that will appear on all plots when specifying the plotting range manually.
  5. Y-axis max value: The largest value that will appear on all plots when specifying the plotting range manually.
  6. Anonymise Assessors? Choose to replace the assessor names or not. There are options for randomly generated names or names from the assessor metadata.
  7. Anonymise Products? Choose to replace the product names or not. There are options for randomly generated names or names from the product metadata.
  8. Anonymise Attributes? Choose to replace the attribute names or not. There are options for randomly generated names or names from the attribute metadata.

Results and Interpretation

  1. For each product and attribute a bar plot of panellist means is displayed. These plots overlay the panel mean as a black line and the chosen number of standard deviations from this mean as two dotted lines. Any panellist outside this range is highlighted as a possible outlier.
  2. If “Type of Panel Mean” was set to “Adjusted” then the panel means will be adjusted means from models of the form: Attribute = Product + Assessor + Replicate + Residuals or Attribute = Product + Assessor + Residuals if replications are not present.
  3. Outliers should not necessarily be removed.
  4. ​For a large number of draws from a normal distribution it is expected that around 5% of observations would be outside of plus or minus 1.96 standard deviations from the mean.

Technical Information

  1. R packages: This analysis uses SensoMineR to calculate the adjusted means.

    • Related Articles

    • Panelist Feedback Analysis

      Purpose Plots summarising the scoring range for each judge on each of the samples and attributes. Data Format See the profiling dataset. The attributes should be of scale or interval type. Options Type of Panel Mean: Adjusted or Arithmetic. ...
    • Penalty Analysis

      Purpose To provide a penalty analysis of a consumer data set, that is to investigate how liking or acceptability of product decreases when product attributes are not at the optimal intensity. Data Format Consumer.xlsx Note: for EyeOpenR to read your ...
    • Panelist Performance Analysis

      Purpose This analysis looks at the overall panel performance in terms of Discrimination, Agreement and Repeatability or Reproducibility, and then the performance of each individual in the panel in these terms. Data Format See the profiling dataset. ...
    • Panelist Strip Plot Analysis

      Purpose Plots for each attribute the panellists means across the products, for visual comparison and interpretation. Data Format See the profiling dataset. Background This analysis calculates the mean response from each assessor on each attribute ...
    • Napping Analysis

      Purpose To provide an analysis of data collected using the napping methodology. Data Format Napping.xlsx For EyeOpenR to read your data the first five columns must include the following in the specified order: Assessor, Product, Session, Replica and ...