Partial Least Squares Structural Equation Modeling (PLS-SEM)

Introduction

Partial Least Squares Structural Equation Modeling (PLS-SEM) is a statistical approach used to estimate complex cause-effect relationship models involving latent constructs and their indicators. It’s widely used in fields like marketing, management, psychology, and social sciences, particularly when traditional covariance-based SEM (CB-SEM) assumptions are too restrictive or the data do not meet normality or sample size requirements.

Key Concepts in PLS-SEM

  1. Latent Variables (LVs): Latent variables are unobserved variables that are inferred from multiple observed indicators. In PLS-SEM, these are often theoretical constructs like “Customer Satisfaction” or “Brand Loyalty.” LVs are measured indirectly via observed variables, called indicators.
  2. Indicators: These are the observed variables or items (such as survey questions or measurable attributes) used to measure the latent variables. For instance, “Customer Satisfaction” could be measured using a set of questions where each question represents an indicator.
  3. Measurement Model (Outer Model): This part of the model defines the relationship between the latent variables and their respective indicators. There are two types of measurement models:
  1. Reflective Measurement Model: Indicators are caused by the latent variable (e.g., “Customer Satisfaction” causes the survey responses).
  2. Formative Measurement Model: Indicators form or define the latent variable (e.g., “Income” and “Education” collectively form “Socioeconomic Status”).
  3. Structural Model (Inner Model): This represents the relationships between latent variables, defining the hypothesized paths and cause-effect relationships in the model. For example, “Customer Satisfaction” might influence “Customer Loyalty.”
  4. Path Coefficients: These are the estimates of the relationships between latent variables in the structural model. They indicate the strength and direction of the relationships (similar to regression coefficients).
  5. Weights and Loadings:
  1. Weights are used to calculate latent variable scores from their indicators (especially in formative models).
  2. Loadings represent the strength of the relationship between indicators and latent variables in reflective models.
  3. R-squared: This is the measure of explained variance for the endogenous latent variables (dependent variables) in the model. It indicates how much variance in the latent variable is explained by the independent variables.

When to Use PLS-SEM

PLS-SEM is used when:

  • The model is complex with many latent variables and indicators.
  • The data do not meet the assumptions of normality or large sample size.
  • The goal is prediction or theory development (as opposed to theory confirmation).
  • Formative constructs (where indicators form a latent variable) are present.
  • The model includes relationships that might not converge in CB-SEM.

PLS-SEM is particularly useful when dealing with smaller samples, non-normal data, and exploratory research scenarios.

Advantages of PLS-SEM

  1. Flexibility: PLS-SEM can handle both reflective and formative constructs, and it doesn’t require strict distributional assumptions (i.e., no normality assumption).
  2. Small Sample Size: PLS-SEM performs well with small to medium sample sizes, unlike CB-SEM, which requires larger samples.
  3. Complex Models: It is suited for models with a large number of constructs and relationships (even when these models may not converge using CB-SEM).
  4. Focus on Prediction: PLS-SEM is more concerned with maximizing the explained variance (prediction) of dependent variables rather than testing the fit of the entire model.
  5. Bootstrapping for Significance: PLS-SEM uses bootstrapping techniques to generate standard errors and confidence intervals, allowing for hypothesis testing without relying on traditional parametric assumptions.

Steps in PLS-SEM

  1. Model Specification: Specify the measurement model (linking latent variables and indicators) and the structural model (linking latent variables to each other).
  2. Model Estimation: PLS-SEM iteratively estimates relationships by maximizing the variance explained in the dependent latent variables. It first estimates the latent variable scores, then calculates the path coefficients.
  3. Model Evaluation: Evaluate the measurement model (using indicator reliability, internal consistency, and convergent validity) and the structural model (using path coefficients, R-squared, and predictive relevance).
  4. Bootstrapping: This is used to estimate the precision of the PLS estimates and to assess the significance of the path coefficients (p-values) in the structural model.

Model Evaluation Criteria

  1. Measurement Model:
    1. Reliability: Evaluated using indicator loadings and composite reliability.
    1. Validity: Checked with average variance extracted (AVE), discriminant validity (HTMT or Fornell-Larcker criterion).
    1. Collinearity: VIF (Variance Inflation Factor) to ensure no multicollinearity in formative constructs.
  2. Structural Model:
  1. Path Coefficients: The strength and significance of relationships between latent variables.
  2. R-squared (R²): Indicates the variance explained by independent latent variables for each endogenous latent variable.
  3. Predictive Relevance (Q²): Assessed using blindfolding procedures to determine how well the model can predict.
  4. Goodness of Fit: Although traditional SEM focuses on model fit indices (like chi-square, RMSEA, etc.), PLS-SEM focuses more on the variance explained (R²) and predictive power rather than overall model fit.

Common Software for PLS-SEM

  1. SmartPLS: A popular commercial software with a graphical user interface.
  2. ADANCO: Another commercial software specifically designed for PLS-SEM.
  3. R: Free and flexible tools like plspmsemPLS, and seminr.
  4. WarpPLS: A commercial tool that offers unique algorithms for PLS-SEM.

PLS-SEM vs. Covariance-Based SEM (CB-SEM)

FeaturePLS-SEMCB-SEM (e.g., LISREL, AMOS)
GoalPrediction, theory developmentTheory confirmation
Sample SizeSmall to mediumLarge
AssumptionsNon-parametric, no normality requiredParametric, requires normality
Model FitFocuses on R² and predictive powerFit indices (e.g., RMSEA, CFI, etc.)
Handling ComplexityHandles complex models easilyMay struggle with large, complex models

Applications of PLS-SEM

  • Marketing: Used for customer satisfaction and loyalty models.
  • Social Sciences: To explore relationships between behavioral constructs.
  • Business: For evaluating business performance and strategy.
  • Information Systems: For studying technology acceptance models (e.g., TAM).

Conclusion

PLS-SEM is a powerful and flexible method used to estimate complex models with latent variables and their indicators. It is especially suited for exploratory research and predictive modeling, offering an alternative to traditional covariance-based SEM when data and model conditions are not ideal.

1 thought on “Partial Least Squares Structural Equation Modeling (PLS-SEM)”

Leave a Comment

Your email address will not be published. Required fields are marked *