U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Int J Med Educ

Factor Analysis: a means for theory and instrument development in support of construct validity

Mohsen tavakol.

1 School of Medicine, Medical Education Centre, the University of Nottingham, UK

Angela Wetzel

2 School of Education, Virginia Commonwealth University, USA

Introduction

Factor analysis (FA) allows us to simplify a set of complex variables or items using statistical procedures to explore the underlying dimensions that explain the relationships between the multiple variables/items. For example, to explore inter-item relationships for a 20-item instrument, a basic analysis would produce 400 correlations; it is not an easy task to keep these matrices in our heads. FA simplifies a matrix of correlations so a researcher can more easily understand the relationship between items in a scale and the underlying factors that the items may have in common. FA is a commonly applied and widely promoted procedure for developing and refining clinical assessment instruments to produce evidence for the construct validity of the measure.

In the literature, the strong association between construct validity and FA is well documented, as the method provides evidence based on test content and evidence based on internal structure, key components of construct validity. 1 From FA, evidence based on internal structure and evidence based on test content can be examined to tell us what the instrument really measures - the intended abstract concept (i.e., a factor/dimension/construct) or something else. Establishing construct validity for the interpretations from a measure is critical to high quality assessment and subsequent research using outcomes data from the measure. Therefore, FA should be a researcher’s best friend during the development and validation of a new measure or when adapting a measure to a new population. FA is also a useful companion when critiquing existing measures for application in research or assessment practice. However, despite the popularity of FA, when applied in medical education instrument development, factor analytic procedures do not always match best practice. 2 This editorial article is designed to help medical educators use FA appropriately.

The Applications of FA

The applications of FA depend on the purpose of the research. Generally speaking, there are two most important types of FA: Explorator Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA).

Exploratory Factor Analysis

Exploratory Factor Analysis (EFA) is widely used in medical education research in the early phases of instrument development, specifically for measures of latent variables that cannot be assessed directly. Typically, in EFA, the researcher, through a review of the literature and engagement with content experts, selects as many instrument items as necessary to fully represent the latent construct (e.g., professionalism). Then, using EFA, the researcher explores the results of factor loadings, along with other criteria (e.g., previous theory, Minimum average partial, 3 Parallel analysis, 4 conceptual meaningfulness, etc.) to refine the measure. Suppose an instrument consisting of 30 questions yields two factors - Factor 1 and Factor 2. A good definition of a factor as a theoretical construct is to look at its factor loadings. 5 The factor loading is the correlation between the item and the factor; a factor loading of more than 0.30 usually indicates a moderate correlation between the item and the factor. Most statistical software, such as SAS, SPSS and R, provide factor loadings. Upon review of the items loading on each factor, the researcher identifies two distinct constructs, with items loading on Factor 1 all related to professionalism, and items loading on Factor 2 related, instead, to leadership. Here, EFA helps the researcher build evidence based on internal structure by retaining only those items with appropriately high loadings on Factor 1 for professionalism, the construct of interest.

It is important to note that, often, Principal Component Analysis (PCA) is applied and described, in error, as exploratory factor analysis. 2 , 6 PCA is appropriate if the study primarily aims to reduce the number of original items in the intended instrument to a smaller set. 7 However, if the instrument is being designed to measure a latent construct, EFA, using Maximum Likelihood (ML) or Principal Axis Factoring (PAF), is the appropriate method. 7   These exploratory procedures statistically analyze the interrelationships between the instrument items and domains to uncover the unknown underlying factorial structure (dimensions) of the construct of interest. PCA, by design, seeks to explain total variance (i.e., specific and error variance) in the correlation matrix. The sum of the squared loadings on a factor matrix for a particular item indicates the proportion of variance for that given item that is explained by the factors. This is called the communality. The higher the communality value, the more the extracted factors explain the variance of the item. Further, the mean score for the sum of the squared factor loadings specifies the proportion of variance explained by each factor. For example, assume four items of an instrument have produced Factor 1, factor loadings of Factor 1 are 0.86, 0.75, 0.66 and 0.58, respectively. If you square the factor loading of items, you will get the percentage of the variance of that item which is explained by Factor 1. In this example, the first principal component (PC) for item1, item2, item3 and item4 is 74%, 56%, 43% and 33%, respectively. If you sum the squared factor loadings of Factor 1, you will get the eigenvalue, which is 2.1 and dividing the eigenvalue by four (2.1/4= 0.52) we will get the proportion of variance accounted for Factor 1, which is 52 %. Since PCA does not separate specific variance and error variance, it often inflates factor loadings and limits the potential for the factor structure to be generalized and applied with other samples in subsequent study. On the other hand, Maximum likelihood and Principal Axis Factoring extraction methods separate common and unique variance (specific and error variance), which overcomes the issue attached to PCA.  Thus, the proportion of variance explained by an extracted factor more precisely reflects the extent to which the latent construct is measured by the instrument items. This focus on shared variance among items explained by the underlying factor, particularly during instrument development, helps the researcher understand the extent to which a measure captures the intended construct. It is useful to mention that in PAF, the initial communalities are not set at 1s, but they are chosen based on the squared multiple correlation coefficient. Indeed, if you run a multiple regression to predict say  item1 (dependent variable)  from other items (independent variables) and then look at the R-squared (R2), you will see R2 is equal to the communalities of item1 derived from PAF.

Confirmatory Factor Analysis

When prior EFA studies are available for your intended instrument, Confirmatory Factor Analysis extends on those findings, allowing you to confirm or disconfirm the underlying factor structures, or dimensions, extracted in prior research. CFA is a theory or model-driven approach that tests how well the data “fit” to the proposed model or theory. CFA thus departs from EFA in that researchers must first identify a factor model before analysing the data. More fundamentally, CFA is a means for statistically testing the internal structure of instruments and relies on the maximum likelihood estimation (MLE) and a different set of standards for assessing the suitability of the construct of interest. 7 , 8

Factor analysts usually use the path diagram to show the theoretical and hypothesized relationships between items and the factors to create a hypothetical model to test using the ML method. In the path diagram, circles or ovals represent factors. A rectangle represents the instrument items. Lines (→ or ↔) represent relationships between items. No line, no relationship. A single-headed arrow shows the causal relationship (the variable that the arrowhead refers to is the dependent variable), and a double-headed shows a covariance between variables or factors.

If CFA indicates the primary factors, or first-order factors, produced by the prior PAF are correlated, then the second-order factors need to be modelled and estimated to get a greater understanding of the data. It should be noted if the prior EFA applied an orthogonal rotation to the factor solution, the factors produced would be uncorrelated. Hence, the analysis of the second-order factors is not possible. Generally, in social science research, most constructs assume inter-related factors, and therefore should apply an oblique rotation. The justification for analyzing the second-order factors is that when the correlations between the primary factors exist, CFA can then statistically model a broad picture of factors not captured by the primary factors (i.e., the first-order factors). 9   The analysis of the first-order factors is like surveying mountains with a zoom lens binoculars, while the analysis of the second-order factors uses a wide-angle lens. 10 Goodness of- fit- tests need to be conducted when evaluating the hypothetical model tested by CFA. The question is: does the new data fit the hypothetical model? However, the statistical models of the goodness of- fit- tests are complex, and extend beyond the scope of this editorial paper; thus,we strongly encourage the readers consult with factors analysts to receive resources and possible advise.

Conclusions

Factor analysis methods can be incredibly useful tools for researchers attempting to establish high quality measures of those constructs not directly observed and captured by observation. Specifically, the factor solution derived from an Exploratory Factor Analysis provides a snapshot of the statistical relationships of the key behaviors, attitudes, and dispositions of the construct of interest. This snapshot provides critical evidence for the validity of the measure based on the fit of the test content to the theoretical framework that underlies the construct. Further, the relationships between factors, which can be explored with EFA and confirmed with CFA, help researchers interpret the theoretical connections between underlying dimensions of a construct and even extending to relationships across constructs in a broader theoretical model. However, studies that do not apply recommended extraction, rotation, and interpretation in FA risk drawing faulty conclusions about the validity of a measure. As measures are picked up by other researchers and applied in experimental designs, or by practitioners as assessments in practice, application of measures with subpar evidence for validity produces a ripple effect across the field. It is incumbent on researchers to ensure best practices are applied or engage with methodologists to support and consult where there are gaps in knowledge of methods. Further, it remains important to also critically evaluate measures selected for research and practice, focusing on those that demonstrate alignment with best practice for FA and instrument development. 7 , 11

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Root out friction in every digital experience, super-charge conversion rates, and optimize digital self-service

Uncover insights from any interaction, deliver AI-powered agent coaching, and reduce cost to serve

Increase revenue and loyalty with real-time insights and recommendations delivered to teams on the ground

Know how your people feel and empower managers to improve employee engagement, productivity, and retention

Take action in the moments that matter most along the employee journey and drive bottom line growth

Whatever they’re are saying, wherever they’re saying it, know exactly what’s going on with your people

Get faster, richer insights with qual and quant tools that make powerful market research available to everyone

Run concept tests, pricing studies, prototyping + more with fast, powerful studies designed by UX research experts

Track your brand performance 24/7 and act quickly to respond to opportunities and challenges in your market

Explore the platform powering Experience Management

  • Free Account
  • For Digital
  • For Customer Care
  • For Human Resources
  • For Researchers
  • Financial Services
  • All Industries

Popular Use Cases

  • Customer Experience
  • Employee Experience
  • Employee Exit Interviews
  • Net Promoter Score
  • Voice of Customer
  • Customer Success Hub
  • Product Documentation
  • Training & Certification
  • XM Institute
  • Popular Resources
  • Customer Stories
  • Market Research
  • Artificial Intelligence
  • Partnerships
  • Marketplace

The annual gathering of the experience leaders at the world’s iconic brands building breakthrough business results, live in Salt Lake City.

  • English/AU & NZ
  • Español/Europa
  • Español/América Latina
  • Português Brasileiro
  • REQUEST DEMO
  • Experience Management
  • Survey Data Analysis & Reporting
  • Factor Analysis

Try Qualtrics for free

Factor analysis and how it simplifies research findings.

17 min read There are many forms of data analysis used to report on and study survey data. Factor analysis is best when used to simplify complex data sets with many variables.

What is factor analysis?

Factor analysis is the practice of condensing many variables into just a few, so that your research data is easier to work with.

For example, a retail business trying to understand customer buying behaviours might consider variables such as ‘did the product meet your expectations?’, ‘how would you rate the value for money?’ and ‘did you find the product easily?’. Factor analysis can help condense these variables into a single factor, such as ‘customer purchase satisfaction’.

customer purchase satisfaction tree

The theory is that there are deeper factors driving the underlying concepts in your data, and that you can uncover and work with them instead of dealing with the lower-level variables that cascade from them. Know that these deeper concepts aren’t necessarily immediately obvious – they might represent traits or tendencies that are hard to measure, such as extraversion or IQ.

Factor analysis is also sometimes called “dimension reduction”: you can reduce the “dimensions” of your data into one or more “super-variables,” also known as unobserved variables or latent variables. This process involves creating a factor model and often yields a factor matrix that organizes the relationship between observed variables and the factors they’re associated with.

As with any kind of process that simplifies complexity, there is a trade-off between the accuracy of the data and how easy it is to work with. With factor analysis, the best solution is the one that yields a simplification that represents the true nature of your data, with minimum loss of precision. This often means finding a balance between achieving the variance explained by the model and using fewer factors to keep the model simple.

Factor analysis isn’t a single technique, but a family of statistical methods that can be used to identify the latent factors driving observable variables. Factor analysis is commonly used in market research , as well as other disciplines like technology, medicine, sociology, field biology, education, psychology and many more.

What is a factor?

In the context of factor analysis, a factor is a hidden or underlying variable that we infer from a set of directly measurable variables.

Take ‘customer purchase satisfaction’ as an example again. This isn’t a variable you can directly ask a customer to rate, but it can be determined from the responses to correlated questions like ‘did the product meet your expectations?’, ‘how would you rate the value for money?’ and ‘did you find the product easily?’.

While not directly observable, factors are essential for providing a clearer, more streamlined understanding of data. They enable us to capture the essence of our data’s complexity, making it simpler and more manageable to work with, and without losing lots of information.

Free eBook: 2024 global market research trends report

Key concepts in factor analysis

These concepts are the foundational pillars that guide the application and interpretation of factor analysis.

Central to factor analysis, variance measures how much numerical values differ from the average. In factor analysis, you’re essentially trying to understand how underlying factors influence this variance among your variables. Some factors will explain more variance than others, meaning they more accurately represent the variables they consist of.

The eigenvalue expresses the amount of variance a factor explains. If a factor solution (unobserved or latent variables) has an eigenvalue of 1 or above, it indicates that a factor explains more variance than a single observed variable, which can be useful in reducing the number of variables in your analysis. Factors with eigenvalues less than 1 account for less variability than a single variable and are generally not included in the analysis.

Factor score

A factor score is a numeric representation that tells us how strongly each variable from the original data is related to a specific factor. Also called the component score, it can help determine which variables are most influenced by each factor and are most important for each underlying concept.

Factor loading

Factor loading is the correlation coefficient for the variable and factor. Like the factor score, factor loadings give an indication of how much of the variance in an observed variable can be explained by the factor. High factor loadings (close to 1 or -1) mean the factor strongly influences the variable.

When to use factor analysis

Factor analysis is a powerful tool when you want to simplify complex data, find hidden patterns, and set the stage for deeper, more focused analysis.

It’s typically used when you’re dealing with a large number of interconnected variables, and you want to understand the underlying structure or patterns within this data. It’s particularly useful when you suspect that these observed variables could be influenced by some hidden factors.

For example, consider a business that has collected extensive customer feedback through surveys. The survey covers a wide range of questions about product quality, pricing, customer service and more. This huge volume of data can be overwhelming, and this is where factor analysis comes in. It can help condense these numerous variables into a few meaningful factors, such as ‘product satisfaction’, ‘customer service experience’ and ‘value for money’.

Factor analysis doesn’t operate in isolation – it’s often used as a stepping stone for further analysis. For example, once you’ve identified key factors through factor analysis, you might then proceed to a cluster analysis – a method that groups your customers based on their responses to these factors. The result is a clearer understanding of different customer segments, which can then guide targeted marketing and product development strategies.

By combining factor analysis with other methodologies, you can not only make sense of your data but also gain valuable insights to drive your business decisions.

Factor analysis assumptions

Factor analysis relies on several assumptions for accurate results. Violating these assumptions may lead to factors that are hard to interpret or misleading.

Linear relationships between variables

This ensures that changes in the values of your variables are consistent.

Sufficient variables for each factor

Because if only a few variables represent a factor, it might not be identified accurately.

Adequate sample size

The larger the ratio of cases (respondents, for instance) to variables, the more reliable the analysis.

No perfect multicollinearity and singularity

No variable is a perfect linear combination of other variables, and no variable is a duplicate of another.

Relevance of the variables

There should be some correlation between variables to make a factor analysis feasible.

assumptions for factor analysis

Types of factor analysis

There are two main factor analysis methods: exploratory and confirmatory. Here’s how they are used to add value to your research process.

Confirmatory factor analysis

In this type of analysis, the researcher starts out with a hypothesis about their data that they are looking to prove or disprove. Factor analysis will confirm – or not – where the latent variables are and how much variance they account for.

Principal component analysis (PCA) is a popular form of confirmatory factor analysis. Using this method, the researcher will run the analysis to obtain multiple possible solutions that split their data among a number of factors. Items that load onto a single particular factor are more strongly related to one another and can be grouped together by the researcher using their conceptual knowledge or pre-existing research.

Using PCA will generate a range of solutions with different numbers of factors, from simplified 1-factor solutions to higher levels of complexity. However, the fewer number of factors employed, the less variance will be accounted for in the solution.

Exploratory factor analysis

As the name suggests, exploratory factor analysis is undertaken without a hypothesis in mind. It’s an investigatory process that helps researchers understand whether associations exist between the initial variables, and if so, where they lie and how they are grouped.

How to perform factor analysis: A step-by-step guide

Performing a factor analysis involves a series of steps, often facilitated by statistical software packages like SPSS, Stata and the R programming language . Here’s a simplified overview of the process.

how to perform factor analysis

Prepare your data

Start with a dataset where each row represents a case (for example, a survey respondent), and each column is a variable you’re interested in. Ensure your data meets the assumptions necessary for factor analysis.

Create an initial hypothesis

If you have a theory about the underlying factors and their relationships with your variables, make a note of this. This hypothesis can guide your analysis, but keep in mind that the beauty of factor analysis is its ability to uncover unexpected relationships.

Choose the type of factor analysis

The most common type is exploratory factor analysis, which is used when you’re not sure what to expect. If you have a specific hypothesis about the factors, you might use confirmatory factor analysis.

Form your correlation matrix

After you’ve chosen the type of factor analysis, you’ll need to create the correlation matrix of your variables. This matrix, which shows the correlation coefficients between each pair of variables, forms the basis for the extraction of factors. This is a key step in building your factor analysis model.

Decide on the extraction method

Principal component analysis is the most commonly used extraction method. If you believe your factors are correlated, you might opt for principal axis factoring, a type of factor analysis that identifies factors based on shared variance.

Determine the number of factors

Various criteria can be used here, such as Kaiser’s criterion (eigenvalues greater than 1), the scree plot method or parallel analysis. The choice depends on your data and your goals.

Interpret and validate your results

Each factor will be associated with a set of your original variables, so label each factor based on how you interpret these associations. These labels should represent the underlying concept that ties the associated variables together.

Validation can be done through a variety of methods, like splitting your data in half and checking if both halves produce the same factors.

How factor analysis can help you

As well as giving you fewer variables to navigate, factor analysis can help you understand grouping and clustering in your input variables, since they’ll be grouped according to the latent variables.

Say you ask several questions all designed to explore different, but closely related, aspects of customer satisfaction:

  • How satisfied are you with our product?
  • Would you recommend our product to a friend or family member?
  • How likely are you to purchase our product in the future?

But you only want one variable to represent a customer satisfaction score. One option would be to average the three question responses. Another option would be to create a factor dependent variable. This can be done by running a principal component analysis (PCA) and keeping the first principal component (also known as a factor). The advantage of a PCA over an average is that it automatically weights each of the variables in the calculation.

Say you have a list of questions and you don’t know exactly which responses will move together and which will move differently; for example, purchase barriers of potential customers. The following are possible barriers to purchase:

  • Price is prohibitive
  • Overall implementation costs
  • We can’t reach a consensus in our organization
  • Product is not consistent with our business strategy
  • I need to develop an ROI, but cannot or have not
  • We are locked into a contract with another product
  • The product benefits don’t outweigh the cost
  • We have no reason to switch
  • Our IT department cannot support your product
  • We do not have sufficient technical resources
  • Your product does not have a feature we require
  • Other (please specify)

Factor analysis can uncover the trends of how these questions will move together. The following are loadings for 3 factors for each of the variables.

factor analysis data

Notice how each of the principal components have high weights for a subset of the variables. Weight is used interchangeably with loading, and high weight indicates the variables that are most influential for each principal component. +0.30 is generally considered to be a heavy weight.

The first component displays heavy weights for variables related to cost, the second weights variables related to IT, and the third weights variables related to organizational factors. We can give our new super variables clever names.

factor analysis data 2

If we were to cluster the customers based on these three components, we can see some trends. Customers tend to be high in cost barriers or organizational barriers, but not both.

The red dots represent respondents who indicated they had higher organizational barriers; the green dots represent respondents who indicated they had higher cost barriers

factor analysis graph

Considerations when using factor analysis

Factor analysis is a tool, and like any tool its effectiveness depends on how you use it. When employing factor analysis, it’s essential to keep a few key considerations in mind.

Oversimplification

While factor analysis is great for simplifying complex data sets, there’s a risk of oversimplification when grouping variables into factors. To avoid this you should ensure the reduced factors still accurately represent the complexities of your variables.

Subjectivity

Interpreting the factors can sometimes be subjective, and requires a good understanding of the variables and the context. Be mindful that multiple analysts may come up with different names for the same factor.

Supplementary techniques

Factor analysis is often just the first step. Consider how it fits into your broader research strategy and which other techniques you’ll use alongside it.

Examples of factor analysis studies

Factor analysis, including PCA, is often used in tandem with segmentation studies. It might be an intermediary step to reduce variables before using KMeans to make the segments.

Factor analysis provides simplicity after reducing variables. For long studies with large blocks of Matrix Likert scale questions, the number of variables can become unwieldy. Simplifying the data using factor analysis helps analysts focus and clarify the results, while also reducing the number of dimensions they’re clustering on.

Sample questions for factor analysis

Choosing exactly which questions to perform factor analysis on is both an art and a science. Choosing which variables to reduce takes some experimentation, patience and creativity. Factor analysis works well on Likert scale questions and Sum to 100 question types.

Factor analysis works well on matrix blocks of the following question genres:

Psychographics (Agree/Disagree):

  • I value family
  • I believe brand represents value

Behavioral (Agree/Disagree):

  • I purchase the cheapest option
  • I am a bargain shopper

Attitudinal (Agree/Disagree):

  • The economy is not improving
  • I am pleased with the product

Activity-Based (Agree/Disagree):

  • I love sports
  • I sometimes shop online during work hours

Behavioral and psychographic questions are especially suited for factor analysis.

Sample output reports

Factor analysis simply produces weights (called loadings) for each respondent. These loadings can be used like other responses in the survey.

Related resources

Analysis & Reporting

Margin of error 11 min read

Data saturation in qualitative research 8 min read, thematic analysis 11 min read, behavioral analytics 12 min read, statistical significance calculator: tool & complete guide 18 min read, regression analysis 19 min read, data analysis 31 min read, request demo.

Ready to learn more about Qualtrics?

  • Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar

Institute for Digital Research and Education

A Practical Introduction to Factor Analysis

Factor analysis is a method for modeling observed variables and their covariance structure in terms of unobserved variables (i.e., factors). There are two types of factor analyses, exploratory and confirmatory. Exploratory factor analysis (EFA) is method to explore the underlying structure of a set of observed variables, and is a crucial step in the scale development process. The first step in EFA is factor extraction. During this seminar, we will discuss how principal components analysis and common factor analysis differ in their approach to variance partitioning. Common factor analysis models can be estimated using various estimation methods such as principal axis factoring and maximum likelihood, and we will compare the practical differences between these two methods. After extracting the best factor structure, we can obtain a more interpretable factor solution through factor rotation. Here is where we will discuss the difference between orthogonal and oblique rotations, and finally how to the final solution to generate a factor score. For the latter portion of the seminar we will introduce confirmatory factor analysis (CFA), which is a method to verify a factor structure that has already been defined. Topics to discuss include identification, model fit, and degrees of freedom demonstrated through a three-item, two-item and eight-item one factor CFA and a two-factor CFA. SPSS will be used for the EFA portion of the seminar and R (lavaan) will be used for the CFA portion. You can click on the main link to access each portion of the seminar.

I. Exploratory Factor Analysis (EFA)

  • Motivating example: The SAQ
  • Pearson correlation formula
  • Partitioning the variance in factor analysis
  • principal components analysis
  • principal axis factoring
  • maximum likelihood
  • Simple Structure
  • Orthogonal rotation (Varimax)
  • Oblique (Direct Oblimin)
  • Generating factor scores

II. Confirmatory Factor Analysis (CFA)

  • Motivating example SPSS Anxiety Questionairre
  • The factor analysis model
  • The model-implied covariance matrix
  • The path diagram
  • Known values, parameters, and degrees of freedom
  • Three-item (one) factor analysis
  • Identification of a three-item one factor CFA
  • Running a one-factor CFA in lavaan
  • (Optional) How to manually obtain the standardized solution
  • (Optional) Degrees of freedom with means
  • One factor CFA with two items
  • One factor CFA with more than three items (SAQ-8)
  • Model chi-square
  • A note on sample size
  • (Optional) Model test of the baseline or null model
  • Incremental versus absolute fit index
  • CFI (Confirmatory Factor Index)
  • TLI (Tucker Lewis Index)
  • Uncorrelated factors
  • Correlated factors
  • Second-Order CFA
  • (Optional) Warning message with second-order CFA
  • (Optional) Obtaining the parameter table

DiStefano, C., Zhu, M., & Mindrila, D. (2009). Understanding and using factor scores: Considerations for the applied researcher. Practical Assessment, Research & Evaluation , 14 (20), 2.

Field, A. (2009). Discovering statistics using SPSS . Sage publications.

Geiser, C. (2012). Data analysis with Mplus . Guilford Press.

Lawley, D. N., Maxwell, A. E. (1971). Factor Analysis as a Statistical Method.  Second Edition. American Elsevier Publishing Company, Inc. New York.

Pett, M. A., Lackey, N. R., & Sullivan, J. J. (2003). Making sense of factor analysis: The use of factor analysis for instrument development in health care research . Sage.

Your Name (required)

Your Email (must be a valid email for us to receive the report!)

Comment/Error Report (required)

How to cite this page

  • © 2021 UC REGENTS
  • Bipolar Disorder
  • Therapy Center
  • When To See a Therapist
  • Types of Therapy
  • Best Online Therapy
  • Best Couples Therapy
  • Best Family Therapy
  • Managing Stress
  • Sleep and Dreaming
  • Understanding Emotions
  • Self-Improvement
  • Healthy Relationships
  • Student Resources
  • Personality Types
  • Verywell Mind Insights
  • 2023 Verywell Mind 25
  • Mental Health in the Classroom
  • Editorial Process
  • Meet Our Review Board
  • Crisis Support

Understanding Factor Analysis in Psychology

John Loeppky is a freelance journalist based in Regina, Saskatchewan, Canada, who has written about disability and health for outlets of all kinds.

types of factor analysis in research methodology

Steven Gans, MD is board-certified in psychiatry and is an active supervisor, teacher, and mentor at Massachusetts General Hospital.

types of factor analysis in research methodology

Skynesher / Getty Images

What Is Factor Analysis and What Does It Do?

Types of factor analysis, advantages and disadvantages of factor analysis, how is factor analysis used in psychology.

Like many methods encountered by those studying psychology , factor analysis has a long history.

The primary goal of factor analysis is to distill a large data set into a working set of connections or factors.

It was originally discussed by British psychologist Charles Spearman in the early 20th century and has gone on to be used in not only psychology but in other fields that often rely on statistical analyses,

But what is it, what are some real-world examples, and what are the different types? In this article, we'll answer all of those questions.

The primary goal of factor analysis is to distill a large data set into a working set of connections or factors. Dr. Jessie Borelli, PhD , who works at the University of California-Irvine, uses factor analysis in her work on attachment.

She is doing research that looks into how people perceive relationships and how they connect to one another. She gives the example of providing a hypothetical questionnaire with 100 items on it and using factor analysis to drill deeper into the data. "So, rather than looking at each individual item on its own I'd rather say, 'Is there is there any way in which these items kind of cluster together or go together so that I can... create units of analysis that are bigger than the individual items."

Factor analysis is looking to identify patterns where it is assumed that there are already connections between areas of the data.

An Example Where Factor Analysis Is Useful

One common example of a factor analysis is when you are taking something not easily quantifiable, like socio-economic status , and using it to group together highly correlated variables like income level and types of jobs.

Factor analysis isn't just used in psychology but also deployed in fields like sociology, business, and technology sector fields like machine learning.

There are two types of factor analysis that are most commonly referred to: exploratory factor analysis and confirmatory factor analysis.

Here are the two types of factor analysis:

  • Exploratory analysis : The goal of this analysis is to find general patterns in a set of data points.
  • Confirmatory factor analysis : The goal of this analysis is to test various hypothesized relationships among certain variables.

Exploratory Analysis

In an exploratory analysis, you are being a little bit more open-minded as a researcher because you are using this type of analysis to provide some clarity in your data set that you haven't yet found. It's an approach that Borelli uses in her own research.

Confirmatory Factor Analysis

On the other hand, if you're using a confirmatory factor analysis you are using the assumptions or theoretical findings you have already identified to drive your statistical model.

Unlike in an exploratory factor analysis, where the relationships between factors and variables are more open, a confirmatory factor analysis requires you to select which variables you are testing for. In Borelli's words:

"When you do a confirmatory factor analysis, you kind of tell your analytic program what you think the data should look like, in terms of, 'I think it should have these two factors and this is the way I think it should look.'"

Let's take a look at the advantages and disadvantages of factor analysis.

A main advantage of a factor analysis is that it allows researchers to reduce a number of variables by combining them into a single factor.

You Can Analyze Fewer Data Points

When answering your research questions, it's a lot easier to be working with three variables than thirty, for example.

Disadvantages

Disadvantages include that the factor analysis relies on the quality of the data, and also may allow for different interpretations of the data. For example, during one study, Borelli found that after deploying a factor analysis, she was still left with results that didn't connect well with what had been found in hundreds of other studies .

Due to the nature of the sample being new and being more culturally diverse than others being explored, she used an exploratory factor analysis that left her with more questions than answers.

The goal of factor analysis in psychology is often to make connections that allow researchers to develop models with common factors in ways that might be hard or impossible to observe otherwise.

So, for example, intelligence is a difficult concept to directly observe. However, it can be inferred from factors that we can directly measure on specific tests.

Factor analysis has often been used in the field of psychology to help us better understand the structure of personality.

This is due to the multitude of factors researchers have to consider when it comes to understanding the concept of personality. This area of personality research is certainly not new, with easily findable research dating as far back as 1942 recognizing its power in personality research.

Britannica. Charles E. Spearman .

United State Environmental Protection Agency. Exploratory Data Analysis .

Flora DB, Curran PJ. An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data .  Psychol Methods . 2004;9(4):466-491. doi:10.1037/1082-989X.9.4.466

Wolfle D. Factor analysis in the study of personality .  The Journal of Abnormal and Social Psychology. 1942;37(3):393–397.

By John Loeppky John Loeppky is a freelance journalist based in Regina, Saskatchewan, Canada, who has written about disability and health for outlets of all kinds.

types of factor analysis in research methodology

Secure Your Spot in Our PCA Online Course Starting on April 02 (Click for More Info)

Joachim Schork Image Course

Intoduction to Factor Analysis

In this tutorial, I’ll introduce to you the multivariate statistical method , factor analysis (FA), used for uncovering the underlying relationships between a set of observed variables. It is a widely utilized technique in social and behavioral science for identifying factors such as intelligence, socio-economic status, or personality traits.

The table of contents is structured as follows:

Let’s dive into it!

What is Factor Analysis?

Factor analysis (FA) is a technique used to identify the underlying structure of the data in terms of a smaller set of *unobserved factors ( latents ). These factors are linear combinations of the *observed variables and help to explain the correlations among them.

This linear relation can be formalized for the observed variable x as x = λx * ξ + δx , where λx refers to factor loadings associated with x , ξ refers to underlying factors, and δx refers to error associated with x . This error is the portion of the variable that can’t be explained by the underlying factors.

The factor analysis model can be visualized in a diagram, where the direction of the causal effect is indicated by arrows. See the figure below.

Factor Analysis Model Diagram

As with all statistical techniques, it relies on some statistical assumptions . Let’s take a look at them!

Assumptions of FA

It is important that your data meets the following assumptions to obtain reliable results in FA.

  • Sufficient Sample Size : Generally, a larger sample size yields more reliable results.
  • Linearity: Assumes that the relationship between observed variables and factors is linear.
  • Adequacy of Correlations: There should be some correlations between the observed variables, or FA will be ineffective.

Let’s see next what the main FA types and their uses are!

Main Types of FA

In the realm of FA, two prominent approaches stand out: Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA).

  • Exploratory Factor Analysis (EFA) : EFA is used when the researcher does not have a specific idea of the underlying structure of data. It is employed to explore the possible underlying factor structure without imposing any preconceived structure on the outcome.
  • Confirmatory Factor Analysis (CFA) : CFA, on the other hand, is used when the researcher has a specific idea of the underlying structure based on theory or previous studies. In CFA, the researcher tests a hypothesized model to see how well it fits the data.

The choice between these two methods should be guided by the specific needs of your study or the research question, allowing for either an exploration of underlying patterns or a focused test of predefined hypotheses.

So far, we have had an overview of the FA concept. Next, we will explore the steps of conducting an FA.

Steps of FA

  • Gather and Prepare Your Data: Collect your dataset and ensure it is cleaned and preprocessed.
  • Choose the Number of Factors: Use criteria to decide how many factors to retain, e.g., scree plot .
  • Estimate the Factor Model: Use software (like R , SPSS , or Python ‘s scikit-learn) to perform FA. The software will estimate the factor loadings (associations between variables and factors), factor variances (variable variance explained by the factors), and unique variances (variable variance not explained by the factors).
  • Interpret the Factors: In EFA, examine the factor loadings to interpret what each factor represents; in CFA, check if the variables are associated with the factors as hypothesized. Generally, high loadings (positive or negative) of a variable on a factor indicate strong associations.
  • Rotate the Factors (if necessary, in EFA): Rotate results to make the interpretation of factors clearer by simplifying the structure of loadings.
  • Confirm the Solution (if necessary, in CFA): Confirm whether the data fits the model well using various fit indices .
  • Use the Factors: The factors can be used in further analyses as summary variables or to test hypotheses about the relationships between the factors and other variables.
  • Validate Your Model: It’s essential to validate your factor structure on a different sample to ensure that it is generalizable.

Examples of FA

To learn some more specific information about EFA and CFA, you are welcome to visit our tutorials, Exploratory Factor Analysis and Confirmatory Factor Analysis . However, FA is not limited to these two. There are various specialized or extended types of FA, e.g., Principal Component Analysis , Canonical Factor Analysis .

In this tutorial, we didn’t get into mathematical details, I suggest you check Brown’s book Confirmatory Factor Analysis for Applied Research (2006) for a deeper understanding.

*Observed variables can also be referred to as manifest variables, indicators, and endogenous variables, whereas latent variables can be referred to as factors, constructs, unobserved/underlying variables, and exogenous variables in the context of FA.

Video, Further Resources & Summary

This tutorial is introductory and simplifies many complex aspects of FA. For deeper study, it is recommended to consult more analysis-specific tutorials.

Do you need more explanations on what the factor analysis is? Then you might check out the following video of the Statistics Globe YouTube channel.

In the video tutorial, we introduce factor analysis as a general concept.

The YouTube video will be added soon.

Furthermore, you could have a look at some of the other tutorials on Statistics Globe:

  • What is a Principal Component Analysis (PCA)?
  • Scree Plot for PCA Explained
  • Exploratory Factor Analysis
  • Confirmatory Factor Analysis

This article has explained the factor analysis concept. If you have further questions, you may leave a comment below.

Rana Cansu Kebabci Statistician & Data Scientist

This page was created in collaboration with Cansu Kebabci. You might have a look at Cansu’s author page to get more information about academic background and the other articles she has written for Statistics Globe.

Subscribe to the Statistics Globe Newsletter

Get regular updates on the latest tutorials, offers & news at Statistics Globe. I hate spam & you may opt out anytime: Privacy Policy .

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Post Comment

Joachim Schork Statistician Programmer

I’m Joachim Schork. On this website, I provide statistics tutorials as well as code in Python and R programming.

Statistics Globe Newsletter

Get regular updates on the latest tutorials, offers & news at Statistics Globe. I hate spam & you may opt out anytime: Privacy Policy .

PCA Course

Related Tutorials

How to Improve DALL-E Prompts (Tips & Examples)

How to Improve DALL-E Prompts (Tips & Examples)

Autoplot of PCA in R (Example)

Autoplot of PCA in R (Example)

  • FREE ACCOUNT

3 Types of Factor Analysis

Factor analysis is a statistical technique used to identify the underlying structure of a set of variables.

types of factor analysis in research methodology

Visualize Surveys + Shop Market Research 

Access, analyze, and visualize the leading market research at mTab.

about mtab survey analysis software and services

Market research and analysis of large volumes of data are necessary when it comes to analyzing and determining the right market segment, potential demand, and potential areas of competition, product development requirements and all other facets of the business marketing portfolio. One of the most common tools used to deal with the vast amounts of data is Factor Analysis.

What is Factor Analysis?

Factor analysi s is a statistical technique used to identify the underlying structure of a set of variables. In layman’s terms, it is used to analyze the relationship between two observable variables and how it is affected by another smaller set of unobservable variables. For example, factor analysis can be used in market segmentation to identify the underlying variables according to which customers can be grouped.

Uses of factor analysis in market research and analysis

Factor analysis has proved to be very beneficial in market research and analysis of variables that determine consumer behavior:

  • It helps to make sense of large data with interlinked relationships
  • It may point out relationships that may not have been obvious
  • It can point out to the underlying relationships with respect to consumer tastes, preferences, etc.
  • It is easier to condense and correlate data through factor analysis and also to draw conclusions from the data gathered in market research and analysis.
  • It can be used to form empirical clusters of variables and underlying factors that affect them

Types of factor analysis

A factor analysis is mainly used for interpretation of data and in analyzing the underlying relationships between variable and other underlying factors that may determine consumer behavior. Instead of grouping responses and response types, factor analysis segregates the variable and groups these according to their co relevance.There are mainly three types of factor analysis that are used for different kinds of market research and analysis.

  • Exploratory factor analysis
  • Confirmatory factor analysis
  • Structural equation modeling

Exploratory factor analysis is used to measure the underlying factors that affect the variables in a data structure without setting any predefined structure to the outcome. Confirmatory factor analysis on the other hand is used as tool in market research and analysis to reconfirm the effects and correlation of an existing set of predetermined factors and variables that affect these factors. Structural equation modeling hypothesizes a relationship between a set of variables and factors and tests these casual relationships on the linear equation model.  Structural equation modeling can be used for exploratory and confirmatory modeling alike, and hence it can be used for confirming results as well as testing hypotheses.

Factor analysis will only yield accurate and useful results if done by a researcher who has adequate knowledge to select data and assign attributes. Selecting factors and variables so as to avoid too much similarity of characteristics is also important. Factor analysis, if done correctly, can allow for market research and analysis that helps in various areas of decision making like product features, product development, pricing, market segmentation, penetration and even with targeting.

Applications of Factor Analysis

Factor analysis has several applications in different fields, including:

  • Market Research: it is widely used in market research to identify the underlying factors that influence customer preferences and behavior. For example, a market research study may use factor analysis to identify the key factors that influence consumers' purchasing decisions.
  • Psychology: it is used in psychology to identify the underlying dimensions of personality traits, intelligence, and cognitive abilities. For example, a factor analysis of a personality test may identify factors such as extraversion, neuroticism, and openness to experience.
  • Education: it is used in education to identify the underlying factors that contribute to academic achievement. For example, a factor analysis of academic test scores may identify factors such as verbal reasoning, mathematical ability, and spatial ability.

Factor analysis is a powerful statistical technique that can help in identifying the underlying factors that contribute to the patterns of the data. By reducing large amounts of data into a smaller number of factors, it can simplify the data and help in making meaningful conclusions. Whether you're conducting market research, psychological studies, or educational research, factor analysis can be a valuable tool for understanding the relationships between variables and identifying the underlying factors that explain the patterns in the data.

 alt=

Recent articles

types of factor analysis in research methodology

mTab and Kayne Anderson Partner to Further Innovate Insights

types of factor analysis in research methodology

The Key to Streaming Show Loyalty is Audience Insights

types of factor analysis in research methodology

Which QSR Tops America’s “Fast Food Favorites” Ranking?

  • Factor Analysis

Factor analysis is a technique in mathematics that we use to reduce a larger number into a smaller number. Moreover, in this topic, we will talk about it and its various aspects.

What is Factor Analysis?

It refers to a method that reduces a large variable into a smaller variable factor. Furthermore, this technique takes out maximum ordinary variance from all the variables and put them in common score.

Moreover, it is a part of General Linear Model (GLM) and it believes several theories that contain no multicollinearity, linear relationship, true correlation , and relevant variables into the analysis among factors and variables.

factor analysis

Types of Factor Analysis

There are different methods that we use in factor analysis from the data set:

1. Principal component analysis

It is the most common method which the researchers use. Also, it extracts the maximum variance and put them into the first factor. Subsequently, it removes the variance explained by the first factor and extracts the second factor. Moreover, it goes on until the last factor.

2. Common Factor Analysis

It’s the second most favoured technique by researchers. Also, it extracts common variance and put them into factors . Furthermore, this technique doesn’t include the variance of all variables and is used in SEM.

3. Image Factoring

It is on the basis of the correlation matrix and makes use of OLS regression technique in order to predict the factor in image factoring.

4. Maximum likelihood method

It also works on the correlation matrix but uses a maximum likelihood method to factor.

5. Other methods of factor analysis

Alfa factoring outweighs least squares. Weight square is another regression-based method that we use for factoring.

Factor loading- Basically it the correlation coefficient for the factors and variables. Also, it explains the variable on a particular factor shown by variance.

Eigenvalues- Characteristics roots are its other name. Moreover, it explains the variance shown by that particular factor out of the total variance. Furthermore, commonality column helps to know how much variance the first factor explained out of total variance.

Factor Score- It’s another name is the component score. Besides, it’s the score of all rows and columns that we can use as an index for all variables and for further analysis. Moreover, we can standardize it by multiplying it with a common term.

Rotation method- This method makes it more reliable to understand the output. Also, it affects the eigenvalues method but the eigenvalues method doesn’t affect it. Besides, there are 5 rotation methods: (1) No Rotation Method, (2) Varimax Rotation Method, (3) Quartimax Rotation Method, (4) Direct Oblimin Rotation Method, and (5) Promax Rotation Method.

Assumptions of Factor Analysis

Factor analysis has several assumptions. These include:

  • There are no outliers in the data.
  • The sample size is supposed to be greater than the factor.
  • It is an interdependency method so there should be no perfect multicollinearity between the variables.
  • Factor analysis is a linear function thus it doesn’t require homoscedasticity between variables.
  • It is also based on the linearity assumption. So, we can also use non-linear variables. However, after a transfer, they change into a linear variable.
  • Moreover, it assumes interval data.

Key Concepts of Factor Analysis

It includes the following key concept:

Exploratory factor analysis- It assumes that any variable or indicator can be associated with any factor. Moreover, it is the most common method used by researchers. Furthermore, it isn’t based on any prior theory.

Confirmatory Factor Analysis- It is used to determine the factors loading and factors of measured variables, and to confirm what it expects on the basis of pre-established assumption. Besides, it uses two approaches:

  • The Traditional Method
  • The SEM Approach

Solved Question for You

Question. How many types of Factor analysis are there?

A. 5 B. 6 C. 4 D. 3

Answer . The correct answer is option A.

Customize your course in 30 seconds

Which class are you in.

tutor

Factorisation

  • What is Exponential Form – How to Write Numbers in Exponential Form?
  • Associative and Commutative Property of Addition and Multiplication
  • Factorisation Using Identities
  • Factorisation Using Division

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Download the App

Google Play

Best practices for your confirmatory factor analysis: A JASP and lavaan tutorial

  • Original Manuscript
  • Published: 13 March 2024

Cite this article

  • Pablo Rogers   ORCID: orcid.org/0000-0002-0093-3834 1  

189 Accesses

1 Altmetric

Explore all metrics

Confirmatory factor analysis (CFA) is a fundamental method for evaluating the internal structural validity of measurement instruments. In most CFA applications, the measurement model serves as a means to an end rather than an end in itself. To select the appropriate model, prior validity evidence is crucial, and items are typically assessed on an ordinal scale, which has been used in the applied social sciences. However, textbooks on structural equation modeling (SEM) often overlook this specific case, focusing on applications estimable using maximum likelihood (ML) instead. Unfortunately, several popular commercial SEM software packages lack suitable solutions for handling this ‘typical CFA’, leading to confusion and suboptimal decision-making when conducting CFA in this context. This article conceptually contributes to this ongoing discussion by presenting a set of guidelines for conducting a typical CFA, drawing from recent empirical research. We provide a practical contribution by introducing and developing a tutorial example within the JASP and lavaan  software platforms. Supplementary materials such as videos, files, and scripts are freely available.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price includes VAT (Russian Federation)

Instant access to the full article PDF.

Rent this article via DeepDyve

Institutional subscriptions

types of factor analysis in research methodology

AERA, APA, & NCME. (2014). Standards for Educational and Psychological Te sting . American Educational Research Association, American Psychological Association & National Council on Measurement in Education.

Agawin , M. (2020). Perceived Ease of Use, Perceived Usefulness, and Attitude towards Jeffrey's Amazing Statistics Program (JASP) among Students of a Private Higher Educational Institution in Region IV-A. Laguna Journal of Multidisciplinary Research, 4 (1), 1–14. Center for Research, Publication, and Intellectual Property. Retrieved from: https://lpulaguna.edu.ph/journal-disciplinary-research/ . Accessed 4 Mar 2024.

Bandalos, D. L. (2014). Relative Performance of Categorical Diagonally Weighted Least Squares and Robust Maximum Likelihood Estimation. Structural Equation Modeling, 21 (1), 102–116. https://doi.org/10.1080/10705511.2014.859510

Article   MathSciNet   Google Scholar  

Bandalos, D. L. (2018). Measurement theory and applications for the social sciences . Guilford Press.

Google Scholar  

Beauducel, A., & Herzberg, P. Y. (2006). On the performance of maximum likelihood versus means and variance adjusted weighted least squares estimation in CFA. Structural Equation Modeling, 13 (2), 186–203. https://doi.org/10.1207/s15328007sem1302_2

Beaujean, A. A. (2014). Latent Variable Modeling Using R: A Step-by-Step Guide . Routledge.

Book   Google Scholar  

Bell, S.M., Chalmers, R.P., & Flora, D.B. (2023). The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability. Educational and Psychological Measurement, 1–36. https://doi.org/10.1177/00131644231155804

Bido, D. S., Mantovani, D. M. N., & Cohen, E. D. (2018). Destruction of measurement scale through exploratory factor analysis in production and operations research. Management & Production, 25 (2), 384–397. https://doi.org/10.1590/0104-530X3391-16

Article   Google Scholar  

Brown, T. A. (2015). Confirmatory Factor Analysis for Applied Research (2nd ed.). The Guilford Press.

Brown, T. A. (2023). Confirmatory Factor Analysis. In R. H. Hoyle (Ed.), Handbook of Structural Equation Modeling (2nd ed.). UK: The Guilford Press.

Byrne, B. M. (2016). Structural Equation Modeling with AMOS: Basic Concepts, Application, and Programming (3rd ed.). Routledge Taylor & Francis Group.

Campayo, F. S., Sancho-Esper, F., Rodríguez-Sánchez, C., Ostrovskaya, L., Martínez, N. J., Romero-Ortiz, A., Fernández-Díaz, F. M., Mora, C. F., Bruno, J. M., & Casado, L. A. A. (2022). Approaching it classroom technology : implementation and use of free software (JASP) in quantitative market research. In R. C. Satorre (Ed.), Memories of the Research Networks Program en teaching university (pp. 1141–1169). Universitat d’Alacant http://rua.ua.es/dspace/handle/10045/130544 . Accessed 4 Mar 2024.

Chinchu, C. (2022). Free and (Mostly) Open source data analysis software for academic research . OSF Preprint . https://doi.org/10.31234/osf.io/9pnkw

Cho, E. (2022). Reliability and Omega Hierarchical in Multidimensional Data: A Comparison of Various Estimators. Psychological Methods. https://doi.org/10.1037/met0000525

Collier, J. E. (2020). Applied Structural Equation Modeling Using AMOS: Basic to Advanced Techniques . Routledge.

Crede, M., & Harms, P. (2019). Questionable research practices when using confirmatory factor analysis. Journal of Managerial Psychology, 34 (1), 18–30. https://doi.org/10.1108/JMP-06-2018-0272

Davvetas, V., Diamantopoulos, A., Zaefarian, G., & Sichtmann, C. (2020). Ten basic questions about structural equations modeling you should know the answers to – But perhaps you don’t. Industrial Marketing Management, 90 , 252–263. https://doi.org/10.1016/j.indmarman.2020.07.016

DiStefano, C., & Morgan, G. B. (2014). A Comparison of Diagonal Weighted Least Squares Robust Estimation Techniques for Ordinal Data. Structural Equation Modeling, 21 (3), 425–438. https://doi.org/10.1080/10705511.2014.915373

Dunn, T. J., Baguley, T., & Brunsden, V. (2014). From alpha to omega: A practical solution to the pervasive problem of internal consistency estimation. British Journal of Psychology, 105 (3), 399–412. https://doi.org/10.1111/bjop.12046

Article   PubMed   Google Scholar  

Feng, Y., & Hancock, G. R. (2023). Power Analysis within a Structural Equation Modeling Framework. In R. H. Hoyle (Ed.), Handbook of Structural Equation Modeling (2nd ed.). The Guilford Press.

Fiates, G. G. S., Serra, F. A. R., & Martins, C. (2014). The aptitude of Brazilian researchers belonging to stricto sensu postgraduate programs in Administration for quantitative research. Revista de Administração, 49 (2), 384–398. https://doi.org/10.5700/rausp1153

Finch, W. H., & French, B. F. (2015). Latent Variable Modeling with R . Routledge.

Flake, J. K., & Fried, E. I. (2020). Measurement Schmeasurement : Questionable Measurement Practices and How to Avoid Them. Advances in Methods and Practices in Psychological Science, 3 (4), 456–465. https://doi.org/10.1177/2515245920952393

Flake, J. K., Pek, J., & Hehman, E. (2017). Construct validation in social and personality research: Current practices and recommendations. Social Psychological and Personality Science, 8 (4), 370–378. https://doi.org/10.1177/1948550617693063

Flake, J. K., Davidson, I. J., Wong, O., & Pek, J. (2022). Construct validity and the validity of replication studies: A systematic review. American Psychologist, 77 (4), 576–588. https://doi.org/10.1037/amp0001006

Flora, D. B. (2020). Your coefficient alpha Is probably wrong, but which coefficient omega Is right? A tutorial on using R to obtain better reliability estimates. Advances in Methods and Practices in Psychological Science, 3 (4), 484–501. https://doi.org/10.1177/2515245920951747

Flora, D. B., & Curran, P. J. (2004). An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. Psychological Methods, 9 (4), 466–491. https://doi.org/10.1037/1082-989X.9.4.466

Article   PubMed   PubMed Central   Google Scholar  

Flora, D. B., & Flake, J. K. (2017). The purpose and practice of exploratory and confirmatory factor analysis in psychological research: Decisions for scale development and validation. Canadian Journal of Behavioral Science, 49 (2), 78–88. https://doi.org/10.1037/cbs0000069

Forero, C. G., Maydeu-Olivares, A., & Gallardo-Pujol, D. (2009). Factor analysis with ordinal indicators: A Monte Carlo Study Comparing DWLS and ULS Estimation. Structural Equation Modeling: A Multidisciplinary Journal, 16 (4), 625–641. https://doi.org/10.1080/10705510903203573

Furr, M. R. (2021). Psychometrics: An Introduction (4th ed.). SAGE Publications.

Gana, K., & Broc, G. (2019). Structural Equation Modeling with lavaan . John Wiley & Sons Inc.

Goodboy, A. K., & Martin, M. M. (2020). Omega over alpha for reliability estimation of unidimensional communication measures. Annals of the International Communication Association, 44 (4), 422–439. https://doi.org/10.1080/23808985.2020.1846135

Goss-Sampson, M. (2022). Statistical Analysis in JASP: A Guide for Students (5th ed.). University of Greenwich. Retrieved from:  https://jasp-stats.org/jasp-materials/ . Accessed 4 Mar 2024.

Green, S. B., & Yang, Y. (2015). Evaluation of Dimensionality in the Assessment of Internal Consistency Reliability: Coefficient Alpha and Omega Coefficients. Educational Measurement: Issues and Practice, 34 (4), 14–20. https://doi.org/10.1111/emip.12100

Groskurth , K., Bluemke, M., & Lechner, C. M. (2023). Why we need to abandon fixed cutoffs for goodness-of-fit indices: An extensive simulation and possible solutions. Behavior Research Methods. https://doi.org/10.3758/s13428-023-02193-3

Hair, J. F., Sarstedt, M., Ringle, C., & Gudergan, S. P. (2017). Advanced Issues in Partial Least Squares Structural Equation Modeling (2nd ed.). SAGE Publications Inc.

Hair, J., Black, W., Babin, B., & Anderson, R. (2019). Multivariate Data Analysis (8th ed.). Cengage Learning.

Hair, J. F., Hult, T. M. G., Ringle, C. M., & Sarstedt, M. (2022). A Primer on Partial Least Squares Structural Equation Modeling (PLS-SEM) (3rd ed.). Sage Publications.

Harrington, D. (2009). Confirmatory Factor Analysis . Oxford University Press.

Hayes, A. F., & Coutts, J. J. (2020). Use omega rather than Cronbach’s alpha for estimating reliability. But... Communication Methods and Measures, 14 (1), 1–24. https://doi.org/10.1080/19312458.2020.1718629

Henseler, J. (2021). Composite-Based Structural Equation Modeling: Analyzing Latent and Emergent Variables . The Guilford Press.

Holgado-Tello, F., Morata-Ramirez, M., & García, M. (2016). Robust Estimation Methods in Confirmatory Factor Analysis of Likert Scales: A Simulation Study. International Review of Social Sciences and Humanities, 11 (2), 80–96.

Holgado-Tello, F. P., Morata-Ramírez, M. Á., & BarberoGarcía, M. I. (2018). Confirmatory factor analysis of ordinal variables: A simulation study comparing the main estimation methods. Avances En Psicologia Latinoamericana, 36 (3), 601–618. https://doi.org/10.12804/revistas.urosario.edu.co/apl/a.4932

Hoyle, R. H. (2023). Handbook of structural equation modeling (2nd ed.). The Guilford Press.

Hughes, D. J. (2018). Psychometric Validity: Establishing the Accuracy and Appropriateness of Psychometric Measures. In P. Irwing, T. Booth, & D. J. Hughes (Eds.), The Wiley Handbook of Psychometric Testing: A Multidisciplinary Reference on Survey, Scale and Test Development (1st ed.). John Wiley & Sons Ltd.

Jackson, D. L., Gillaspy, J. A., & Purc-Stephenson, R. (2009). Reporting practices in confirmatory factor analysis: An overview and some recommendations. Psychological Methods , 14 (1). https://doi.org/10.1037/a0014694

Jak, S., Jorgensen, T. D., Verdam, M. G. E., Oort, F. J., & Elffers, L. (2021). Analytical power calculations for structural equation modeling: A tutorial and Shiny app. Behavior Research Methods, 53 , 1385–1406. https://doi.org/10.3758/s13428-020-01479-0/Published

JASP Team. (2023). JASP (Version 0.17.1) . [Computer Software]. Available at: https://jasp-stats.org/ . Accessed 4 Mar 2024.

Jobst, L. J., Bader, M., & Moshagen, M. (2023). A tutorial on assessing statistical power and determining sample size for structural equation models. Psychological Methods, 28 (1), 207–221. https://doi.org/10.1037/met0000423

Kalkbrenner, M. T. (2023). Alpha, Omega, and H Internal Consistency Reliability Estimates: Reviewing These Options and When to Use Them. Counseling Outcome Research and Evaluation, 14 (1), 77–88. https://doi.org/10.1080/21501378.2021.1940118

Kiliç, A., Uysal, İ, & Atar, B. (2020). Comparison of confirmatory factor analysis estimation methods on binary data. International Journal of Assessment Tools in Education, 2020 (3), 451–487. https://doi.org/10.21449/ijate.660353

Kiliç, A.F., & Dogan, N. (2021). Comparison of confirmatory factor analysis estimation methods on mixed-format data. International Journal of Assessment Tools in Education , 21–37. https://doi.org/10.21449/ijate.782351

Kline, R. B. (2016). Principles and Practice of Structural Equation Modeling (4th ed.). The Guilford Press.

Kluthcovsky , A.C.G.C., & Kluthcovsky , F.A. (2009). The WHOQOL- bref, an instrument to assess quality of life: a systematic review. Revista de Psiquiatria Do Rio Grande Do Sul , 31 (3 suppl). https://doi.org/10.1590/S0101-81082009000400007

Kyriazos, T. A. (2018). Applied Psychometrics: Sample Size and Sample Power Considerations in Factor Analysis (EFA, CFA) and SEM in General. Psychology, 09 (08), 2207–2230. https://doi.org/10.4236/psych.2018.98126

Lei, P. W. (2009). Evaluating estimation methods for ordinal data in structural equation modeling. Quality and Quantity, 43 (3), 495–507. https://doi.org/10.1007/s11135-007-9133-z

Lei, P. W., & Shiverdecker, L. K. (2020). Performance of Estimators for Confirmatory Factor Analysis of Ordinal Variables with Missing Data. Structural Equation Modeling: A Multidisciplinary Journal, 27 (4), 584–601. https://doi.org/10.1080/10705511.2019.1680292

Leite, W. L., Bandalos, D. L., & Shen, Z. (2023). Simulation Methods in Structural Equation Modeling. In R. H. Hoyle (Ed.), Handbook of Structural Equation Modeling (2nd ed.). The Guilford Press.

Li, C. H. (2016a). Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares. Behavior Research Methods, 48 (3), 936–949. https://doi.org/10.3758/s13428-015-0619-7

Li, C. H. (2016b). The performance of ML, DWLS, and ULS estimation with robust corrections in structural equation models with ordinal variables. Psychological Methods, 21 (3), 369–387. https://doi.org/10.1037/met0000093

Lin, L. C., & Yao, G. (2022). Validation of the factor structure of the WHOQOL-BREF using meta-analysis of exploration factor analysis and social network analysis. Psychological Assessment, 34 (7), 660–670. https://doi.org/10.1037/pas0001122

Love, J., Selker, R., Marsman, M., Jamil, T., Dropmann , D., Verhagen, J., Ly, A., Gronau, Q. F., Šmíra , M., Epskamp , S., Matzke , D., Wild, A., Knight, P., Rouder , J.N., Morey, RD, & Wagenmakers , E.J. (2019). JASP: Graphical statistical software for common statistical designs. Journal of Statistical Software , 88 (1). https://doi.org/10.18637/jss.v088.i02

Marcoulides, K. M., & Yuan, K. H. (2017). New Ways to Evaluate Goodness of Fit: A Note on Using Equivalence Testing to Assess Structural Equation Models. Structural Equation Modeling, 24 (1), 148–153. https://doi.org/10.1080/10705511.2016.1225260

Maydeu-Olivares, A. (2017). Maximum Likelihood Estimation of Structural Equation Models for Continuous Data: Standard Errors and Goodness of Fit. Structural Equation Modeling, 24 (3), 383–394. https://doi.org/10.1080/10705511.2016.1269606

McNeish, D. (2018). Thanks coefficient alpha, We’ll take it from here. Psychological Methods, 23 (3), 412–433. https://doi.org/10.1037/met0000144

McNeish, D. (2023a). Dynamic Fit Index Cutoffs for Factor Analysis with Likert, Ordinal, or Binary Responses. PsyArXiv Preprints . https://doi.org/10.31234/osf.io/tp35s

McNeish, D. (2023b). Generalizability of Dynamic Fit Index, Equivalence Testing, and Hu & Bentler Cutoffs for Evaluating Fit in Factor Analysis. Multivariate Behavioral Research, 58 (1), 195–219. https://doi.org/10.1080/00273171.2022.2163477

McNeish, D., & Wolf, M. G. (2022). Dynamic fit index cutoffs for one-factor models. Behavior Research Methods, 55 (3), 1157–1174. https://doi.org/10.3758/s13428-022-01847-y

McNeish, D., & Wolf, M. G. (2023). Dynamic fit index cutoffs for confirmatory factor analysis models. Psychological Methods, 28 (1), 61–88. https://doi.org/10.1037/met0000425

McNeish, D., & Manapat, P. D. (2023). Dynamic fit index cutoffs for hierarchical and second-order factor models. PsyArXiv Preprints . https://doi.org/10.31234/osf.io/sm6az

Mîndrilă, D. (2010). Maximum Likelihood (ML) and Diagonally Weighted Least Squares (DWLS) Estimation Procedures: A Comparison of Estimation Bias with Ordinal and Multivariate Non-Normal Data. International Journal of Digital Society (IJDS), 1 (1), 60–66.

Morata-Ramírez, M. A., & Holgado-Tello , F.P. (2013). Construct Validity of Likert Scales through Confirmatory Factor Analysis: A Simulation Study Comparing Different Methods of Estimation Based on Pearson and Polychoric Correlations. International Journal of Social Science Studies , 1 (1). https://doi.org/10.11114/ijsss.v1i1.27

Muthén, L. K., & Muthén, B. O. (2002). How to Use a Monte Carlo Study to Decide on Sample Size and Determine Power. Structural Equation Modeling: A Multidisciplinary Journal, 9 (4), 599–620. https://doi.org/10.1207/S15328007SEM0904_8

Nalbantoğlu-Yılmaz, F. (2019). Comparison of Different Estimation Methods Used in Confirmatory Factor Analyses in Non-Normal Data: A Monte Carlo Study. International Online Journal of Educational Sciences , 11 (4). https://doi.org/10.15345/iojes.2019.04.010

Navarro, D. J., Foxcroft, D. R., & Faulkenberry, T. J. (2019). Learning statistics with JASP: A tutorial for psychology students and other beginners (0.7) . University of New South Wales https://learnstatswithjasp.com/ . Accessed 4 Mar 2024.

Niu, G., Segall, R. S., Zhao, Z., & Wu, Z. (2021). A Survey of Open Source Statistical Software (OSSS) and Their Data Processing Functionalities. International Journal of Open Source Software and Processes, 12 (1), 1–20. https://doi.org/10.4018/IJOSSP.2021010101

Nye, C. D. (2022). Reviewer Resources: Confirmatory Factor Analysis. Organizational Research Methods , 109442812211205. https://doi.org/10.1177/10944281221120541

Perera, H. N., Izadikhah, Z., O’Connor, P., & McIlveen, P. (2018). Resolving Dimensionality Problems With WHOQOL-BREF Item Responses. Assessment, 25 (8), 1014–1025. https://doi.org/10.1177/1073191116678925

Pornprasertmanit, S., Miller, P., Jorgensen, T. D., & Corbin, Q. (2022). simsem: SIMulated Structural Equation Modeling (0.5-16). [Computer software]. R package. Available at: www.simsem.org . Accessed 4 Mar 2024.

R Core Team. (2022). A: A language and environment for statistical computing (Version 4.2.2) . R Foundation for Statistical Computing. https://www.R-project.org/ . Accessed 4 Mar 2024.

Reeves, T. D., & Marbach-Ad, G. (2016). Contemporary test validity in theory and practice: A primer for discipline-based education researchers. CBE Life Sciences Education , 15 (1). https://doi.org/10.1187/cbe.15-08-0183

Rhemtulla, M., Brosseau-Liard, P. É., & Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17 (3), 354–373. https://doi.org/10.1037/a0029315

Rios, J., & Wells, C. (2014). Validity evidence based on internal structure. Psychothema, 26 (1), 108–116. https://doi.org/10.7334/psicothema2013.260

Robitzsch, A. (2022). On the Bias in Confirmatory Factor Analysis When Treating Discrete Variables as Ordinal Instead of Continuous. Axioms, 11 (4). https://doi.org/10.3390/axioms11040162

Rogers, P. (2021b). RAC-Revista de Administração Contemporânea. Harvard Dataverse. https://doi.org/10.7910/DVN/RCX8FF

Chapter   Google Scholar  

Rogers, P. (2021a). Data for “Best Practices for Your Exploratory Factor Analysis: a Factor Tutorial” published by RAC-Revista de Administração Contemporânea, Mendeley Data, V2 . https://doi.org/10.17632/rdky78bk8r.2

Rogers, P. (2022). Best Practices for Your Exploratory Factor Analysis: A Factor Tutorial. Revista de Administração Contemporânea, 26 (6). https://doi.org/10.1590/1982-7849rac2022210085.en

Rosseel, Y. (2012). lavaan : An R Package for Structural Equation Modeling. Journal of Statistical Software, 48 (2), 1–36. https://doi.org/10.18637/jss.v048.i02 .

Rosseel, Y. (2023). The lavaan tutorial . Retrieved from: https://lavaan.ugent.be/ . Accessed 4 Mar 2024.

Savalei, V., & Rhemtulla, M. (2013). The performance of robust test statistics with categorical data. British Journal of Mathematical and Statistical Psychology, 66 (2), 201–223. https://doi.org/10.1111/j.2044-8317.2012.02049.x

Article   MathSciNet   PubMed   Google Scholar  

Schmitt, T. A. (2011). Current Methodological Considerations in Exploratory and Confirmatory Factor Analysis. Journal of Psychoeducational Assessment, 29 (4), 304–321. https://doi.org/10.1177/0734282911406653

Schumacker, R. E., Wind, S. A., & Holmes, L. F. (2021). Resources for Identifying Measurement Instruments for Social Science Research. Measurement: Interdisciplinary Research and Perspectives, 19 (4), 250–257. https://doi.org/10.1080/15366367.2021.1950486

Shek, D. T. L., & Yu, L. (2014). Use of structural equation modeling in human development research. International Journal on Disability and Human Development, 13 (2), 157–167. https://doi.org/10.1515/ijdhd-2014-0302 . Freund Publishing House Ltd.

Shi, D., & Maydeu-Olivares, A. (2020). The Effect of Estimation Methods on SEM Fit Indices. Educational and Psychological Measurement, 80 (3), 421–445. https://doi.org/10.1177/0013164419885164

Shi, D., Maydeu-Olivares, A., & Rosseel, Y. (2020). Assessing Fit in Ordinal Factor Analysis Models: SRMR vs RMSEA. Structural Equation Modeling: A Multidisciplinary Journal, 27 (1), 1–15. https://doi.org/10.1080/10705511.2019.1611434

Sireci, S. G., & Sukin, T. (2013). Test validity. APA handbook of testing and assessment in psychology: Test theory and testing and assessment in industrial and organizational psychology (pp. 61–84). American Psychological Association. https://doi.org/10.1037/14047-004

Skevington, S. M., & Epton, T. (2018). How will the sustainable development goals deliver changes in well-being? A systematic review and meta-analysis to investigate whether WHOQOL-BREF scores respond to change. BMJ Global Health, 3 . https://doi.org/10.1136/bmjgh-2017-000609

Trizano-Hermosilla, I., & Alvarado, J.M. (2016). Best alternatives to Cronbach's alpha reliability in realistic conditions: Congeneric and asymmetrical measurements. Frontiers in Psychology, 7 (MAY). https://doi.org/10.3389/fpsyg.2016.00769

Valentini, F., & Damásio, B.F. (2016). Extracted Mean Variance and Composite Reliability: Precision Indicators. Psychology: Theory and Research, 32 (2). https://doi.org/10.1590/0102-3772e322225

Wagenmakers , E.J., Love, J., Marsman , M., Jamil, T., Ly , A., Verhagen , J., Selker , R., Gronau , Q.F., Dropmann , D., Boutin , B ., Meerhoff , F., Knight, P., Raj , A., van Kesteren , E.-J., van Doorn , J., Šmíra , M., Epskamp , S., Etz, A., Matzke , D. ., …, Morey, R. D. (2018). Bayesian inference for psychology. Part II: Example applications with JASP. Psychonomic Bulletin & Review, 25 (1), 58–76. https://doi.org/10.3758/s13423-017-1323-7

Walker, R., Moraine, A., Osborn, H., Black, K. J., Palmer, A., Scott, K., & Humphrey, K. (2022). Exploring diversity with statistics: Step-by-step JASP guides . University of Tennessee at Chattanooga. https://scholar.utc.edu/open-textbooks/1/ . Accessed 4 Mar 2024.

Whittaker, T. A., & Schumacker, R. E. (2022). A Beginner’s Guide to Structural Equation Modeling (5th ed.). Routledge Taylor & Francis Group.

Wolf, M. G., & McNeish, D. (2023). dynamic: An R Package for Deriving Dynamic Fit Index Cutoffs for Factor Analysis. Multivariate Behavioral Research, 58 (1), 189–194. https://doi.org/10.1080/00273171.2022.2163476

Xia, Y., & Yang, Y. (2019). RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods. Behavior Research Methods, 51 (1), 409–428. https://doi.org/10.3758/s13428-018-1055-2

Yang-Wallentin, F., Jöreskog, K. G., & Luo, H. (2010). Confirmatory factor analysis of ordinal variables with misspecified models. Structural Equation Modeling, 17 (3), 392–423. https://doi.org/10.1080/10705511.2010.489003

Yuan, K. H., Chan, W., Marcoulides, G. A., & Bentler, P. M. (2016). Assessing Structural Equation Models by Equivalence Testing With Adjusted Fit Indexes. Structural Equation Modeling, 23 (3), 319–330. https://doi.org/10.1080/10705511.2015.1065414

Zhang, Z., & Yuan, K. H. (2018). Practical statistical power analysis using Webpower and R . ISDSA Press.  https://doi.org/10.35566/power

Download references

Author information

Authors and affiliations.

Federal University of Uberlândia, João Naves de Ávila 2121, 38.408-100, Uberlândia, Minas Gerais, Brazil

Pablo Rogers

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Pablo Rogers .

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open Practices Statement

The data, code, materials, and preprint for this project may be found at https://osf.io/cv8wb/ . This paper was not pre-registered.

This paper was presented (in Portuguese) at the Brazil Association of Postgraduate Management Meeting (EnANPAD) in 2023 and has undergone significant revisions in its current form.

Appendix: Best practices in a typical CFA in applied social sciences under appendix section

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Rogers, P. Best practices for your confirmatory factor analysis: A JASP and lavaan tutorial. Behav Res (2024). https://doi.org/10.3758/s13428-024-02375-7

Download citation

Accepted : 17 February 2024

Published : 13 March 2024

DOI : https://doi.org/10.3758/s13428-024-02375-7

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Confirmatory factor analysis
  • Structural equation modeling
  • Internal structure validity
  • Find a journal
  • Publish with us
  • Track your research

No internet connection.

All search filters on the page have been cleared., your search has been saved..

  • All content
  • Dictionaries
  • Encyclopedias
  • Expert Insights
  • Foundations
  • How-to Guides
  • Journal Articles
  • Little Blue Books
  • Little Green Books
  • Project Planner
  • Tools Directory
  • Sign in to my profile My Profile

Not Logged In

  • Sign in Signed in
  • My profile My Profile

Not Logged In

Introduction to Factor Analysis

  • By: Jae-On Kim & Charles W. Mueller
  • Publisher: SAGE Publications, Inc.
  • Series: Quantitative Applications in the Social Sciences
  • Publication year: 1978
  • Online pub date: January 01, 2011
  • Discipline: Anthropology
  • Methods: Factor analysis , Correlation , Covariance matrix
  • DOI: https:// doi. org/10.4135/9781412984652
  • Keywords: foundations Show all Show less
  • Print ISBN: 9780803911659
  • Online ISBN: 9781412984652
  • Buy the book icon link

Describes the mathematical and logical foundations at a level which does not presume advanced mathematical or statistical skills, illustrating how to do factor analysis with several of the more popular packaged computer programmes.

Front Matter

  • Editor's Introduction
  • Introduction
  • Logical Foundations of Factor Analysis
  • Obtaining Factor Analysis Solutions

Back Matter

  • Questions Pertaining to Books, Journals, and Computer Programs
  • About the Author

Sign in to access this content

Get a 30 day free trial, more like this, sage recommends.

We found other relevant content for you on other Sage platforms.

Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches

  • Sign in/register

Navigating away from this page will delete your results

Please save your results to "My Self-Assessments" in your profile before navigating away from this page.

Sign in to my profile

Sign up for a free trial and experience all Sage Learning Resources have to offer.

You must have a valid academic email address to sign up.

Get off-campus access

  • View or download all content my institution has access to.

Sign up for a free trial and experience all Sage Research Methods has to offer.

  • view my profile
  • view my lists

Help | Advanced Search

Computer Science > Computer Vision and Pattern Recognition

Title: mm1: methods, analysis & insights from multimodal llm pre-training.

Abstract: In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various architecture components and data choices. Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial design lessons. For example, we demonstrate that for large-scale multimodal pre-training using a careful mix of image-caption, interleaved image-text, and text-only data is crucial for achieving state-of-the-art (SOTA) few-shot results across multiple benchmarks, compared to other published pre-training results. Further, we show that the image encoder together with image resolution and the image token count has substantial impact, while the vision-language connector design is of comparatively negligible importance. By scaling up the presented recipe, we build MM1, a family of multimodal models up to 30B parameters, including both dense models and mixture-of-experts (MoE) variants, that are SOTA in pre-training metrics and achieve competitive performance after supervised fine-tuning on a range of established multimodal benchmarks. Thanks to large-scale pre-training, MM1 enjoys appealing properties such as enhanced in-context learning, and multi-image reasoning, enabling few-shot chain-of-thought prompting.

Submission history

Access paper:.

  • Download PDF
  • Other Formats

References & Citations

  • Google Scholar
  • Semantic Scholar

BibTeX formatted citation

BibSonomy logo

Bibliographic and Citation Tools

Code, data and media associated with this article, recommenders and search tools.

  • Institution

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs .

IMAGES

  1. Factor Analysis

    types of factor analysis in research methodology

  2. Factor Analysis

    types of factor analysis in research methodology

  3. What is factor analysis?

    types of factor analysis in research methodology

  4. Research methodology design. Principal component method = PCA, factor

    types of factor analysis in research methodology

  5. SPSS Factor Analysis

    types of factor analysis in research methodology

  6. Analyse factorielle : Définition, méthodes et exemples // Qualtrics

    types of factor analysis in research methodology

VIDEO

  1. Statistical Analysis of 2^2 Factorial Experimental Design

  2. Factor Analysis 3

  3. Research Methodology: Part 3-Factor Analysis & Reliability Test

  4. Analysis of Factorial Experiment

  5. FACTOR ANALYSIS || BUSINESS RESEARCH METHODS || Dr. SANDEEP KUMAR || MBA || TIAS || TECNIA TV

  6. 42- Factor Analysis

COMMENTS

  1. Factor Analysis

    Factor Analysis Steps. Here are the general steps involved in conducting a factor analysis: 1. Define the Research Objective: Clearly specify the purpose of the factor analysis. Determine what you aim to achieve or understand through the analysis. 2. Data Collection: Gather the data on the variables of interest.

  2. Factor Analysis: a means for theory and instrument development in

    Factor analysis methods can be incredibly useful tools for researchers attempting to establish high quality measures of those constructs not directly observed and captured by observation. Specifically, the factor solution derived from an Exploratory Factor Analysis provides a snapshot of the statistical relationships of the key behaviors ...

  3. Factor Analysis and How It Simplifies Research Findings

    Factor analysis isn't a single technique, but a family of statistical methods that can be used to identify the latent factors driving observable variables. Factor analysis is commonly used in market research , as well as other disciplines like technology, medicine, sociology, field biology, education, psychology and many more.

  4. Factor Analysis Guide with an Example

    The first methodology choice for factor analysis is the mathematical approach for extracting the factors from your dataset. The most common choices are maximum likelihood (ML), principal axis factoring (PAF), and principal components analysis (PCA). You should use either ML or PAF most of the time.

  5. PDF Factor Analysis

    Choosing among Different Methods ! Between MLE and LS ! LS is preferred with " few indicators per factor " Equeal loadings within factors " No large cross-loadings " No factor correlations " Recovering factors with low loadings (overextraction) ! MLE if preferred with " Multivariate normality

  6. Exploratory Factor Analysis: A Guide to Best Practice

    Abstract. Exploratory factor analysis (EFA) is a multivariate statistical method that has become a fundamental tool in the development and validation of psychological theories and measurements. However, researchers must make several thoughtful and evidence-based methodological decisions while conducting an EFA, and there are a number of options ...

  7. A Practical Introduction to Factor Analysis

    Overview. Factor analysis is a method for modeling observed variables and their covariance structure in terms of unobserved variables (i.e., factors). There are two types of factor analyses, exploratory and confirmatory. Exploratory factor analysis (EFA) is method to explore the underlying structure of a set of observed variables, and is a ...

  8. Factor Analysis: Definition, Types, and Examples

    Factor analysis is a commonly used data reduction statistical technique within the context of market research. The goal of factor analysis is to discover relationships between variables within a dataset by looking at correlations. ... but it's important to understand that there is different math going on behind the scenes for each method. Types ...

  9. Factor analysis

    Factor analysis can be only as good as the data allows. In psychology, where researchers often have to rely on less valid and reliable measures such as self-reports, this can be problematic. Interpreting factor analysis is based on using a "heuristic", which is a solution that is "convenient even if not absolutely true".

  10. Understanding and Using Factor Scores: Considerations for the ...

    or confirmatory factor analysis procedures, and 63 articles (27.5%) did not provide sufficient information on the methodology used. For example, many factor score methods are built on the assumption that the resulting factor scores will be uncorrelated; however, orthogonal factors are often the rarity rather than the norm in educational research.

  11. Factor Analysis

    Intuition. Figure 1 shows hypothetical data for two measurements x 1 and x 2 available for 100 respondents, when the correlation between the measurements is 0, 0.5 or 0.9, respectively. Although the graphs differ in how closely the measurements cluster around a straight line, it is apparent that in all three panels the variation along both the x axis and the y axis is substantial and about the ...

  12. Lesson 12: Factor Analysis

    Overview. Factor Analysis is a method for modeling observed variables, and their covariance structure, in terms of a smaller number of underlying unobservable (latent) "factors.". The factors typically are viewed as broad concepts or ideas that may describe an observed phenomenon. For example, a basic desire of obtaining a certain social ...

  13. Sage Research Methods

    Methods Map. This visualization demonstrates how methods are related and connects users to relevant content. Project Planner. Find step-by-step guidance to complete your research project. Which Stats Test. Answer a handful of multiple-choice questions to see which statistical method is best for your data. Reading Lists

  14. Factor Analysis in Psychology: Types, How It's Used

    The primary goal of factor analysis is to distill a large data set into a working set of connections or factors. Dr. Jessie Borelli, PhD, who works at the University of California-Irvine, uses factor analysis in her work on attachment. She is doing research that looks into how people perceive relationships and how they connect to one another.

  15. (PDF) Overview of Factor Analysis

    It is a collection of methods employed for this purpose. (DeCoster, 1998).Correlation pattern (or covariances) between the parameters (observed measure) are examined to perform the factor analysis

  16. Introduction to Factor Analysis

    Intoduction to Factor Analysis . In this tutorial, I'll introduce to you the multivariate statistical method, factor analysis (FA), used for uncovering the underlying relationships between a set of observed variables.It is a widely utilized technique in social and behavioral science for identifying factors such as intelligence, socio-economic status, or personality traits.

  17. Sage Research Methods Foundations

    Methods Map. This visualization demonstrates how methods are related and connects users to relevant content. Project Planner. Find step-by-step guidance to complete your research project. Which Stats Test. Answer a handful of multiple-choice questions to see which statistical method is best for your data. Reading Lists

  18. Factor Analysis

    Factor analysis (Chattopadhyay and Chattopadhyay 2014) is a statistical method used to study the dimensionality of a set of variables. In factor analysis, latent variables represent unobserved constructs and are referred to as factors or dimensions. Factor analysis attempts to identify underlying variables or factors that explain the pattern of ...

  19. 3 Types of Factor Analysis

    Instead of grouping responses and response types, factor analysis segregates the variable and groups these according to their co relevance.There are mainly three types of factor analysis that are used for different kinds of market research and analysis. Exploratory factor analysis. Confirmatory factor analysis. Structural equation modeling.

  20. What Is Factor Analysis? (Plus 5 Methods for Conducting It)

    Here's a list of five common methods you can use to conduct a factor analysis: 1. Principal component analysis. Principal component analysis involves identifying the variables with the maximum amount of variance using a covariance matrix. A covariance matrix is a visual representation of correlations and differences between a set of variables.

  21. Factor Analysis

    Types of Factor Analysis. There are different methods that we use in factor analysis from the data set: 1. Principal component analysis. It is the most common method which the researchers use. Also, it extracts the maximum variance and put them into the first factor. Subsequently, it removes the variance explained by the first factor and ...

  22. Best practices for your confirmatory factor analysis: A JASP and

    Confirmatory factor analysis (CFA) is a fundamental method for evaluating the internal structural validity of measurement instruments. In most CFA applications, the measurement model serves as a means to an end rather than an end in itself. To select the appropriate model, prior validity evidence is crucial, and items are typically assessed on an ordinal scale, which has been used in the ...

  23. Sage Research Methods Video

    In chapter five of his series on statistics using SPSS, Professor Andy Field uses hands-on demonstration and screen-shot images to examine factor analysis. He discusses factor loads, scree plots, and other concepts to vanquish statistics fears and help cultivate SPSS mastery.

  24. Sage Research Methods

    Methods Map. This visualization demonstrates how methods are related and connects users to relevant content. Project Planner. Find step-by-step guidance to complete your research project. Which Stats Test. Answer a handful of multiple-choice questions to see which statistical method is best for your data. Reading Lists

  25. MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

    In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various architecture components and data choices. Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial ...