SciELO - Scientific Electronic Library Online

 
vol.29WORKING MEMORY IN CHILDREN: A PRECURSOR READING SKILL? author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

article

Indicators

Share


Estudos de Psicologia (Natal)

Print version ISSN 1413-294XOn-line version ISSN 1678-4669

Estud. psicol. (Natal) vol.29  Natal  2024  Epub Nov 10, 2025

https://doi.org/10.69909/1678-4669.20240001 

Psicobiologia e Psicologia Cognitiva

Invariance analysis of two empathy instruments: sex differences

Análises de invariância de dois instrumentos sobre empatia: diferenças entre sexos

Análisis de invarianza de dos instrumentos de empatía: diferencias entre sexos

Luisa Braga Pereira¹ 

Mestre em Psicologia Social pela Universidade do Estado do Rio de Janeiro (UERJ). Doutoranda em Psicologia Social pela Universidade do Estado do Rio de Janeiro (UERJ). E-mail: lubrpereira@gmail.com ORCID: https://orcid.org/0000-0003-4474-3298 Correspondência:Rua São Francisco Xavier, 524, bloco F, 10º andar, - Maracanã - Rio de Janeiro, RJ. Telefone: 2334-0216


http://orcid.org/0000-0003-4474-3298

Angela Donato Oliva¹ 

Doutora em Psicologia Escolar e do Desenvolvimento Humano pela Universidade de São Paulo (USP). Professora Adjunta da Universidade do Estado do Rio de Janeiro e da Universidade Federal do Rio de Janeiro. E-mail: angeladonatoliva@gmail.com ORCID: https://orcid.org/0000-0003-4875-9078


http://orcid.org/0000-0003-4875-9078

1¹Universidade do Estado do Rio de Janeiro


Abstract

Empathy is a multidimensional construct that involves the comprehension of others’ feelings and intentions. Sex differences concerning empathy have been widely discussed. Women tend to respond more empathically in studies that use self-report instruments, but the results may be caused by social desirability and cultural expectations. This research aimed to evaluate differences between men and women in the psychometrical parameters of empathy measurements by conducting multi-group confirmatory factor analyses (MGCFA). The sample was formed of 477 participants and two instruments were used: the Brazilian versions of the Interpersonal Reactivity Index (IRI) and the Empathy Quotient (EQ). Sex invariance was found for the EQ but the IRI did not support metric invariance, as four items from the Empathic Concern subscale and two from the Personal Distress subscale were not equivalent between groups. Social desirability, psychometrical and instrument adaptation properties, and social expectations might have influenced the results.

Keywords: empathy; sex differences; factor analysis; self-report

Resumo

A empatia é um construto multidimensional que envolve a compreensão das intenções e sentimentos de outrem. Diferenças entre sexos quanto à empatia têm sido amplamente discutidas. Pesquisas que usam medidas de autorrelato mostram que as mulheres expressam mais empatia, porém esses resultados podem ser influenciados pela desejabilidade social e expectativas culturais. O objetivo deste estudo foi verificar se os instrumentos de empatia são invariantes para homens e mulheres a partir de análises fatoriais confirmatórias multigrupos. A amostra foi composta por 477 participantes e foram utilizados dois instrumentos: Escala Multidimensional de Reatividade Interpessoal (EMRI) e Quociente de Empatia (QE). O QE apresentou-se como invariante para o sexo, mas a EMRI não demonstrou invariância métrica: quatro itens da subescala de consideração empática e dois da subescala de personal distress não foram equivalentes entre os grupos. A desejabilidade social, propriedades psicométricas e de adaptação, e expectativas sociais podem ter influenciado tais resultados.

Palavras-chave: empatia; diferenças sexuais; análise fatorial; autorrelato

Resumen

La empatía es un constructo multidimensional que envuelve la comprensión de intenciones y sentimientos de otros. Diferencias entre sexos sobre empatía son ampliamente discutidas. Pesquisas que usan medidas de autoinforme demuestran que las mujeres expresan más empatía, pero esos resultados pueden ser influenciados pela deseabilidad social y expectativas culturales. El objetivo del estudio fue avaluar si hay diferencias en los parámetros psicométricos de instrumentos entre hombres y mujeres al conducir análisis factoriales confirmatorias multigrupo (AFCMG). La muestra fue compuesta por 477 participantes y fueron utilizados dos instrumentos: las versiones brasileras del Interpersonal Reactivity Index (IRI) y del Empathy Quotient (EQ). El QE se presentó como invariante para el sexo, pero el IRI no obtuvo invariancia métrica: cuatro ítems de la escala de consideración empática y dos de la escala de angustia personal no fueron equivalentes. La deseabilidad social, propiedades psicométricas y de adaptación, y expectativas sociales probablemente influenciaron los resultados.

Palabras-clave: empatía; diferencias sexuales; análisis factorial; autoinforme

Introduction

Empathy is a social ability that involves the interpersonal sharing of needs, experiences, and desires (Hall & Schwartz, 2019). Although it is considered an essential element regarding relationships, there have been registered differences concerning the number of dimensions that it is composed of. It is often described as being formed by two dimensions: affective and cognitive (Azevedo et al., 2018); whereas other authors mention a third dimension, named behavioral (Clark et al., 2018; Falcone et al., 2008).

Among the most employed definitions of empathy and its development in studies, the theoretical proposal formulated by Davis (1980) stands out by using other individuals’ observed experiences to define the cited social ability. Therefore, the cognitive domain, named perspective taking, refers to the ability to recognize others’ points of view, while the affective dimension, characterized as empathic concern, involves the feelings of consideration, worry, or compassion for others’ emotional states. Moreover, Davis (1983) talks about personal distress, which is related to one’s anxiety due to tensions in interpersonal contexts, and about fantasy, which is the use of fictional situations as a background to assume the character’s perspectives and feelings, although the latter has been the target of critics and reconsiderations (Koller et al., 2001).

The theme of sex differences regarding empathy has been frequently studied in psychology (Calandri et al., 2019; Di Tella et al., 2020; Eisenberg & Lennon, 1983). It is suggested that the idea that women tend to be more empathetic than men is based on social behavior stereotypes, which allude to the notion that female individuals are more caring and interpersonally oriented than males (Eisenberg & Lennon, 1983). Differences in cognitive tasks’ performances between men and women have been widely reported in the literature; consequently, even though women repeatedly perform better in those tasks, there is no consensus in the literature regarding sex divergences, even in research that specifically approaches the construct of empathy (Di Tella et al., 2020).

Sex differences are often evaluated by comparing men’s and women’s responses to self-report measures (Dryburgh & Vachon, 2019; Zhao et al., 2019). Although this kind of instrument has its advantages, it is strongly influenced by social aspects, such as social norms and expectations, for example (Dang et al., 2020). Self-report scales are also known to be affected by biased responses from participants who want to be considered socially adjusted and accepted by the community; this aspect that influences self-report instruments is called social desirability (Pavolov et al., 2021; Vesely & Klöckner, 2020). Accordingly, the use of self-report measure results regarding sex differences in empathy might be problematic, since the evaluated construct is characterized as a type of social ability, and, therefore, it usually comprises the desire for acceptance and social biases.

Self-report measures seem to evaluate how frequently a person acts and responds empathically to interpersonal situations, so they may diverge when measuring this construct between participants with different characteristics (Eisenberg & Lennon, 1983). Regarding sex differences, the differentiated levels of reported empathy by women may be related to a greater feminine disposition to perceive their behaviors and sentiments as represented by items that allude to empathic aspects, in comparison to men (Ventura-León et al., 2019), and it could influence the identification of participants with the stereotypes commonly attributed to their sex (Baez et al., 2017).

Multi-group confirmatory factorial analysis (MGCFA) has been used to verify if empathy is measured by self-report instruments in the same way for men and women. In this context, the invariance of empathy questionnaires related to the participants’ sex can be evaluated; if a measure is invariant, it means that group divergences are derived from individual differences, not methodological ones (Han et al., 2019). Whereas some recent studies point out the invariance of empathy scales (Grevenstein, 2020; Takamatsu et al., 2021), others attest that the instruments evaluate this social ability differently for men and women (Diotaiuti et al., 2021; Liu et al., 2018). Due to these diverse results in the literature, it is relevant to delineate an empirical study that seeks the identification of potential empathy measurements’ invariance regarding sex differences.

Self-report instruments continue to be often used to verify potential sex differences in empathy expression. Thus, it is common for research that evaluates empathy to utilize more than one self-report measurement, to present a more complex profile of empathic response (Azevedo et al., 2018; Zhao et al., 2019), especially by using the Interpersonal Reactivity Index (IRI) and the Empathy Quotient (EQ; de Lima & Osório, 2021; Zhao et al., 2019). This study’s objective consists of evaluating whether there are any divergences in the parameters of the cited empathy instruments concerning the participants’ sex by conducting MGCFAs and, therefore, concluding if those self-report measures are indicated to evaluate empathy in both men and women.

Method

Participants

Four hundred and seventy-seven adult participants were recruited. Participants’ ages varied from 18 to 59 years old (M=31.28, SD=11.34), and three hundred and sixteen were women (66.2%). The final sample was majorly composed of individuals from the state of Rio de Janeiro (72.1%), with an undergraduate degree (55.8%), and characterized as being single (59.3%).

Instruments

Two instruments were used to measure empathy. The Brazilian version of the Interpersonal Reactivity Index (IRI; Davis, 1980) was applied to evaluate empathy (Escala Multidimensional de Reatividade Interpessoal; Koller et al., 2001), as it measures the social ability through the reactions that participants report while observing experiences lived by others. It is formed of 17 items and three subscales: Empathic Concern (7 items; α=0.67), Perspective Taking (5 items; α=0.63), and Personal Distress (5 items; α=0.54). The adapted version to the Brazilian context did not include the Fantasy dimension, as the original one did (Koller et al., 2001).

The Empathy Quotient (EQ; Baron-Cohen & Wheelwright, 2004) was translated into Brazilian Portuguese (Quociente de Empatia) and adapted into a short version consisting of 21 items (Castelhano-Souza et al., 2018). It was composed of four dimensions: Cognitive Empathy (5 items; α=0.78), Emotional Reactivity (5 items; α=0.73), Social Abilities (5 items; α=0.73), and Emphatic Difficulties (6 items; α=0.57). The assessment of empathy was administered through the ratings of responses on a 4-point Likert scale.

The decision of choosing to focus on verifying if the IRI and the EQ are invariant when accounting for the sex variable stems from their frequent use in literature (de Lima & Osório, 2021). Since the two instruments were adapted to the Brazilian context in two different time periods, as the IRI was translated in 2001 and the EQ was validated in 2018, it would be interesting to attest whether these distinct scales may have significant differences in the assessment of empathy.

Procedures

Data were collected online and participants were recruited through invitations on emails and social media. The sample was assembled by convenience. All the participants had previously signed an Informed Consent Form. This study was performed after the approval from the Ethics in Research Committee.

Statistical Analysis

MGCFAs were performed to assess whether there were any differences in the empathy instruments (IRI and EQ) parameters regarding the participants’ sex. According to Roover (2022), the minimum sample size needed to adequately conduct the MGCFAs is equal to 300 individuals. The Robust Diagonally Weighted Least Squares (RDWLS) estimation method was used for categorical data (Li, 2016).

Many models have to be tested before measurement invariance can be supported for an instrument. However, since psychology scales tend to perform badly in some models due to the nature of the constructs they assess, it is widely accepted that the invariance can be evaluated through three models: configural, metric, and residual invariance (Counsell et al., 2020). The first model verifies whether the factors and number of items per factor, that is if the instrument’s structure is acceptable for both groups being tested. The second model analyses if the factor loads are equally significant for the participants’ groups. The final model evaluates the level of latent trait that is necessary to ensure scores’ categories (Cheung & Rensvold, 2002).

Goodness-of-fit indexes used to evaluate the models were: Root Mean Square Error of Approximation (RMSEA; values should be lower than 0.08 or, preferably, 0.06), Standardized Root Mean Square Residual (SRMR; values should be lower than 0.08), Comparative Fit Index (CFI; values should be higher than 0.90 or, preferably, 0.95), and Tucker-Lewis Index (TLI; values should be above 0.90 or, preferably, 0.95; Brown, 2015). Instruments’ invariances were assessed through the use of the CFI difference test (ΔCFI), which implies that, while analyzing a new model, if there is a reduction of CFI index levels of 0.01 and above, it is considered that the measurement invariance cannot be accepted (Cheung & Rensvold, 2002; Counsell et al., 2020).

Results

MGCFA results for the EQ can be observed in Table 1. Goodness-of-fit indexes allude to configural, metric, and residual invariance, thus demonstrating that the instrument is equivalent for both men and women. Therefore, it allows that group comparison to be performed.

Table 1: Multi-Group Confirmatory Factor Analysis (MGCFA) for the Empathy Quotient (EQ) 

Measurement Invariance Goodness-of-Fit Indexes
RMSEA [95% CI] SRMR TLI CFI ?#61508;CFI
Configural Invariance Metric Invariance Scalar Invariance 0.031 [0.019 - 0.040] 0.034 [0.025 - 0.043] 0.034 [0.024 - 0.043] 0.061 0.064 0.062 0.988 0.985 0.985 0.990 0.986 0.986 - -0.004 0

IRI invariance analysis can be seen in Table 2, and it showed that the scale did not support metric invariance. Consequently, the instrument is not equivalent for men and women and the comparison between groups is not recommended.

Table 2: Multi-Group Confirmatory Factor Analysis (MGCFA) for the Interpersonal Reactivity Index (IRI) 

Measurement Invariance Goodness-of-Fit Indexes
RMSEA [95% CI] SRMR TLI CFI ?#61508;CFI
Configural Invariance Metric Invariance Scalar Invariance 0.050 [0.043 - 0.058] 0.053 [0.045 - 0.060] 0.052 [0.045 - 0.059] 0.078 0.082 0.079 0.920 0.912 0.914 0.929 0.919 0.917 - -0.010 -0.002

After observing that metric invariance was not supported for the IRI, extra analyses were conducted to specify which items did not show equivalent factor loads between the two groups. Items’ metric invariance was tested by using the chi-square difference test, in which items should present an associated non-significant p-value (Byrne, 2009; Counsell et al., 2020). The results can be seen in Table 3. The use of this method requires that some constraints are imposed across groups (Byrne, 2009) and, because of that, the first items of each subscale (items 1, 2, and 4) were constrained and are not shown in the table below. Six of the instrument’s items were not invariant, from which four composed the Empathic Concern subscale and two were a part of the Personal Distress subscale of the IRI (see Table 4).

Table 3: Invariance Models’ p-values, Assuming that the Configural Model is Indeed Invariant 

Factor loads’ model (IRI item) χ2 p-value
Item 3 4.36 0.037*
Item 5 1.56 0.21
Item 6 2.05 0.15
Item 7 0.034 0.86
Item 8 0.097 0.76
Item 9 3.03 0.082
Item 10 4.37 0.036*
Item 11 0.12 0.73
Item 12 1.64 0.20
Item 13 4.09 0.043*
Item 14 4.67 0.031*
Item 15 1.63 0.20
Item 16 2.28 0.13
Item 17 5.90 0.015*
Item 18 7.18 0.007**
Item 19 0.021 0.88
Item 20 1.10 0.29
Item 21 0.22 0.64

* = p < .05; ** = p < .01.

4: IRI Items that Did Not Present Invariance for Men and Women. 

Item Dimension Evaluated Item Description (version in Brazilian Portuguese)
IRI 3 Empathic Concern Sometimes I don't feel very sorry for other people when they are having problems. (Às vezes, eu não lamento por outras pessoas que estão tendo problemas).
IRI 10 Empathic Concern Other people's misfortunes do not usually disturb me a great deal. (As desgraças e problemas dos outros em geral não me perturbam muito).
IRI 13 Empathic Concern When I see someone being treated unfairly, I sometimes don't feel very much pity for them. (Quando eu vejo alguém sendo injustiçado, eu às vezes não sinto muita pena dele).
IRI 14 Personal Distress I am usually pretty effective in dealing with emergencies. (Geralmente eu sou muito efetivo para lidar com emergências).
IRI 17 Empathic Concern I would describe myself as a pretty soft-hearted person. (Eu descreveria a mim mesmo como uma pessoa de coração mole).
IRI 18 Personal Distress I tend to lose control during emergencies. (Eu tendo a perder o controle durante emergências).

Discussion

As to discuss women’s tendency to report higher empathy levels in self-report measures, MGFCAs were proposed to test whether two empathy instruments could be used to test sex differences. Findings of instrument invariance suggest that the scale is adequate to evaluate a construct in the same matter in diverse groups (Counsell et al., 2020). EQ’s goodness-of-fit indexes showed that the measurement was invariant for the configural, metric, and scalar models, which allows comparisons between the tested groups (Han et al., 2019). Although not many studies have verified if the EQ is equivalent for men and women, sex invariance was attested in an Italian sample (Paolo Senese et al., 2018) and in an English sample (Allison et al., 2011).

While the EQ was found to successfully assess sex differences in empathy, IRI did not exhibit invariance for the metric model. Six items’ factor loads were not invariant between the two analyzed groups, from which four items referred to the Empathic Concern subscale and two were attributed to the Personal Distress subscale.

Among the most recent research regarding IRI’s sex invariance, the instrument was reported as equivalent for men and women in Spanish (Lucas-Molina et al., 2017), and German (Grevenstein, 2020) adult samples. It is noteworthy that while the Chinese study’s adapted version of the IRI was invariant, it had a different factor structure from the original scale: whereas Davis (1983) stipulated four dimensions for the instrument, the Chinese authors presented a bifactor model that highlighted the Empathic Concern and Perspective Taking dimensions (Wang et al., 2020). In contrast, the measurement’s lack of invariance for the scalar model was attested in a sample of Colombian ex-combatants (Garcia-Barrera et al., 2017).

The IRI version used in the present study did not follow the original four dimensions proposed by the author of the original scale (Davis, 1980). Many researchers have classified the IRI’s original factor structure as inadequate throughout the years and have also proposed other structural configurations (Garcia-Barrera et al., 2017; Lucas-Molina et al., 2017; Murphy et al., 2020; Wang et al., 2020). Since factor structure problems have influenced other studies, the present research’s instrument configuration might not be the best fit and, because of that, it might have affected the MGCFA. Moreover, it is also worth mentioning that the Brazilian variant of the IRI was adapted and validated in 2001 (Koller et al., 2001). A recent systematic review and meta-analysis verified that only two studies assessed the IRI’s test-retest reliability (de Lima & Osório, 2021), so there is not enough evidence to stipulate whether the scale can be used to measure empathy in the same manner after some time has passed.

Although the IRI’s factor structure and temporal problems might have played a part in the lack of equivalence between groups, other aspects might have also influenced it. Most of the non-invariant items were a part of the Empathic Concern subscale. This affective empathy component involves genuine feelings toward others (Davis, 1983), especially worry (Koller et al., 2001). While empathy’s cognitive dimension is often not affected by sex differences, the affective aspect presents more differential results, as it is more influenced by gender-specific expectations (Kamas & Preston, 2021). Accordingly, groups’ non-invariance might be a consequence of participants’ gender differences, instead of sex-related ones. To act correspondingly to society’s expectations, men could report lower levels of empathic concern in a way to not appear or to not be considered too feminine (Kamas & Preston, 2021).

The idea that gender-specific social expectations might have contributed to the lack of invariance can also be verified when the Empathic Concern subscale’s items are analyzed. Item 17 (“I would describe myself as a pretty soft-hearted person”) involves the term “soft-hearted”, which may elucidate different meanings for men and women. Research conducted in over fifty countries, including Brazil, described that this adjective was considered by participants as being a feminine personality trait (Williams & Bennett, 1975). More recently, studies executed in Jamaica and Barbados (Walters & Capertener, 2017), and Spain (Castillo-Mayén & Monten-Berges, 2017) also reported that both men and women associate the term with a more feminine characteristic. Thus, the item’s invariance may be related to social expectations and pressure on men not to act similarly to women.

The other three non-equivalent items of the Empathic Concern subscale (items 3, 10, and 13) may also have been influenced by social factors, especially by social desirability. The items’ statements, which include feeling pity, disturbed, or worry towards others, might act as a trigger for conscious or unconscious distortions of responses (Hauck Filho & Valentini, 2019). Men and women might be differently affected by the impact of having to choose the most sociably accepted answer and which one that could be.

The three aforementioned items are also characterized by being the only reverse items of the Empathic Concern subscale. The psychometrical properties of an instrument are often hindered when the measurement contains negative items since participants have more difficulty in assessing the item’s meaning when it’s inverted (Kam et al., 2021; Vigil-Colet et al., 2020). Consequently, individuals do not always understand a construct or an item the same way and, because of that, group comparison is also undermined when the instruments comprise inverse items (Suárez-Álvarez et al., 2018). Therefore, the fact that the previously mentioned items were constructed inversely might have contributed to the differentiated measure of empathic concern by men and women in this study.

Two items of the Personal Distress subscale were not invariant in this study. Personal distress is distinguished from empathic concern because the feelings of worry experienced by an individual are directed toward the self, not others, in an emotionally demanding context (Israelashvili et al., 2020). Nevertheless, this study’s two non-invariant items of the Personal Distress subscale do not include themes or statements that make emotional defying conditions explicit or implicit. The term “emergencies” might be too general and broad for Brazilian participants’ full comprehension of the construct since it might not allude to the emotional response that those items should generate. Conceptual equivalence is an important aspect to be examined while a psychological measurement is translated and adapted to different languages as it evaluates if a specific term or expression, even if it is adequately translated, measures the same construct in diverse cultures (Khouri & Silva, 2019). Those items may not be suitable to represent personal distress and, accordingly, the participants may not interpret them the same way. That could have contributed to the lack of metric equivalence between men and women.

Item 14 of the Personal Distress subscale (“I am usually pretty effective in dealing with emergencies”), in addition, was classified as troublesome in the step of the statistical validity of the instrument. The aforementioned item might not be an adequate measure of the construct, thus it can be removed from the Brazilian version of the IRI (Koller et al., 2001). Besides the conceptual equivalence problem presented by the item, it did not show the necessary psychometrical properties.

Measurements that do not display group invariance do not necessarily measure different constructs, but they might have diverse meanings attributed to each of the groups (Han et al., 2019). As a result, the absence of metric invariance presented by the IRI might mainly reflect a masculine difficulty in expressing worry about others, which is most possibly related to the fear of not behaving as it is expected by the respective gender and the influences that men suffer throughout their childhood and adolescence. Future research should not only analyze empathy’s sex differences but also investigate gender’s influence on empathy. Moreover, it should be noted that problems associated with items’ writing (e.g., social desirability and terminology issues) and psychometrical (e.g., an item that did not contribute to the scale’s confiability), and structural properties (e.g., reverse items) of the Brazilian version of the IRI might have also influenced the lack of metric invariance for some items.

Conclusions

This study’s objective was to analyze two empathy measurements to find out if there were any measurable differences in empathy for a sample of Brazilian adult men and women. In agreement with this theme’s literature, EQ is shown to be an invariant instrument and, thus, can be used to compare men’s and women’s perceptions of their empathy.

In contrast, one should be careful when evaluating empathic concern’s sex differences, since the IRI was not considered to be equivalent for this study’s sample. Those non-invariant IRI items may be influenced by gender and social expectations. In addition, social desirability, psychometrical, temporal, and adaptation issues might have also contributed to the instrument’s aforementioned results. Future studies may still use the IRI to measure empathy, but they will need to take into consideration that some items seem to be problematic when comparing men and women.

One of the present study’s limitations refers to the administration of only two empathy instruments to test their invariance. Since empathy is a complex construct, the use of other forms of measure might provide more information and enhance its comprehension, especially because empathy involves many different theoretical frameworks.

This study contributes to the research of empathy’s sex differences by being the first in Brazil to test empathy instruments’ invariance. The results reported here may highlight the importance of verifying if the psychological measurements used by research are adequate to measure what is expected. Therefore, the present study’s results may be able to help researchers decide which scale is the most suitable to be used in the Brazilian context, outlining that the more recent instrument seems to be more robust when considering the sex variable.

Whereas this study has reiterated the existence of a sex difference concerning empathy, it has also pointed out that not all self-report empathy instruments are appropriate to measure such difference, as men and women seem to comprehend and answer some items (especially those that comprise empathy’s affective components, such as empathic concern) differently. Consequently, men’s and women’s divergence regarding empathy seems to influence empathy measurements.

References

Allison, C., Baron-Cohen, S., Wheelwright, S. J., Stone, M. H., & Muncer, S. J. (2011). Psychometric analysis of the Empathy Quotient (EQ). Personality and Individual Differences, 51(7), 829-835. https://doi.org/10.1016/j.paid.2011.07.005Links ]

Azevedo, S. M. L. de, Mota, M. M. P. E. da, & Mettrau, M. B. (2018). Empatia: perfil da produção científica e medidas mais utilizadas em pesquisa [Empathy: the profile of scientific production and measure more used in research]. Estudos Interdisciplinares em Psicologia, 9(3), 03-23. https://doi.org/10.5433/2236-6407.2018v9n3p3Links ]

Baez, S., Flichtentrei, D., Prats, M., Mastandueno, R., García, A. M., Cetkovich, M., & Ibáñez, A. (2017). Men, women…who cares? A population-based study on sex differences and gender roles in empathy and moral cognition. PloS one, 12(6), e0179336. https://doi.org/10.1371/journal.pone.0179336Links ]

Baron-Cohen, S., & Wheelwright, S. (2004). The Empathy Quotient: An Investigation of Adults with Asperger Syndrome or High Functioning Autism, and Normal Sex Differences. Journal of Autism and Developmental Disorders, 34(2), 163-175. https://doi.org/10.1023/B:JADD.0000022607.19833.00Links ]

Brown, T. A. (2015). Confirmatory factor analysis for applied research (2nd ed.). The Guilford Press. [ Links ]

Byrne, B. (2009). Structural Equation Modeling with AMOS: Basic Concepts, Applications, and Programming (Multivariate Applications) (2nd ed.). Routledge. [ Links ]

Calandri, E., Graziano, F., Testa, S., Cattelino, E., & Begotti, T. (2019). Empathy and depression among early adolescents: The moderating role of parental support. Frontiers in Psychology, 10, Article e1447. https://doi.org/10.3389/fpsyg.2019.01447Links ]

Castelhano-Souza, M., Mendes, I. A. C., Martins, J. C. A., Trevizan, M. A., Souza-Júnior, V. D., & Godoy, S. de. (2019). Validação semântica das versões curtas das Escalas de Medição do Quociente de Empatia/Sistematização [Semantic validation of the short versions of the Empathy-Systemizing Quotient Scales]. Revista Latino-Americana De Enfermagem, 26, Article e3044. https://doi.org/10.1590/1518-8345.2606.3044Links ]

Castillo-Mayén, R., & Montes-Berges, B. (2017). ¿Which Automatic Associations Prevail? Congruency and Reverse Priming Effects on Implicit Gender Stereotyping. Pensamiento Psicológico, 15(1), 33-49. https://doi.org/https://dx.doi.org/10.11144/Javerianacali.PPSI15-1.AACRLinks ]

Cheung, G. W., & Rensvold, R. B. (2002). Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling, 9(2), 233-255. https://doi.org/10.1207/S15328007SEM0902_5Links ]

Clark, M. A., Robertson, M. M., & Young, S. (2018). “I feel your pain”: A critical review of organizational research on empathy. Journal of Organizational Behavior, 40(2), 166-192. https://doi.org/10.1002/job.2348Links ]

Counsell, A., Cribbie, R. A., & Flora, D. B. (2020). Evaluating Equivalence Testing Methods for Measurement Invariance. Multivariate behavioral research, 55(2), 312-328. https://doi.org/10.1080/00273171.2019.1633617Links ]

Dang, J., King, K. M., & Inzlicht, M. (2020). Why Are Self-Report and Behavioral Measures Weakly Correlated?. Trends in cognitive sciences, 24(4), 267-269. https://doi.org/10.1016/j.tics.2020.01.007Links ]

Davis, M. H. (1980). A Multidimensional Approach to Individual Differences in Empathy. JSAS Catalog of Selected Documents in Psychology, 10, 85-103. https://www.uv.es/~friasnav/Davis_1980.pdf Links ]

Davis, M. H. (1983). Measuring individual differences in empathy: Evidence for a multidimensional approach. Journal of Personality and Social Psychology, 44(1), 113-126. https://doi.org/10.1037/0022-3514.44.1.113Links ]

de Lima, F. F., & Osório, F. L. (2021). Empathy: Assessment Instruments and Psychometric Quality - A Systematic Literature Review With a Meta-Analysis of the Past Ten Years. Frontiers in psychology, 12, Article e781346. https://doi.org/10.3389/fpsyg.2021.781346Links ]

Di Tella, M., Miti, F., Ardito, R. B., & Adenzato, M. (2020). Social cognition and sex: Are men and women really different? Personality and Individual Differences, 162, 110045. https://doi.org/10.1016/j.paid.2020.110045Links ]

Diotaiuti, P., Valente, G., Mancone, S., Grambone, A., & Chirico, A. (2021). Metric Goodness and Measurement Invariance of the Italian Brief Version of Interpersonal Reactivity Index: A Study With Young Adults. Frontiers in psychology, 12, Article e773363. https://doi.org/10.3389/fpsyg.2021.773363Links ]

Dryburgh, N. S. J., & Vachon, D. D. (2019). Relating sex differences in aggression to three forms of empathy. Personality and Individual Differences, 151, Article e109526. https://doi.org/10.1016/j.paid.2019.109526Links ]

Eisenberg, N., & Lennon, R. (1983). Sex differences in empathy and related capacities. Psychological bulletin, 94(1), 100-131. https://doi.org/10.1037/0033-2909.94.1.100Links ]

Falcone, E. M. de O., Ferreira, M. C., Luz, R. C. M., Fernandes, C. S., Faria, C. de A., D'Augustin, J. F., Sardinha, A., & Pinho, V. D. (2008). Inventário de Empatia (I.E.): desenvolvimento e validação de uma medida brasileira [Construction of a brazilian measure to evaluate empathy: the Empathy Inventory (E.I.)]. Avaliação Psicológica, 7(3), 321-334. http://pepsic.bvsalud.org/scielo.php?script=sci_arttext&pid=S1677-04712008000300006&lng=pt&tlng=pt. [ Links ]

Garcia-Barrera, M. A., Karr, J. E., Trujillo-Orrego, N., Trujillo-Orrego, S., & Pineda, D. A. (2017). Evaluating empathy in Colombian ex-combatants: Examination of the internal structure of the Interpersonal Reactivity Index (IRI) in Spanish. Psychological Assessment, 29(1), 116-122. https://doi.org/10.1037/pas0000331Links ]

Grevenstein, D. (2020). Factorial validity and measurement invariance across gender groups of the German version of the Interpersonal Reactivity Index. Measurement Instruments for the Social Sciences, 2(8), 1-8. https://doi.org/10.1186/s42409-020-00015-2Links ]

Hall, J. A., & Schwartz, R. (2019). Empathy present and future. The Journal of social psychology, 159(3), 225-243. https://doi.org/10.1080/00224545.2018.1477442Links ]

Han, K., Colarelli, S. M., & Weed, N. C. (2019). Methodological and statistical advances in the consideration of cultural diversity in assessment: A critical review of group classification and measurement invariance testing. Psychological Assessment, 31(12), 1481-1496. https://doi.org/10.1037/pas0000731Links ]

Hauck Filho, N., & Valentini, F. (2019). O controle da desejabilidade social no autorrelato usando quádruplas de itens [Social desirability control in self-report measures using quadruples of items]. Avaliação Psicológica, 18(3), 1-3. https://dx.doi.org/10.15689/ap.2019.1803.edLinks ]

Israelashvili, J., Sauter, D. A., & Fischer, A. H. (2020). Different faces of empathy: Feelings of similarity disrupt recognition of negative emotions. Journal of Experimental Social Psychology, 87, Article e103912. https://doi.org/10.1016/j.jesp.2019.103912Links ]

Kam, C. C. S., Meyer, J. P., & Sun, S. (2021). Why Do People Agree With Both Regular and Reversed Items? A Logical Response Perspective. Assessment, 28(4), 1110-1124. https://doi.org/10.1177/10731911211001931Links ]

Kamas, L., & Preston, A. (2021). Empathy, gender, and prosocial behavior. Journal of Behavioral and Experimental Economics, 92, Article e101654. https://doi.org/10.1016/j.socec.2020.101654Links ]

Khouri, N. D. M. A. A., & Silva, J. C. (2019). Revisão narrativa: metodologias de adaptação e validação de instrumentos psicológicos [Narrative review: psychological instruments’ adaptation and validation methodologies]. Revista Eixos, 8(2), 220-229. http://revistaeixo.ifb.edu.br/index.php/RevistaEixo/article/view/588/483Links ]

Koller, S. H., Camino, C., & Ribeiro, J. (2001). Adaptação e validação interna de duas escalas de empatia para uso no Brasil [Adaptation and internal validation of two empathy scales for use in Brazil]. Estudos de Psicologia, 18(3), 43-53. https://doi.org/10.1590/S0103-166X2001000300004Links ]

Li, C. H. (2016). Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares. Behavior research methods, 48(3), 936-949. https://doi.org/10.3758/s13428-015-0619-7Links ]

Liu, J., Qiao, X., Dong, F., & Raine, A. (2018). The Chinese version of the cognitive, affective, and somatic empathy scale for children: Validation, gender invariance and associated factors. PLoS ONE, 13(5), Article e0195268. https://link.gale.com/apps/doc/A537668111/AONE?u=capes&sid=bookmark-AONE&xid=7528d3e7Links ]

Lucas-Molina, B., Pérez-Albéniz, A., Ortuño-Sierra, J., & Fonseca-Pedrero, E. (2017). Dimensional structure and measurement invariance of the Interpersonal Reactivity Index (IRI) across gender. Psicothema, 29(4), 590-595. https://doi.org/10.7334/psicothema2017.19Links ]

Murphy, B. A., Costello, T. H., Watts, A. L., Cheong, Y. F., Berg, J. M., & Lilienfeld, S. O. (2020). Strengths and Weaknesses of Two Empathy Measures: A Comparison of the Measurement Precision, Construct Validity, and Incremental Validity of Two Multidimensional Indices. Assessment, 27(2), 246-260. https://doi.org/10.1177/1073191118777636Links ]

Paolo Senese, V., De Nicola, A., Passaro, A., & Ruggiero, G. (2018). The factorial structure of a 15-item version of the Italian Empathy Quotient Scale. European Journal of Psychological Assessment, 34(5), 344-351. https://doi.org/10.1027/1015-5759/a000348Links ]

Pavolov, G., Shi, D., Maydeu-Olivares, A., & Fairchild, A. (2021). Item desirability matching in forced-choice test construction. Personality and Individual Differences, 183, Article e111114. https://doi.org/10.1016/j.paid.2021.111114Links ]

Roover, K. D. (2022). Finding clusters of groups with measurement invariance: Unraveling intercept non-invariance with mixture multigroup factor analysis. Structural Equation Modeling: A Multidisciplinary Journal, 28(5), 1-21. https://doi.org/10.1080/10705511.2020.1866577Links ]

Suárez-Álvarez, J., Pedrosa, I., Lozano, L. M., García-Cueto, E., Cuesta, M., & Muñiz, J. (2018). Using reversed items in Likert scales: A questionable practice. Psicothema, 30(2), 149-158. https://doi.org/10.7334/psicothema2018.33Links ]

Takamatsu, R., Tsou, Y.-T., Kusumi, T., & Rieffe, C. (2021). The Japanese empathy questionnaire (EmQue) for preschool children: Psychometric properties and measurement invariance across gender. International Journal of Behavioral Development, 45(5), 453-462. https://doi.org/10.1177/01650254211022852Links ]

Ventura-León, J., Caycho-Rodríguez, T., & Dominguez-Lara, S. (2019). Invarianza Factorial Según Sexo de la Basic Empathy Scale Abreviada en Adolescentes Peruanos [Factor Invariance Across Sex of the Abbreviated Basic Empathy Scale in Peruvian Adolescents]. Psykhe (Santiago), 28(2), 1-11. https://dx.doi.org/10.7764/psykhe.28.2.1418Links ]

Vesely, S., & Klöckner, C. A. (2020). Social Desirability in Environmental Psychology Research: Three Meta-Analyses. Frontiers in psychology, 11, 1395-1395. https://doi.org/10.3389/fpsyg.2020.01395Links ]

Vigil-Colet, A., Navarro-González, D., & Morales-Vives, F. (2020). To reverse or to not reverse Likert-type items: That is the question. Psicothema, 32(1), 108-114. https://doi.org/10.7334/psicothema2019.286Links ]

Walters, G. K., & Carpenter, K. (2017). Gender-Role Stereotypes and Culture in Jamaica and Barbados. In K. Carpenter (Org.), Interweaving Tapestries of Culture and Sexuality in the Caribbean (pp. 15-34). Palgrave Macmillan. [ Links ]

Wang, Y., Li, Y., Xiao, W., Fu, Y., & Jie, J. (2020). Investigation on the Rationality of the Extant Ways of Scoring the Interpersonal Reactivity Index Based on Confirmatory Factor Analysis. Frontiers in Psychology, 11, Article e1086. https://doi.org/10.3389/fpsyg.2020.01086Links ]

Williams, J. E., & Bennett, S. M. (1975). The definition of sex stereotypes via the adjective check list. Sex Roles, 1, 327-337. https://doi.org/10.1007/BF00287224Links ]

Zhao, Q., Neumann, D. L., Cao, Y., Baron-Cohen, S., Yan, C., Chan, R., & Shum, D. (2019). Culture-Sex Interaction and the Self-Report Empathy in Australians and Mainland Chinese. Frontiers in psychology, 10, Article e396. https://doi.org/10.3389/fpsyg.2019.00396Links ]

Received: May 04, 2022; Revised: November 22, 2023; Accepted: March 19, 2024

Creative Commons License This is an open-access article distributed under the terms of the Creative Commons Attribution License