Measurement of courtesy stigma: Systematic literature review

This article aimed to conduct a literature review regarding the instruments used to measure the stigma of courtesy, in the databases related to the field of investigation (


3
According to Erving Goffman (1922Goffman ( -1982)), social stigma is a trait or differentiation that places the individual in a position of inferiority when compared to hegemonic groups.This trait is a derogatory attribute that, by reinforcing the ideal of superiority of the normative character, causes the dehumanization and social exclusion of the stigmatized person.In this context, stigma can be classified in three ways depending on its origin: abominations of the body, when there is some type of physical deformity; blemishes of individual character, related to moral failure; and tribal stigma of race, nation, and religion, which refer to cultural aspects (Goffman, 1975).
The social construction of stigma takes place through the recognition and consequent devaluation of a difference or trait carried by the subject.Therefore, social or public stigma comes from the idea of the other in relation to the stigmatized person, so that when the marked subject becomes aware of this public perception, starts to agree with this negative view and apply it to themself, a situation of internalized stigma arises (Ronzani et al., 2017).Also from this perspective, it can be observed that the internalization of this differentiation may be associated with harm to this individual, such as decreased self-esteem, intensification of negative emotions, and social withdrawal (Malagodi et al., 2019).
Recent studies (McCann & Lubman, 2017;Huang et al., 2016) have investigated the way social stigma interferes in the lives of people that live directly with individuals who carry this trait, such as family members and healthcare providers.This stigmatization process occurs when the family member or caregiver associated with this individual begins to experience situations of suffering and harm to their physical and mental health.Mak and Cheung (2008) observed that the internalization of this negative view by caregivers of individuals with intellectual disabilities or mental illness is strongly associated with a greater subjective burden related to the act of caring and negativity in relation to this process, as well as an increase in the perception of inferiority and social withdrawal.It can, therefore, be said that courtesy stigma results from negative social perceptions in relation to the interaction between the marked subject and close people, while affiliation stigma would correspond to the internalization of these impressions (Mak & Cheung, 2008).
Accordingly, it is necessary to develop and adapt scales that aim to measure courtesy stigma and its internalization in family members of people who carry some type of derogatory trait, as well as in professionals whose work is directly related to the healthcare of these individuals.This study aimed to conduct a literature review on courtesy and affiliation stigmas, focusing on the validation of the instruments used to measure these constructs.This analysis of the state of the art aimed to establish a starting point for future studies.

Method
The report of this systematic literature review was based on the Preferred Report Items for Systematic Reviews and Meta-Analyses (PRISMA) recommendations (Galvão et al., 2015), with the aim of increasing its future reproducibility.For this, a bibliographic search was carried out involving the concept of courtesy stigma as the main subject in articles indexed in the databases.The keywords courtesy stigma, affiliate stigma, and associative stigma were used.
These databases were chosen because they are all recognized in the health areas and publish evidence-based, peer-reviewed studies.Although the keywords used are not included in the Health Sciences Descriptors (Descritores em Ciências da Saúde [DeCS]), the criteria for their selections were based primarily on the keyword courtesy stigma, maintained because it was a term initially used by Goffman, the original author in the area of stigma, while the keywords affiliate stigma and associative stigma were used later due to the conceptual approximation they establish with the first, and because they are also terms commonly found in articles.
With regard to the Boolean operators used in the search process, only the AND operator was chosen, with the search procedure for all databases carried out as follows: "Courtesy" AND "Stigma", "Associative" AND "Stigma" and "Affiliate" AND "Stigma".In the case of the Capes Publications Portal, no specific database was selected from among those that make up its collection, aiming to expand the screening process of articles related to the topic.
The inclusion criteria were articles published in English, Portuguese, or Spanish that had courtesy stigma as their central theme, were original studies of an empirical nature and analyzed the psychometric properties of instruments that measure courtesy stigma or similar constructs.
No time period in relation to the year of publication of the articles was established.Articles until the year 2019 were included.
First, the titles and abstracts of the articles were read, and we observed whether they met the inclusion criteria proposed for this literature review, with the articles that did not meet these criteria being disregarded.Subsequently, the elimination of duplicate articles was conducted, and the remaining articles were read in full.To facilitate the process of analyzing the articles, descriptive categories were created based on the studied population, the condition of the people in need of care, name of the instrument used, absence or presence of translation, sample size, number of items and factors of these instruments, and types of validity and reliability employed in the study.During this article selection process, two researchers carried out the independent categorization of the articles, and, in situations in which there was no consensus, a third researcher made the final decision.

Results
The electronic search in the databases resulted in a total of 564 abstracts, of which 314 were eliminated because they were duplicates.A total of 240 studies were excluded after reading the titles and abstract content, resulting in 10 full texts that fulfilled the inclusion criteria and were read in full.All the articles identified in the databases were published in English (Figure 1).The ASS (Mak & Cheung, 2008) is an instrument developed in China used with a sample of family members of individuals with some type of mental illness or intellectual disability, with its data showing good stability and validity for these groups (Saffari et al., 2019).
The CASS (Yanos et al., 2017), a scale recently developed with mental health care providers in the United States of America (USA), measures the associative stigma of these professionals with people who need care in this area, having shown good internal consistency and convergent validity with other stigma indicators.
The CCSS (Liu et al., 2014) focuses on the stigma experienced by family members and caregivers of people with the human immunodeficiency virus (HIV).The study that originated this scale assumes that people that are seronegative may also experience a certain degree of stigma, as they are associated with people who are seropositive.
The PISMI (Zisman-Ilani et al., 2013), a scale based on the Internalized Stigma of Mental Illness (ISMI) scale (Ritsher et al., 2003), starts from the consideration that family members of people with severe mental illnesses may also be target of stigma.Therefore, the elaboration of the PISMI was premised on presenting the same factor structure as the ISMI.
The LBG-ASM (Robinson & Brewster, 2016) was developed to understand the emotional and psychological impact of stigma among family and close friends of lesbian, gay, and bisexual (LGB) people and propose initiatives that offer them greater support.Finally, the ASS-M scale (Yun et al., 2018) was developed for the context of the population residing in Malaysia and is based on the ASS, mentioned above.
Regarding some general characteristics of the included studies, the samples used ranged from a minimum of 180 people to a maximum of 649, while the number of items present in the scales had a minimum of 12 and a maximum of 22. Concerning the number of factors, most of the scales are composed of three factors, except for the CASS and ASS-M scales, which have four factors each, and the CCSS scale, with only two factors (Table 1).
Some studies used translated versions of the instruments into Persian, Chinese, Hebrew, Arabic, and Malay.In these studies, the technique of translation into the language of research interest and back-translation into the original language, which in all cases was English, was adopted.In the specific case of the ASS scale, this instrument has already been translated into seven languages: Chinese (Mak & Cheung, 2008), Urdu (Farzand & Abid, 2013), Hebrew (Werner & Shulman, 2015), Hindi (Banga & Ghosh, 2017), Persian (Denahvi et al., 2011), Malay (Yun et al., 2018), and Amharic (Hailemariam, 2015) (Table 1).Regarding the characteristics of the population studied, eight out of the ten selected studies considered family members and close people to be the main targets of courtesy stigma, Note.HIV -human immunodeficiency virus; LGB -lesbian, gay, and bisexual.
In relation to the reliability of the instruments, Cronbach's alpha values were considered, both for the scales as a whole and for their respective dimensions, establishing that values above 0.70 correspond to a good indicator of internal consistency (Souza et al., 2017).In almost all the studies, the alpha values were above 0.70, except for one, carried out with the CASS scale, in which the stereotype about the mental health of the professionals (SMHP) dimension had an alpha of 0.68, and another, with the PISMI scale, in which the social withdrawal (SW) and alienation (AL) dimensions had an alpha of 0.65 and 0.61, respectively.Three studies reported only the alpha values referring to the dimensions of the scale, and not the general alpha of the instrument (Table 3).
Concerning the stability of the scales, the test-retest statistical analysis and intraclass correlation coefficients (ICC) above 0.70 were considered to be recommended (Souza et al., 2017).
Only two studies used this type of analysis and both ICC values were above 0.70, with an interval of two to three weeks between the first and second application of the instrument (Table 3).
To verify the factorial validity of the instruments, most studies used exploratory factor analysis (EFA) and confirmatory factor analysis (CFA), with the principal component analysis (PCA) technique and the Rasch model adopted in some cases.A variance of 50% was considered the minimum cumulative percentage of the total variance extracted by successive factors to indicate an adequate factorial fit (Howard, 2016).
In the PISMI scale, the extraction of three factors accounted for 54.2% of the total variance, indicating an adequate fit to the model.A similar value was also found for the LGB-ASM scale, in which the extraction of three factors represented 54.4%.For the CASS scale, only one article confirmed the four-factor structure through CFA.Other statistical techniques of EFA that were used combined with PCA did not report the percentage of variance extracted by each factor (Table 3).
For the CCSS scale, the two-factor model represented 83.0% of the extracted variance, which indicated a good fit to the model and the best extracted variance when compared to the other instruments found in this review.The factor structure of this scale was also confirmed using CFA (Table 3).
The ASS scale, in turn, had the three-factor model confirmed in two studies through CFA, while in the original study, in which the scale was developed, PCA indicated that the extraction of one factor was responsible for 49.03% of the total variance considering a sample of family members of people with intellectual disabilities and 43.87% for a sample of family members of people with mental illness, evidencing the impossibility of the scale having only one factor (Table 3).
Also, in relation to the ASS scale, a specific study used Rasch analysis to justify the unidimensionality of each of the three factors, confirming that they are separate domains.In this study, PCA also demonstrated that the extraction of one factor was only responsible for 46.28% of the total variance.Another study also used the Rasch analysis, however, aiming to assess the difficulty of the items in each factor.Finally, the ASS-M scale had its four-factor structure justified through EFA and CFA, although the study in question did not indicate the percentage of variance extracted by each factor (Table 3).
In the discriminant validity, significant and negative correlations were found between affiliation stigma and quality of life, social support, self-esteem, quality of care, and social desirability, through the instruments: Short Form 12 (Montazeri et al., 2009), World Health Organization Quality of Life-BREF (Yao et al., 2002), Multidimensional Scale of Perceived Social Support (Bagherian-Sararoudi et al., 2013), Rosenberg Self-Esteem Scale (Shapurian et al., 1987), the Quality of Care Scale (Salyers et al., 2015), and the Balanced Inventory of Desirable Responding (Paulhus & Reid, 1991) (Table 3).
The validity of known groups was obtained through the hierarchical regression model, multiple lnear regression analysis, the Rasch model or through simple correlations, such as Pearson's r.In this sense, one study with the ASS scale showed a significant association between the age of the caregiver of people with mental illness and the scale's total score.Another study, besides also having applied the ASS, used the Rasch model, in order to assess the difficulty of the items, and suggested that men and women score the scale differently.Two other studies, in which the CASS scale was used, found that mental health care providers obtained different results in the total score of the scale due to age, gender, educational level, and professional occupation (Table 3).

Table 3
Reliability and validity of the instruments

Discussion
The instruments found in this literature review presented, in general, Cronbach's alpha values above 0.70, which indicates good internal consistency.However, it should be noted that these values are subject to the influence of the characteristics of the samples, the type of instrument, and the method of administration used, factors that were quite diverse in the studies analyzed (Roach, 2016).Another important point regarding the alpha coefficient refers to the fact that this value is strongly influenced by the number of items in the measurement instrument, and, although the scales considered in this review present a similar number of items, there were scales with a difference of up to ten items when compared to each other (Roach, 2016).
With regard to the test-retest statistical analysis, only two studies used this resource.It is important to consider that, although these studies presented satisfactory values (ICC above 0.70), it is necessary to reapply this method considering different periods between the first and second application, as test-retest reliability tends to decrease as the test reapplication is delayed (Nakagawa et al., 2017).
Regarding factorial validity, few of the studies included repeatedly evaluated the same instrument.The exception was the ASS scale, in which one study confirmed the data obtained from the original article on the development of the scale.This happened because, when analyzing the values obtained in the original study through the classical test theory (CTT) with modern statistical techniques, such as the Rasch model, a recent study confirmed the unidimensionality of each of the three scale factors (Chang et al., 2015).Two other studies were able, through the use of the Rasch model, to confirm the factorial structure of the scale, as well as its suitability for application to other populations besides caregivers of people with intellectual disabilities or mental illness (Saffari et al., 2019;Chang et al., 2016).

11
Considering convergent validity, the studies suggest that higher scores in the affiliation stigma scale are accompanied by an increase in scores in scales that measure the depression, anxiety, caregiver burden, burnout, awareness of public devaluation, and awareness of stigma variables.Similarly, in the discriminant validity analysis, high scores in the affiliation stigma scale suggested decreased scores in scales that measure quality of life, social support, self-esteem, quality of care, and social desirability (Saffari et al., 2019;Yanos et al., 2017;Chang et al., 2015;Robinson & Brewster, 2016;Chang et al., 2016).
Finally, the analysis of the validity of known groups for the CASS scale, through a study carried out with mental health care providers in China, verified that older professionals with a lower level of education and who worked in inpatient units were more subject to association stigma (Lin et al., 2018).Through analysis of the differential item function (DIF), one study on the ASS showed that women and men scored differently in relation to the affective and cognitive dimensions of the scale, which raises the hypothesis that this finding could be due to perceptions of gender roles in society (Chang et al., 2015;Su et al., 2013).
The analysis of the articles included in this review shows that there is still a lack of studies that assess the psychometric properties of instruments that measure courtesy stigma or similar constructs.Accordingly, despite many initiatives aimed at the development of new scales, most of the studies have low reproducibility, in the sense that there are no additional studies that allow the validation of the factor structures of the instruments included in this review or the generalization of their application to different cultures, population contexts, and health conditions.It is, therefore, necessary to develop and validate instruments that measure courtesy stigma, taking into account different population contexts and proposals that minimize the harmful effects of this type of stigma on societies.

Table 1
General characteristics of the selected studies Affiliate Stigma Scale-Malay (ASS-M).
while only two studies addressed mental health care providers.Considering the contexts covered in the studies, half referred to mental illness, followed by dementia in two of the studies.The themes of people with mental illness or intellectual disability, human immunodeficiency virus (HIV), and LGB people appeared in one study each (Table2).

Table 3
Reliability and validity of the instruments (continuation) s ); standardized regression coefficient (β).