少点错误 2024年11月05日
Empathy/Systemizing Quotient is a poor/biased model for the autism/sex link
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文探讨了性别差异和自闭症与系统化和共情能力之间关系的复杂性。研究者通过调查问卷,发现性别差异和自闭症在系统化和共情方面的表现并非简单的权衡取舍。男性更倾向于科技兴趣和不随和,而自闭症人群则更专注于细节、内向、社交困难且感官敏感。研究结果表明,系统化和共情能力并非单一维度,而是多维度的复杂特征,需要更深入的研究来理解它们与性别差异和自闭症之间的关系。

🤔 **研究背景:**基于自闭症的系统化/共情理论,认为自闭症与性别差异都涉及到系统化(理解规则性系统)和共情(理解他人情感)之间的权衡,男性和自闭症人士更倾向于系统化,而女性和非自闭症人士更倾向于共情。

📊 **数据收集与分析:**研究者通过线上平台招募了不同性别和自闭症诊断状态的参与者,收集了与自闭症、系统化和共情相关的问卷数据,并使用主成分分析法提取数据中的主要维度。

📈 **测量不变性:**研究发现,性别差异在问卷项目中的表现与整体得分之间的相关性较低,这表明性别差异并非简单地反映在系统化/共情维度上。而自闭症群体与非自闭症群体之间的差异则表现出较高的相关性。

🧩 **因子分析与解释:**通过因子分析,研究者将提取出的多个维度进行解释,发现性别差异和自闭症在这些维度上的表现存在差异。例如,性别差异主要体现在科技兴趣和不随和方面,而自闭症差异则体现在社交困难、感官敏感和专注于细节等方面。

💡 **结论:**研究结果表明,性别差异和自闭症在系统化和共情方面的表现并非简单的权衡取舍,而是涉及到多个维度的复杂关系。需要更深入的研究来理解这些维度及其与性别差异和自闭症之间的关联。

Published on November 4, 2024 9:11 PM GMT

Thank you to Justis Millis for providing feedback and proofreading on this post. This post is also available on my Substack.

TL/DR: Contrary to the theory that neurological sex differences and autism both involve the same tradeoff of systemic versus empathic thinking, I found complex differences. It turned out that men were more interested in technology and more disagreeable, whereas autistic people had a narrower focus on details, were more introverted, more socially challenged, and had stronger sensory sensitivity.

I asked people on Prolific a bunch of questions that are supposed to be related to autism, systemizing and empathy. As a preview before I get into the details, the overall results can be seen here:

If there was only a single underlying tradeoff that all of these scales were measuring, then we would expect all of the group differences to be highly correlated, with the items just varying based on how well they “tap into” this underlying tradeoff. As that is not what we see, I think there’s something more complicated going on.

Background

The Empathizing/Systemizing theory of autism asserts that there’s a neurological tradeoff between Empathizing (understanding people’s emotions) and Systemizing (understanding how deterministic, rule-based systems work), and that men and autistic people are more prone to Systemizing, whereas women and allistic people are more prone to Empathizing. Furthermore, it asserts that a deficit in Empathizing or a switch towards Systemizing over Empathizing is not merely a feature of autism, but rather core to what autism is.

The main proponent of this theory is Simon Baron-Cohen, who often justifies it in terms of several psychometric scales he’s made to measure traits like Systemizing, Empathizing or autism. A recent example is Testing the Empathizing-Systemizing theory of sex differences and the Extreme Male Brain theory of autism in half a million people (2018), a study where he claims to find support for his theory.

I used to consider the theory plausible, but I ran into several problems. Some formulations of the theory go really heavily into the idea that the Empathizing-Systemizing axis is the core of feminine/masculine psychology, but that seems somewhat sketchy. Simon Baron-Cohen’s scales have been criticized for sex bias, but while he came up with a scale that supposedly balanced out the bias, he didn’t use the standard psychometric methods for testing for sex bias, and when I took a superficial look with those methods, it still seemed biased. Furthermore, genetic studies of empathy and systemizing seem to have found it to be close to independent of autism. As such, I became suspicious the theory might be false.

Data

For the main analysis, I collected a bunch of items supposed to be relevant for measuring autism. The full item list can be seen in the appendix. To get data, I used Prolific to recruit 100 non-autistic men, 100 non-autistic women, 50 autistic men diagnosed in childhood, 50 in adulthood, 50 autistic women diagnosed in childhood and 50 in adulthood.

I asked the respondents to rate each item on a scale from “Disagree strongly”, “Disagree”, “Neither", “Agree” to “Agree strongly”. To quantify the data, I mapped these response options to -2, -1, 0, 1, and 2. Also, sometimes to make the computation more convenient, I divided by the standard deviation.

I also had a bunch of ideas for follow-up analyses, but they didn’t really lead anywhere, so I’m not going to publish them here right now. If you want access to the data, contact me and I will provide it.

Principal component analysis

“Empathizing” and “Systemizing” are generally conceived of as quite abstract general traits, but (especially for Systemizing) many of the items are quite concrete and narrow, e.g. “If I were buying a computer, I would want to know exact details about its hard drive capacity and processor speed”.

This is fine and perfectly intentional. If we expect a general trait to influence many distinct behaviors within a person, we can infer the level of the trait by looking for this overall pattern of behaviors, rather than any one specific behavior. In fact, unless we know the root cause of variation in a general trait, this seems to be the only way to measure a general trait.

One way to quantify patterns of variation is to use principal component analysis, which lists independent axes in the data in descending order of variance. Because the item responses occur in a bounded range from -2 to 2, no individual item has much variance. Instead, the primary variance occurs because of the correlation between the items (so some people are outliers on many items at once), and therefore principal component analysis narrows in one the dimensions that are relevant to general traits.

As a sanity check, if I extract the first principal component of all the items, I get a score with a reasonably large separation between the autistic and allistic respondents, though with significant overlap and many outliers:

An overall score like this can be hard to relate to, in my opinion. One thing that helps add semantics to the score is seeing how the score relates to the survey responses, so I’ve invented this new kind of diagram to map that out:

Basically, at the top of the diagram, you see the distribution of responses for each group. Below this distribution, you see the median item response for people at a given level of the distribution. So for instance, for the item “I find it hard to know what to do in a social situation”, the label below the 5 score is “Disagree”, while the level below the -5 score is “Agree”, corresponding to the median responses from people at scores of 5 and -5 respectively.

Measurement invariance

This whole discussion started because I was concerned about test bias in the autism metrics. To better illustrate this, I have the following plot:

Each dot in the plot shows an item, and on the y-axis, we see the sex difference for said item. Notably, the sex difference for the item “I am very interested in technology” is very large. That fact alone of course doesn’t intrinsically prove that the item has a sex bias, since if the theory is correct, the item would have a sex difference because men are more prone to Systemizing over Empathizing. However, that is where the x-axis comes in.

If this item is highly reflective of Systemizing vs Empathizing tendencies, then we should expect the item to be highly correlated with the overall score we computed using principal component analysis, and we should expect the score to exhibit a large sex difference too. By multiplying the sex difference in the overall score with the correlation between the overall score and the item, we can get a “predicted sex difference” (according to a single-factor model).

Because it’s relatively easy to predict the sign, to make the test more sensitive I’ve reverse-scored the items with a negative sex difference to emphasize whether it predicts the magnitude of the sex difference. It turns out, this correlation is low, only 0.25.

For contrast, consider what happens if we look at the gap between autistic and non-autistic people:

The total score has a large separation between autistic people and allistic people, and an item like “I find it hard to know what to do in a social situation” has a high correlation with the total score even within the groups (so an autistic person who disagrees with this item is also more likely to score low in the overall autism score), which implies a prediction that the item itself ought to have a large separation between autistic people and allistic people too - and indeed it does! This is the case for most of the items, reflected in the fact that the correlation between the predicted group difference and the actual group difference is much higher than for sex (0.76).

It becomes sketchy to even start interpreting or talking about the sex differences in these scores if those sex differences do not reflect sex differences in the items that we expect to be related to the scores. To solve this, we can bring in additional dimensions, rather than trying to reduce everything to a single Autism-Allism or Systemizing-Empathizing dimension. To pick the number of dimensions to use, I look at the correlation between the predicted and the actual sex difference as a function of the number of dimensions:

This seems to stabilize around 7 dimensions, so therefore I used principal component analysis to extract 7 principal components. To verify, here’s the predicted vs actual sex differences when using more dimensions (note, the diagram does not show anything about these 7 dimensions, only the predicted properties of the items):

Improving interpretability

It’s a bit hard to intuit what the diagram above is about, because I haven’t explained the meaning of the 7 scored dimensions. Using principal components puts us in a bit of a pickle, because initially we were interpreting the meaning of the score by looking at the relationship between the score and the items, but this relationship becomes very complex as we include more principal components.

To help make it more interpretable, we can apply a linear transformation to the scores to make their relationship with the items sparse. I choose to use the algorithm varimax to find a transformation that keeps each of the scores uncorrelated with the other scores while making the scores’ correlations to the items as sparse as feasible.

I then gave names to the factors that vaguely described what the items correlated with the factors had in common. The tables with the items and their correlations to the scores can be seen below:

Given these factors, we can show the average level of each of these traits by group. For the plot below, I’ve divided the trait levels by their standard deviation to make the comparison less dependent on the particulars of the scoring:

Here, the sex differences are in many ways clearly different from the autism difference. The sex difference is concentrated mainly on System Interests, whereas the autism difference is on most of the other variables, except for Curiosity and Orderliness.

If I take the items that are specific to each factor and apply a similar methodology to the above more narrowly to them, I can also break things down further. For example, System Interests had items like “If I were buying a computer, I would want to know exact details about its hard drive capacity and processor speed.”, which I would consider to be about more technologically-oriented interests, and items like “When I learn about historical events, I do not focus on exact dates.”, which I’d consider to be about more detail-oriented interests. Detail Interests were more autistic, whereas Tech Interests were more male:

Similarly, the Empathy items included agreeable items like “I care about others' feelings.” as well as items related to understanding others like “I find it easy to ‘read between the lines’ when someone is talking to me.”. It turns out that the sex difference on such items was more concentrated on those involving caring about others, whereas the autism difference was more concentrated on those involving understanding others:

To unify the two frames, I performed varimax again after breaking the item-sets down, yielding something with all the factors, but System Interests and Empathy broken down into two:

Overall, it seems to me that the EQ-SQ theory mixes sex differences and autism-allism differences together in a way that doesn’t really correspond to reality. It’s more accurate to say that men and women have a different pattern of psychological differences than autistic and allistic people do.

Some people also like to look at the dimension optimized for separating the groups, sometimes known as Gender Diagnosticity or Mahalanobis D. I have some philosophical quibbles that make me not-super-enthusiastic about these quantities[1], but I thought I might as well compute them here:

 

One thing that you might have noticed, either on this diagram or the previous ones, is that autistic women in some ways score more male-typical than non-autistic women, for instance in that they have more technical interests. This seems to be the main “correct prediction” the EQ-SQ theory has made in this study, but on net I would interpret these results as evidence against the theory.

Appendix: Item list

From Systemizing Quotient:

From Empathy Quotient:

From Autism Spectrum Quotient:

From Sensory Perception Quotient:

Additional items that I felt were nice/relevant to include, because in other surveys I've seen them be highly correlated with various relevant traits:

Appendix: Score distributions

 

  1. ^

    People seem to assume that computing the group-separating axis yields the “essence” of the group in some sense, but that assumption is quite ill-defined and as far as I can tell usually not justified.



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

自闭症 性别差异 系统化 共情 主成分分析
相关文章