I find that some of my ideas take a few weeks, months or even years to form. This one took almost exactly a year before coalescing (coagulating?) in my mind. I’ve been thinking about personality tests in the context of efficacy, equity and neurodiversity recently, and they trouble me.
I’ve always found personality testing problematic – indeed any pseudo-Jungian approach to putting people into type categories I find highly distasteful and potentially harmful.
Critical literacy is sorely lacking in the business and management world. This is possibly largely because it’s not rewarded: we reward confidence, sticking by decisions, bullishness and simple answers to complex problems.
With respect to diversity, inclusion, and equity, I just can’t square the desire to categorise people and their personalities with the very real need for inclusion and diversity of ways of thinking. It seems simply antithetical.
To summarise the flaws in personality testing:
- There is very little evidential basis behind personality profiling, and significant evidence against it.
- The models are usually based on false dichotomies of “big picture vs detail-oriented”, when there is no evidence that these exist.
- The models are also based on WEIRD (Western, Educated, Industrialised, Rich, and Democratic) societies, and fail to recognise collectivist, holistic strengths.
- They rarely address context and inter-relational behaviours, but instead make assumptions about behaviour from individualistic measures.
- They tend to assume that our personalities are largely fixed and unchangeable.
- These tools can lead to false and potentially harmful assumptions made about other people and the way they behave.
- The tools may be used for unethical (and sometimes illegal) purposes such as recruitment, selection for promotion, or other decisions made about someone without their consent.
- In my experience, they are one of the most highly weaponised management tools ever created.
- Because they lead people to believe that they can understand someone based upon a profile, they can prevent further discussion, examination and effort to understand people and their ever-evolving uniqueness.
- The algorithms used are rarely open. Algorithms inherit the biases of those people that created them, and if we are making ourselves subject to analysis by algorithm, I want to know what it’s doing and who designed it.
- Many tests are biased (see above) – for example, the Big Five was shown to be biased against women, categorising them as more disagreeable than a man who answers identically, because the original data model was flawed.
- To avoid a critique of poor reliability, we’re often told to avoid doing the tests more than once.
- When assigned a profile, we are generally not allowed to dispute it. Even though we have spent decades in our own minds, a five-minute test is assumed to know more about us than we do.
Even scientists who are most concerned with assessing individual differences in personality would concede that our ability to predict how particular people will respond in particular situations is very limited.
Personality, strength, or psychometric models such as Myers-Briggs, DISC, Belbin, Predictive Index, Tilt and the myriad others available, attempt to codify people and their preferences, personalities, behaviours and values into archetypes, using fixed (usually proprietary and opaque) algorithms. There is usually a commercial reason that these tests are closed-source: companies don’t want someone copying the code and using or distributing it. But it also prevents detailed analysis and evaluation of the algorithm.
Repeatability and validity
These archetypes (such as “Maverick” or “Inventor”) are then categorised and collated into larger group types and, in many organisations, used to inform everything from role selection and management approach to hiring decisions (which is illegal in many cases).
In 20 years of management, I have never seen a psychometric analysis tool generate a constructive outcome, particularly from a diversity, equity and inclusion (DEI) perspective. I also find it interesting that this kind of type-based personality testing exists almost *only* in the business world, not in the academic world of actual psychological study. Do business managers actually think they know something psychologists don’t?
In my opinion (somewhat backed up by many years of experience and study), categorising people and attempting to simplify the complexities of our nature, in an attempt to make other people and ourselves more predictable, is certainly a seductive proposition. But it is error-prone, and dangerous. Adam Grant, organisational psychologist at Wharton, agrees.
Psychometric analyses don’t work. Indeed, they are often damaging.
The reason they will never work is that they try to map a complicated framework onto a complex problem. You may be familiar with Carl Jung, and his “12 Archetypes” of “Ruler, Sage, Explorer, etc”, which are frequently criticised as mystical or metaphysical essentialism. Since archetypes are defined so vaguely and since archetypal images have been observed by many Jungians in a wide and essentially infinite variety of everyday phenomena, they are neither generalisable nor specific in a way that may be researched or demarcated with any kind of rigour. Hence they elude systematic study, which is true of many other domains of knowledge that seek to reduce complex problems and systems to simple, archetypal models and solutions.
As Cynefin shows us, complicated systems can be really big, and appear complex, but the laws of cause and effect don’t change. When you press the A/C button in your modern car (which is “complicated”), the A/C comes on, and the same thing happens every subsequent time you do it. This is rather obviously not the case with people.
In complex systems such as humans, asking a teammate to help you out with a task one day results in them helping you, but on another day, they might tell you to stick it; maybe they’re hungover, stressed and busy, maybe they’re tired, or maybe they just don’t feel like helping. Cause and effect change in complex systems, and humans are complex. Really complex. Which is why “the soft stuff is the hard stuff”.
Complicated systems can seem messy, but an action produces the same result each time. People are not like that. They are complex, and groups of people even more so. Cause and effect change constantly – pressing the equivalent of that A/C button on a complex human has one effect today and a different effect tomorrow.
And that is why personality, psychometric, “strength” tests etc will never work in the way people desire them to. People don’t fit into boxes, and we shouldn’t try to make them.
All models are wrong. Some are useful.
The problem comes when you apply a model to a complex problem on the assumption that it’s right.
“It ain’t what you don’t know that hurts you, it’s what you know for sure that just ain’t so.”
And the people selling these systems either know this, in which case they’re selling snake oil, or they’re simply being optimistically gullible, looking for simple answers to complex problems. To be fair, we humans are almost infinitely susceptible to the seductive simplicity of personality archetypes, even more so when they’re about us. This is known as the Barnum effect, where it’s possible to give everyone the same description and people nevertheless rate the description as very accurate.
Flawed evidence of personality test reliability
MBTI fails on both validity and reliability tests, as do most other personality and psychometric tools. Proponents (usually people selling them) are keen to point out reliability measures that show, with a degree of error, that the same person taking the same test at a different time often obtains a similar result. However, this only serves to highlight the problem – if you asked me today, I would tell you my favourite colour is yellow, and I’d usually give the same answer a month later, but it doesn’t follow that my favourite colour has anything to do with my personality, nor that my personality is stable over time. Equally, I may be lying. My favourite colour is actually blue.
Most of these systems apply an assumption of dichotomies, or even force them – you are either X or Y, you cannot be both, and you cannot change from one to the other. This has been disproven too.
When I did a “Predictive Index” test, I was told that I was far from empathetic, because I was evidence and data driven. According to PI, someone cannot be both evidence-oriented and empathetic. Not only is this offensive, it’s completely unfounded. In fact research shows that people with more rigorous and evidence-driven thinking skills are also better at understanding and managing emotions. These are simply not valid tests.
We should all be suspicious of algorithms that describe us or make decisions about us that are closed source, and psychometric tests are no different. Predictive Index have repeatedly declined to open source their algorithm, ostensibly to protect their intellectual property.
The key to the Big 5 model is its simplicity. It doesn’t sort anybody into a “type,” just informs them where they fall on a continuum of personality traits. There are no tricks and no surprises to be revealed, and it’s not a black box. However, even though it’s the most trusted psychological profiling test in academia, the “Big 5” has been found to be systematically sexist. “Women are told they’re significantly more disagreeable than men who answer questions identically.”
Criticism of MBTI and others extends even further, often due to their highly westernised, English-language, neurotypical approach.
Dangerous tools?
Evidence shows that, far from being a “short-cut” to more insightful leadership, tools such as these can be harmful – they may convince managers that they’re doing “good management”, and discourage further effort to improve management and leadership behaviours. At worst, they’re actively discriminatory and detrimental to individual and team performance, reducing the quality of human interactions and decreasing levels of psychological safety.
Conversely, I’ve actually found value in doing “Which Hogwarts house are you in?” or “Which Sex and the City character are you?” quizzes with teams. They’re obviously nonsense, but they facilitated a good discussion with team members about preferences and styles – and it was much more fun than MBTI!
(In fact, those quizzes have an advantage over some of the “official” tests because they make no pretence of scientific accuracy.)
Finally, I’ve never come across a strongly competent leader who used personality testing and categorisation. It seems to me (and I’m conscious of my own biases here) that these tests can sometimes risk replacing empathy: they offer a way to feel like you’re understanding people, and “doing the work”, without actually putting in the effort to do so.
Personally, given all the flaws and limitations of personality profiling, I hope organisations stop using these tools, and businesses stop trying to make money out of them. We try not to use flawed tools to do finance, accounting, software development, design, or data analysis. Why is it acceptable to use flawed tools to understand and manage the most important thing in organisations – people? And why, when we realise that they’re flawed, are they so “sticky”? Why can’t we seem to get rid of them?
What do you think? Are they a useful tool, or a potentially dangerous over-simplification of human nature?
Read more: https://adamgrant.substack.com/p/mbti-if-you-want-me-back-you-need and https://www.psychologytoday.com/gb/blog/give-and-take/201309/goodbye-to-mbti-the-fad-that-wont-die