Can someone with a background in studying Statistics help me out?

The_Jewish_Cuban [he/him]@hexbear.net · edit-2 10 months ago

Can someone with a background in studying Statistics help me out?

Maoo [none/use name]@hexbear.net · 10 months ago

It sounds like a very silly study even from the initial design. A 4-point scale is embarrassing. You need an odd number in order to have both a neutral choice option and symmetry. The 5-point Likert is basically standard if you don’t want to think about it too hard, like the authors. 1=lowest rating, 5=highest, 3=neutral, and 2 and 4 represent intermediates. If they really did just want to know pro vs con vs neutral, they should’ve chosen a 3-point scale so that participants had to actually make that choice: 1=low rating, 2=neutral, 3=high rating. 4 is silly and you can see that they grouped together the 2s and 3s answers for exactly this reason.

In terms of analysis, yeah that’s absurd. Even using K-means should raise little alarm bells given that their scale and dimensionality are so small that it’s basically discrete. K-means clustering is a continuous analysis. There are analogous methods to clustering for discrete cases where you cut cubes or sets of blocks.

But even that is silly. It sounds like they could have just made a 2D plot and shown it, like a density plot. The frewuy of all the answers can be shown on a 4x4 plot, right? You could make every dot have a different opacity or size. Easy to see all of the results and any subsequent analysis would be visually explained. At the same time, the clustering should be visualized the same way and it would probably show how silly their choices were. It would be obvious that this is a discrete and low-dimension dataset, for example.

Also, if they’re just gonna group middle clusters anyways, they should just admit they want to ask a question heuristically and create the clusters manually.