Just estimating a population parameter for a multilevel sample
Posted: Thu Dec 15, 2022 7:40 pm
It seems that we have all kinds of sophisticated models (GLM, Mixed Effects, etc.) for analyzing "effects," but if we want to simply estimate a population parameter (like a mean) we're mostly stuck with doing a one sample t test or binomial test. But what if the sample is "multilevel" as in the following example:
Each of 100 research participants is asked to guess which of two cities has the larger population. Each participant's response can be either correct or incorrect. The two cities are either (i) San Francisco and Dallas, or (ii) Miami and Philadelphia, or (iii) Las Vegas and Atlanta, or (iv) New Orleans and Minneapolis, or (v) Baltimore and Denver. Participants have been randomly assigned to make their judgment about City Pair i, ii, iii, iv, or v. Moreover, the researcher selected those five city pairs at random from the total set of American cities.
Traditionally, if one wants to know whether accuracy is significantly different from 0.5, one can conduct a binomial test, which assumes that participants have been randomly selected from a population of potential participants. But this ignores that the city pair selections are also random. Is there a way to estimate the population proportion in a way that treats both participants and city-pairs as having been randomly sampled? I can think of ways to hack a standard mixed-effects analysis to get an answer that's approximately correct. But given the problem's conceptual simplicity, is there a more standard way of approaching it?
Each of 100 research participants is asked to guess which of two cities has the larger population. Each participant's response can be either correct or incorrect. The two cities are either (i) San Francisco and Dallas, or (ii) Miami and Philadelphia, or (iii) Las Vegas and Atlanta, or (iv) New Orleans and Minneapolis, or (v) Baltimore and Denver. Participants have been randomly assigned to make their judgment about City Pair i, ii, iii, iv, or v. Moreover, the researcher selected those five city pairs at random from the total set of American cities.
Traditionally, if one wants to know whether accuracy is significantly different from 0.5, one can conduct a binomial test, which assumes that participants have been randomly selected from a population of potential participants. But this ignores that the city pair selections are also random. Is there a way to estimate the population proportion in a way that treats both participants and city-pairs as having been randomly sampled? I can think of ways to hack a standard mixed-effects analysis to get an answer that's approximately correct. But given the problem's conceptual simplicity, is there a more standard way of approaching it?