Hi all,
I want to fit a dataset that contains information on both the individual and the group level. The (binary) outcome and some input variables are on the individual level, some other input variables (including the one I’m most interested in, let’s call it “x”) are only available on the group level (as an average). I’m wondering what kind of model would be appropriate in that case.
I was thinking about running a logistic regression and modeling “x” on the individual level as a latent variable. In that case I would use the group average to construct an informative prior for “x”. I guess an alternative approach would be to use a binomial likelihood and aggregate the individual-level data I have. However, this feels like throwing away some important information. I’m not sure at all what’s best practice here and if there’s another solution I haven’t thought of.
Thanks in advance!