Observed and unobserved heterogeneity
Posted: 12 Oct 2020, 05:18
Hi Prof. Hess,
I try to implement a mixed logit model considering both observed and unobserved heterogeneity, but have a few confusions on the model specification.
I intend to specify the coefficient for a cost variable to be random but also relate to income (greater income, less sensitive to cost). So for the normal distribution, I have tried the following two specifications (1) and (2), and the model results prefer the first one. I am not sure whether this is dataset specific. Generally I feel the first specification is more intuitive, as it clearly shows that the importance of cost consists of one observed component linked to income and one purely random component.
(1) randcoeff[["rand_cost"]] = b_cost / income + sigma_cost * norm_draws
(2) randcoeff[["rand_cost"]] = (b_cost + sigma_cost * norm_draws)/ income
Similarly, for the lognormal distribution, we also have the following two:
(3) randcoeff[["rand_cost"]] = - exp(b_cost / income + sigma_cost * norm_draws)
(4) randcoeff[["rand_cost"]] = - exp(b_cost + sigma_cost * norm_draws) / income
Lastly, for the triangular distribution,
(5) randcoeff[["rand_cost"]] = a / income + b * (draws_u1 + draws_u2)
(6) randcoeff[["rand_cost"]] = a / income + b / income * (draws_u1 + draws_u2)
(7) randcoeff[["rand_cost"]] = [a + b * (draws_u1 + draws_u2)] / income
Thanks for your time and appreciate if you can help to clarify which one is the appropriate specification.
Best,
Wenjian
I try to implement a mixed logit model considering both observed and unobserved heterogeneity, but have a few confusions on the model specification.
I intend to specify the coefficient for a cost variable to be random but also relate to income (greater income, less sensitive to cost). So for the normal distribution, I have tried the following two specifications (1) and (2), and the model results prefer the first one. I am not sure whether this is dataset specific. Generally I feel the first specification is more intuitive, as it clearly shows that the importance of cost consists of one observed component linked to income and one purely random component.
(1) randcoeff[["rand_cost"]] = b_cost / income + sigma_cost * norm_draws
(2) randcoeff[["rand_cost"]] = (b_cost + sigma_cost * norm_draws)/ income
Similarly, for the lognormal distribution, we also have the following two:
(3) randcoeff[["rand_cost"]] = - exp(b_cost / income + sigma_cost * norm_draws)
(4) randcoeff[["rand_cost"]] = - exp(b_cost + sigma_cost * norm_draws) / income
Lastly, for the triangular distribution,
(5) randcoeff[["rand_cost"]] = a / income + b * (draws_u1 + draws_u2)
(6) randcoeff[["rand_cost"]] = a / income + b / income * (draws_u1 + draws_u2)
(7) randcoeff[["rand_cost"]] = [a + b * (draws_u1 + draws_u2)] / income
Thanks for your time and appreciate if you can help to clarify which one is the appropriate specification.
Best,
Wenjian