Hi,
I am trying to find out how to handle missing data in the following case: some values are missing (not at random) for a variable that significantly influences the choices in my model. At the moment I simply delete all the rows with a missing value. However, these observations are valuable and I would rather not delete them.
Is there a way to include the variable (if available) and sort of ignore it if the value is missing? Could I apply the method for Joint estimation of multiple model components and treat the data as two sets of data, one with the information and one without?
Thanks,
Anna
Include independent variables with missing values
Re: Include independent variables with missing values
Anna
it's quite a common case, and one that is easily accommodated via a separate parameter.
So let's imagine we're looking at age, and that this is measured continuously in the data, with -99 for missing.
Then let's say we want to interact time sensitivity with age. You would then use:
( beta_time + shift_btime_age * ( age > 0 ) * age + shift_btime_age_missing * ( age == -99 ) ) * time
So there would be a separate effect for those with missing age: for them ( age > 0 ) is 0, so the age slope drops out and only shift_btime_age_missing applies.
Stephane
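To make the coding above concrete, here is a minimal base R sketch of how the expression evaluates row by row. The data frame and the parameter values are made up for illustration; in an actual Apollo model the three parameters would be defined in apollo_beta and estimated, and the expression would sit inside the utility function.

```r
# Hypothetical data: age in years, with -99 marking a missing value
database <- data.frame(
  age  = c(25, 40, -99, 63),
  time = c(30, 45, 20, 60)
)

# Illustrative parameter values (assumed here; in Apollo they would be estimated)
beta_time               <- -0.05
shift_btime_age         <-  0.0004
shift_btime_age_missing <-  0.01

# Row-specific time sensitivity: the age slope applies only where age is
# observed, and the separate shift applies only where age is coded -99
b_time_row <- with(database,
  beta_time +
    shift_btime_age         * (age > 0)    * age +
    shift_btime_age_missing * (age == -99)
)

# Contribution of travel time to utility for each row
V_time <- b_time_row * database$time

print(data.frame(database, b_time_row, V_time))
```

For the third row (missing age) the time coefficient is beta_time + shift_btime_age_missing, with no contribution from the age slope, so no observation has to be dropped.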
Re: Include independent variables with missing values
Brilliant! That makes sense. The suggestions on stat stackexchange had me a little worried.
Thanks for the quick response!
Cheers
Anna
Re: Include independent variables with missing values
Greetings. While looking to see how Apollo handles missing values, I noticed this earlier post. The solution of creating a new parameter makes sense if there are only one or a few demographic variables with missing values. However, once we start working with latent variables, it's quite common to have at least a small number of missing values for multiple indicator variables. Have there been any new developments in Apollo to handle missing values?
Thanks for any feedback on this.
Re: Include independent variables with missing values
Hi
in that case, you would have missing data for a dependent variable (indicator in hybrid choice), and you would skip those rows for that dependent variable only. See the 'rows' setting of the model function used for that indicator (e.g. apollo_ol).
Stephane
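As a rough sketch of the 'rows' idea, the base R code below builds one logical vector per indicator and places it in a settings list. The indicator names, the -99 missing code, and the placeholder values for V and tau are assumptions for illustration. The list entries (outcomeOrdered, V, tau, rows) follow my reading of the apollo_ol help file and should be checked against ?apollo_ol for the installed version.

```r
# Hypothetical data: two attitudinal indicators on a 1-5 scale, -99 = missing
database <- data.frame(
  attitude_1 = c(3, -99, 5, 2),
  attitude_2 = c(4, 1, -99, 2)
)

# One logical vector per indicator: TRUE where that indicator was observed
rows_att1 <- database$attitude_1 != -99
rows_att2 <- database$attitude_2 != -99

# Settings for the measurement model of the first indicator.
# V and tau hold placeholder numbers here; in a real hybrid choice model they
# would be functions of the latent variable and of estimated thresholds.
ol_settings_att1 <- list(
  outcomeOrdered = database$attitude_1,
  V              = rep(0, nrow(database)),
  tau            = list(-2, -1, 1, 2),
  rows           = rows_att1
)

# Inside apollo_probabilities this list would be passed to apollo_ol(), so that
# row 2 is skipped for attitude_1 only and still used for every other component.
print(ol_settings_att1$rows)
```

Each indicator gets its own rows vector, so a respondent who skipped one question still contributes to the choice model and to the remaining indicators.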