Important: Read this before posting to this forum

  1. This forum is for questions related to the use of Apollo. We will answer some general choice modelling questions too, where appropriate, and time permitting. We cannot answer questions about how to estimate choice models with other software packages.
  2. There is a very detailed manual for Apollo available at http://www.ApolloChoiceModelling.com/manual.html. This contains detailed descriptions of the various Apollo functions, and numerous examples are available at http://www.ApolloChoiceModelling.com/examples.html. In addition, help files are available for all functions, using e.g. ?apollo_mnl
  3. Before asking a question on the forum, users are kindly requested to follow these steps:
    1. Check that the same issue has not already been addressed in the forum - there is a search tool.
    2. Ensure that the correct syntax has been used. For any function, detailed instructions are available directly in Apollo, e.g. by using ?apollo_mnl for apollo_mnl
    3. Check the frequently asked questions section on the Apollo website, which discusses some common issues/failures. Please see http://www.apollochoicemodelling.com/faq.html
    4. Make sure that R is using the latest official release of Apollo.
  4. If the above steps do not resolve the issue, then users should follow these steps when posting a question:
    1. Provide full details on the issue, including the entire code and output, as well as any error messages.
    2. Posts will not immediately appear on the forum, but will be checked by a moderator first. This may take a day or two at busy times. There is no need to submit the post multiple times.

Data cleaning

Ask questions about data format and processing of data, including the use of pre-estimation functions in Apollo. If your question relates to a specific error you are getting, please provide some of the output.
Patrick_K
Posts: 9
Joined: 14 May 2022, 14:20

Data cleaning

Post by Patrick_K »

Hi there,

I am trying to clean my data at the moment.
So far, I've looked for straightlining within the choice tasks and other survey questions, and also for speeders who spend less than 3 seconds on a single choice task. Furthermore, I took a look at suspicious answers to the open qualitative questions in the survey.
Is that approach sufficient, or would you recommend any other analysis to find "bad respondents"? Hints on helpful literature on this topic are also very welcome.
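
For reference, the two screening checks described above can be written as flags on each respondent rather than as deletions. The following is only a minimal sketch in Python: the long-format layout and the field names (id, task, choice, seconds) are assumptions made for illustration, and the 3-second cut-off is the one mentioned in this post.

```python
from collections import defaultdict

# Hypothetical long-format data: one record per respondent per choice task.
# The field names (id, task, choice, seconds) are assumptions for illustration.
responses = [
    {"id": 1, "task": 1, "choice": "A", "seconds": 12.0},
    {"id": 1, "task": 2, "choice": "B", "seconds": 9.5},
    {"id": 1, "task": 3, "choice": "A", "seconds": 7.1},
    {"id": 2, "task": 1, "choice": "A", "seconds": 2.1},
    {"id": 2, "task": 2, "choice": "A", "seconds": 1.8},
    {"id": 2, "task": 3, "choice": "A", "seconds": 2.5},
]

SPEED_THRESHOLD = 3.0  # seconds per task, the cut-off mentioned in the post


def flag_respondents(rows, speed_threshold=SPEED_THRESHOLD):
    """Return per-respondent flags; no observation is dropped."""
    by_id = defaultdict(list)
    for row in rows:
        by_id[row["id"]].append(row)

    flags = {}
    for rid, tasks in by_id.items():
        distinct_choices = {t["choice"] for t in tasks}
        fast_tasks = [t for t in tasks if t["seconds"] < speed_threshold]
        flags[rid] = {
            # Same alternative chosen in every task
            "straightliner": len(tasks) > 1 and len(distinct_choices) == 1,
            # Number of tasks answered faster than the threshold
            "n_fast_tasks": len(fast_tasks),
            # At least one task answered faster than the threshold
            "speeder": len(fast_tasks) > 0,
        }
    return flags


flags = flag_respondents(responses)
for rid, f in sorted(flags.items()):
    print(rid, f)
```

Keeping the flags as extra columns leaves the full sample intact, so the flagged groups can still be compared with everyone else later on.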

Thank you very much in advance,
Best regards,
Patrick
stephanehess
Site Admin
Posts: 974
Joined: 24 Apr 2020, 16:29

Re: Data cleaning

Post by stephanehess »

Patrick

Removing people arbitrarily on the basis of straightlining or speeding is bad practice. These people might still be behaving rationally. There is ample guidance on this in the literature. It is much better to look at why someone might be behaving in a certain way, and to see how you can accommodate them in a model.

Stephane
--------------------------------
Stephane Hess
www.stephanehess.me.uk
Patrick_K
Posts: 9
Joined: 14 May 2022, 14:20

Re: Data cleaning

Post by Patrick_K »

Dear Stephane,

Thank you for this important advice.
Would the same apply to "irrational" respondents (those who do not answer two identical fixed choice tasks in the same way)?
That is, should I not delete them from the sample just because they failed the test for the completeness axiom?
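
For reference, the repeated-task check itself is easy to compute as a flag. This is a minimal sketch in Python; the data layout and the field names (id, first, repeat) are assumptions for illustration only.

```python
# Hypothetical data: each respondent's choice in the first and in the repeated
# presentation of the same fixed task. Field names are assumptions.
repeated = [
    {"id": 1, "first": "A", "repeat": "A"},
    {"id": 2, "first": "A", "repeat": "B"},
    {"id": 3, "first": "B", "repeat": "B"},
    {"id": 4, "first": "C", "repeat": "A"},
]


def consistency_flags(rows):
    """Map respondent id -> True if both presentations got the same answer."""
    return {r["id"]: r["first"] == r["repeat"] for r in rows}


consistent = consistency_flags(repeated)
share_consistent = sum(consistent.values()) / len(consistent)
print(consistent)
print(share_consistent)
```

The share of consistent respondents is a descriptive statistic for the sample; on its own it does not say which individuals, if any, should be treated differently.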

Thank you very much.
stephanehess
Site Admin
Posts: 974
Joined: 24 Apr 2020, 16:29

Re: Data cleaning

Post by stephanehess »

The key job for the modeller here is to judge whether the behaviour is in line with random utility maximisation (RUM) or not.
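
As one illustration of why a failed repeat test need not contradict RUM: in a random utility model choices are probabilistic, so some switching between two presentations of the same task is expected. The sketch below (Python) assumes a simple MNL model with error terms that are independent across the two presentations; the utility values are hypothetical.

```python
import math

# Hypothetical deterministic utilities of the alternatives in the repeated task.
utilities = {"A": 0.5, "B": 0.0}


def mnl_probabilities(v):
    """Multinomial logit choice probabilities for a dict of utilities."""
    v_max = max(v.values())  # subtract the maximum for numerical stability
    exp_v = {k: math.exp(x - v_max) for k, x in v.items()}
    total = sum(exp_v.values())
    return {k: e / total for k, e in exp_v.items()}


def prob_same_choice_twice(p):
    """Probability of choosing the same alternative in two independent draws."""
    return sum(x ** 2 for x in p.values())


probs = mnl_probabilities(utilities)
p_consistent = prob_same_choice_twice(probs)
print(round(p_consistent, 2))  # 0.53
```

Under these assumptions, almost half of all respondents would give different answers to the two presentations even though every one of them behaves in line with the model.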
Patrick_K
Posts: 9
Joined: 14 May 2022, 14:20

Re: Data cleaning

Post by Patrick_K »

Are there any specific guidelines on how this can be judged, or any papers you can recommend on dealing with inconsistent respondents?
stephanehess
Site Admin
Posts: 974
Joined: 24 Apr 2020, 16:29

Re: Data cleaning

Post by stephanehess »

Patrick

A long time ago, I wrote this paper: https://doi.org/10.1016/j.trd.2010.04.008

There are probably many others now.

That paper deals with "inconsistent" responses. For people who are very fast or very slow, my recommendation is always to first see whether they actually behave differently from others.
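
A first descriptive look at whether fast respondents choose differently could be a simple cross-tabulation of choices by group. The sketch below is in Python; the counts are made up for illustration, and the Pearson chi-square statistic is only a rough screen, since it treats observations as independent (ignoring repeated choices per respondent) and ignores the attributes shown in each task. Comparing the groups inside the choice model would be the more complete check.

```python
from collections import Counter

# Hypothetical observations: (speed group, chosen alternative).
observations = (
    [("fast", "A")] * 30 + [("fast", "B")] * 10
    + [("other", "A")] * 50 + [("other", "B")] * 50
)


def chi_square(obs):
    """Pearson chi-square statistic and degrees of freedom for group x choice."""
    cell = Counter(obs)
    row = Counter(group for group, _ in obs)
    col = Counter(choice for _, choice in obs)
    n = len(obs)
    stat = 0.0
    for g in row:
        for c in col:
            expected = row[g] * col[c] / n
            stat += (cell[(g, c)] - expected) ** 2 / expected
    dof = (len(row) - 1) * (len(col) - 1)
    return stat, dof


stat, dof = chi_square(observations)
print(round(stat, 3), dof)
```

With one degree of freedom, the 5% critical value of the chi-square distribution is 3.841, so a statistic above that would suggest the two groups do not share the same choice shares in this made-up sample.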

Stephane