A data contain too many observations for each person even workInLogs=TRUE

toshi
Posts: 2
Joined: 29 Jan 2024, 03:14

Post by toshi »

Hi, I am working with a choice model in which each person is observed about 2,300 times, choosing among up to 7 alternatives.
With these data, even when I use "workInLogs=TRUE", R reaches its numerical limits on the choice probability when I use apollo_panelProd.
(e.g. exp(log(0.14)*2300) = 0)
I tried to use "Rmpfr" to extend the numerical limits by modifying apollo_panelProd.
(e.g. exp(log(Rmpfr::mpfr(0.14, 32))*2300) is tiny but nonzero, on the order of 1e-1964)
However, the function depends on the object type, and it seems that too many modifications would be required to use Rmpfr in apollo_panelProd.
I also suspect that several other functions within Apollo would need to be changed as well.
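
The underflow described above can be reproduced in base R without Apollo. The sketch below uses the illustrative numbers from the post (2,300 choices, each with probability 0.14) and shows that the sum of logs stays finite where the product does not. The helper log_mean_exp() and the per-draw values in ll_draws are hypothetical and are not part of Apollo; they only illustrate one generic way to average over draws without leaving the log scale, which may or may not match what Apollo does internally.

```r
# Illustrative numbers from the post: 2300 choices, each with probability 0.14
p <- rep(0.14, 2300)

# The direct product underflows to zero in double precision
lik_n <- prod(p)

# The sum of logs stays finite (about -4522)
logL_n <- sum(log(p))

# Hypothetical helper (not an Apollo function): computes log(mean(exp(x)))
# after subtracting the maximum, so the largest term becomes exp(0) = 1
log_mean_exp <- function(x) {
  m <- max(x)
  m + log(mean(exp(x - m)))
}

# Hypothetical per-draw log-likelihoods for one person
ll_draws <- c(-4522.1, -4530.7, -4519.4)

# Naive version: exp() underflows for every draw, so the log is -Inf
naive <- log(mean(exp(ll_draws)))

# Stable version: finite, and dominated by the largest draw
stable <- log_mean_exp(ll_draws)
```

Whether this helps in a given model depends on where the product over choices is converted back from logs, so it is a way to diagnose the problem rather than a drop-in fix.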

I would appreciate it if you would consider introducing an option to increase the numerical precision of the choice probability when using panel data, if you think it is worthwhile to fix this problem.

Thank you for your consideration,
Toshifumi
stephanehess
Site Admin
Posts: 998
Joined: 24 Apr 2020, 16:29

Re: A data contain too many observations for each person even workInLogs=TRUE

Post by stephanehess »

Hi

This is an extreme case. Do you have that many choices for every person in the data, or just for one person (in which case they will dominate the data)?

If you're not going to run a model with random parameters, then you may consider treating the data as if it came from separate people. This would avoid the issue, but of course it means not correcting the standard errors for repeated choices.
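
As a rough illustration of the suggestion above, the following base R sketch builds a pseudo-individual identifier by cutting each person's observations into blocks. The toy data, the column name pseudoID and the value of block_size are all hypothetical; setting block_size to 1 treats every observation as a separate person. The assumption is that the new column would then be used as the individual ID in the model settings instead of the original one.

```r
# Toy data: 2 people with 6 choice observations each (hypothetical columns)
database <- data.frame(ID = rep(1:2, each = 6), choice = rep(1:3, 4))

# Hypothetical block size: number of observations per pseudo-individual
block_size <- 3

# Number the observations within each person: 1, 2, ..., n_person
obs_in_person <- ave(seq_len(nrow(database)), database$ID, FUN = seq_along)

# Assign each observation to a block within its person
block <- (obs_in_person - 1) %/% block_size

# Each (person, block) pair becomes one pseudo-individual,
# numbered in order of first appearance so rows stay contiguous
key <- paste(database$ID, block, sep = "_")
database$pseudoID <- match(key, unique(key))
```

As noted in the reply, this gives up the correction of standard errors for repeated choices by the same person, so it only suits models without random parameters.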

Stephane
--------------------------------
Stephane Hess
www.stephanehess.me.uk
toshi
Posts: 2
Joined: 29 Jan 2024, 03:14

Re: A data contain too many observations for each person even workInLogs=TRUE

Post by toshi »

Thank you for your suggestion.

Yes, I have a large number of choices for each person. The number of individuals is about 5,000. Therefore, the data contain 9M rows.
This was the first time I tried Apollo, after being unable to estimate the model with various other packages.
Because Apollo makes explicit what it calculates, one step at a time, it made me recognize why simple logit models could be estimated on these data but models with individual-specific parameters could not.
I am currently trying alternative approaches, such as dropping from the data those choice occasions that are not important to the problem, or writing the whole estimation code myself using Rmpfr.

Thank you for your consideration,
Toshifumi
stephanehess
Site Admin
Posts: 998
Joined: 24 Apr 2020, 16:29

Re: A data contain too many observations for each person even workInLogs=TRUE

Post by stephanehess »

Hi

You could also try some sampling approaches.
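
One possible reading of this suggestion, sketched in base R below, is to keep a random subset of choice observations per person so that the per-person product stays within range. This is an assumption about what is meant; other sampling schemes, such as sampling of alternatives, are not covered here. The toy data and the cap max_obs are hypothetical.

```r
set.seed(1)  # for a reproducible sample

# Toy data: 3 people with 10, 4 and 7 observations (hypothetical columns)
database <- data.frame(ID = rep(1:3, times = c(10, 4, 7)), obs = seq_len(21))

# Hypothetical cap on the number of observations kept per person
max_obs <- 5

# For each person, keep all rows if at or under the cap,
# otherwise draw max_obs rows at random and keep them in original order
rows_by_person <- split(seq_len(nrow(database)), database$ID)
keep <- unlist(lapply(rows_by_person, function(rows) {
  if (length(rows) <= max_obs) {
    rows
  } else {
    sort(rows[sample.int(length(rows), max_obs)])
  }
}), use.names = FALSE)

database_sampled <- database[keep, ]
```

Repeating the estimation over several such samples would give a sense of how sensitive the estimates are to the rows that were dropped.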

Stephane
--------------------------------
Stephane Hess
www.stephanehess.me.uk