I have the data of players active on a gaming console and the playtime hours corresponding to the games they have played and their age. I want to analyze the top (say 10) games that the people between age 18-24 have played. However, I know that the reported age on the gaming console is highly unreliable. Is there anyway we could reduce the unreliability and reach a (nearly) fair estimation? Please let me know if additional information is required for answering this question. Thank you.
$\begingroup$
$\endgroup$
5
-
1$\begingroup$ Do you have some reliable, independent ground truth? $\endgroup$Ggjj11– Ggjj112024-01-18 07:33:42 +00:00Commented Jan 18, 2024 at 7:33
-
$\begingroup$ Could you please give me examples of "ground truths"? $\endgroup$Ritik P. Nayak– Ritik P. Nayak2024-01-18 07:36:09 +00:00Commented Jan 18, 2024 at 7:36
-
$\begingroup$ Yes: those would be people whose ages you are more certain about. Without some such information, you're purely guessing. $\endgroup$whuber– whuber ♦2024-01-18 14:36:07 +00:00Commented Jan 18, 2024 at 14:36
-
$\begingroup$ Sorry for not being able to reply earlier @whuber, I thought about this and I don't think we have any source for ground truth yet. What are the other ways to make an estimated guess? Please let me know if you think additional info on this would be subservient for you to suggest a solution $\endgroup$Ritik P. Nayak– Ritik P. Nayak2024-01-23 04:15:50 +00:00Commented Jan 23, 2024 at 4:15
-
$\begingroup$ You can make an "estimated guess" using a Ouija board, a random number generator, an oracle, or whatever. You just won't have any basis to support any claims concerning how accurate your guessing might be. $\endgroup$whuber– whuber ♦2024-01-23 14:48:37 +00:00Commented Jan 23, 2024 at 14:48
Add a comment
|