#Duplicate data in the Popularity Primitive dump, missing data in the Age Rating Category dump


vivid light
  • There are duplicate entries in the popularity_primitives dump for several games; 24 rows are inconsistent. For example, game id 119339 has these seven rows in today's dump:
10094,119339,3,121,1.3070528572175458e-05,2025-02-24 01:05:03,2024-06-12 10:16:05,2025-02-24 01:08:16,,121
130352,119339,4,121,2.123250441636092e-06,2025-02-24 01:18:00,2024-06-12 10:26:47,2025-02-24 01:19:58,,121
72799,119339,2,121,1.4214075345970594e-05,2025-02-24 01:14:34,2024-06-12 10:21:54,2025-02-24 01:15:53,,121
702178,119339,8,1,4.6812189488581e-06,2025-02-22 11:04:18,2025-01-31 07:11:39,2025-02-22 11:13:34,,1
884863,119339,6,1,2.0909821810011385e-07,2025-02-22 11:04:17,2025-01-31 07:17:30,2025-02-22 11:18:42,,1
885767,119339,8,1,1.7977031293617895e-07,2025-02-22 11:04:18,2025-01-31 07:17:31,2025-02-22 11:19:10,,1
700640,119339,6,1,5.118724379090787e-06,2025-02-22 11:04:17,2025-01-31 07:11:36,2025-02-22 11:13:22,,1

The last four rows show two pairs with the same game_id, source and type, but two different values.
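
For anyone who wants to reproduce this, here is a rough sketch of how the conflicting rows can be spotted. The column names are my guesses from the sample rows above (the dump's real header may differ), and `find_conflicts` is just a helper name I made up for this example.

```python
import csv
import io
from collections import defaultdict

# Assumed column order, inferred from the sample rows; not confirmed by the dump header.
COLUMNS = ["id", "game_id", "popularity_type", "popularity_source", "value",
           "calculated_at", "created_at", "updated_at", "col9", "col10"]

# The seven rows for game id 119339 quoted above.
SAMPLE = """\
10094,119339,3,121,1.3070528572175458e-05,2025-02-24 01:05:03,2024-06-12 10:16:05,2025-02-24 01:08:16,,121
130352,119339,4,121,2.123250441636092e-06,2025-02-24 01:18:00,2024-06-12 10:26:47,2025-02-24 01:19:58,,121
72799,119339,2,121,1.4214075345970594e-05,2025-02-24 01:14:34,2024-06-12 10:21:54,2025-02-24 01:15:53,,121
702178,119339,8,1,4.6812189488581e-06,2025-02-22 11:04:18,2025-01-31 07:11:39,2025-02-22 11:13:34,,1
884863,119339,6,1,2.0909821810011385e-07,2025-02-22 11:04:17,2025-01-31 07:17:30,2025-02-22 11:18:42,,1
885767,119339,8,1,1.7977031293617895e-07,2025-02-22 11:04:18,2025-01-31 07:17:31,2025-02-22 11:19:10,,1
700640,119339,6,1,5.118724379090787e-06,2025-02-22 11:04:17,2025-01-31 07:11:36,2025-02-22 11:13:22,,1
"""


def find_conflicts(rows):
    """Return groups that share (game_id, type, source) but contain more than one row."""
    groups = defaultdict(list)
    for row in rows:
        key = (row["game_id"], row["popularity_type"], row["popularity_source"])
        groups[key].append(row)
    return {key: group for key, group in groups.items() if len(group) > 1}


rows = list(csv.DictReader(io.StringIO(SAMPLE), fieldnames=COLUMNS))
conflicts = find_conflicts(rows)
for key, group in sorted(conflicts.items()):
    # Print the grouping key and the ids of the rows that collide on it.
    print(key, [row["id"] for row in group])
```

To run it over the full dump, replace `io.StringIO(SAMPLE)` with the opened dump file (and drop `fieldnames` if the file has its own header row).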

  • There are missing values in the age_rating_categories endpoint: id #8 and #9 are gone, but there are over 30k entries in age_ratings that have that value in rating_category.
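
The missing-category check is the same idea in reverse: collect the ids the categories endpoint returns, then count the age_ratings rows whose rating_category is not among them. A minimal sketch, assuming both datasets are already loaded as lists of dicts; `missing_categories` is a made-up helper name and the toy records below are illustrative only, not real dump data.

```python
from collections import Counter


def missing_categories(age_ratings, categories):
    """Count age_ratings rows whose rating_category has no matching category id."""
    known_ids = {category["id"] for category in categories}
    return Counter(
        rating["rating_category"]
        for rating in age_ratings
        if rating["rating_category"] not in known_ids
    )


# Illustrative toy records only (not taken from the real dumps).
categories = [{"id": 7}, {"id": 10}]
age_ratings = [
    {"id": 1, "rating_category": 7},
    {"id": 2, "rating_category": 8},
    {"id": 3, "rating_category": 8},
    {"id": 4, "rating_category": 9},
]

orphans = missing_categories(age_ratings, categories)
for category_id, count in sorted(orphans.items()):
    print(f"rating_category {category_id}: {count} rows without a matching category")
```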
molten plover

The second bullet point has been addressed. It looks like the first is caused when multiple Steam pages are linked to the same IGDB game; I'll discuss with the team what the intended behaviour is here.

vivid light