@markus heya! I was looking at the BoardGameGeek XML API and I saw your DB scraping results, which are cool. But what I'm really looking for are RPGs, not board games; it's not very clear whether rpggeek.com and boardgamegeek.com share the same DB :-) Does your scraped data contain RPG info as well as board games? I don't think the stuff on Kaggle does (for example, rpggeek.com/rpgsystem/68128/ho which is quite famous isn't in it afaict) but I know the Kaggle dump is a subset!

@mavit yeah; I don’t understand the underlying DB structure!

@mavit (see how Honey Heist, linked above, is on rpggeek but doesn't even come up in a BGG search. So do the two DBs share some items? is there one underlying DB but BGG exposes a subset and rpggeek a different subset? Two different DBs but with duplicates? I am confused.)

Follow

@sil I'd imagine each database table has a "what site(s) is this row relevant to?" column, that searching and browsing respects (but direct references do not). Notice that Honey Heist can be found on BGG at boardgamegeek.com/boardgamefam

Sign in to participate in the conversation
Mastodon

Time for a cuppa... Earl Grey please!