
I don't think sampling's even necessary. Presumably Spotify log every play. If they pushed their logs into BigQuery or something similar, it would be trivial to calculate the revenue breakdown the way the OP describes. 'Big data' is here, and it works. With 100 million users at 1000 tracks per month, we're 'only' talking about 100 billion or so rows to process each month.
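To make that concrete, here is a rough sketch of the aggregation on a toy log. It assumes the OP's scheme is a per-user split, where each subscriber's fee is divided among the artists they played in proportion to play count; that reading, the function name `user_centric_payouts`, the `(user_id, artist_id)` row shape and the flat `fee_per_user` are all my assumptions, not anything Spotify publishes. At real scale the same group-by would run in a warehouse rather than in memory.

```python
from collections import Counter, defaultdict


def user_centric_payouts(plays, fee_per_user):
    """Split each user's fee among artists in proportion to that user's plays.

    plays: iterable of (user_id, artist_id) rows, one row per logged play.
    fee_per_user: hypothetical flat monthly fee, the same for every user.
    """
    # One pass over the log: plays per (user, artist) and total plays per user.
    per_user_artist = Counter()
    per_user_total = Counter()
    for user_id, artist_id in plays:
        per_user_artist[(user_id, artist_id)] += 1
        per_user_total[user_id] += 1

    # Each (user, artist) pair contributes that user's fee times the
    # artist's share of the user's plays.
    payouts = defaultdict(float)
    for (user_id, artist_id), count in per_user_artist.items():
        payouts[artist_id] += fee_per_user * count / per_user_total[user_id]
    return dict(payouts)


# Toy log: u1 plays artist "a" twice and "b" once; u2 plays "b" once.
plays = [("u1", "a"), ("u1", "a"), ("u1", "b"), ("u2", "b")]
payouts = user_centric_payouts(plays, fee_per_user=10.0)

# Row count from the estimate above: 100 million users x 1000 tracks each.
rows_per_month = 100_000_000 * 1000
```

The whole thing is two group-bys and a sum, which is why no sampling is needed: every row is touched once, and the intermediate state is one counter per (user, artist) pair rather than one per play.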

