Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Before reaching for spark, etc:

Sort is good for aggregations that fit on disk (TBs these days, I guess)

Perl does well too if the output fits in a hashtable in DRAM, so 10’s (or maybe 100’s?) of GBs



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: