I understand what it means, but what I don't understand, and was curious about, is how you can achieve constant-time searching or the dataset. I started with the assumption that you're doing approximate matches, is this correct?
Ah – well to clarify, we're performing exact match lookups on individual k-mers (k-length genetic strings), and then computing a classification result from those exact searches.