At Circonus we process a lot of data. We learned early on that some data can be sampled and some data cannot. The way you treat data when you "need it all" to make good sense of things is radically different than the way you must treat sampled datapoints.
This presentation will walk through the architectural evolution of our system as it had to scale to billions of events per day and trillions of datapoints for regression.
I hope you'll learn some of what we learned constructing a real-time, ad-hoc, global event analysis software as a service platform.