Read a good article on the upcoming data paralysis on Datanami.
It is useful to see people understanding their problems (we cannot decide, as we have too much and too little data), but it seems we’re still not very much aware of the problem we’re creating with the Big Data hype.
Quite simply – there has always been the need to:
- have data,
- have people with domain knowledge and experience in the domain data comes from,
- to learn from data and
- to interpret the data, using the said domain knowledge AND knowledge about the algorithms used for predictions or modelling.
Currently, we usually have data. Domain knowledge expertise is hard to get by (and no, no newfangled job descriptions will change that :)). We also have very good machine learning people. But, the intersection of all the requirements from above is usually quite lonely place.
So, you see, the big data paralysis is not coming. It’s always been here, but now, it’s uber-hyped, so we acknowledge it :).