Hadoop

Open source and proprietary software solutions: the key for an analytic project

Open source and proprietary software solutions: the key for an analytic project

In the world of data analysis it may be no coincidence that open source tools like the ‘R’ statistical computing language have blossomed as analytics and big data have matured together. Hadoop, Python… There seems to be a special kind of magic between the curious minds of data analysts (with a small ‘a’ – as Continue reading Open source and proprietary software solutions: the key for an analytic project

Hadoop: the rise of the modern data lake platform

Hadoop: the rise of the modern data lake platform

Hadoop, while it may be synonymous with big data, and while it may be free to access and work with, engineering teams globally will admit that behind every Hadoop undertaking is a major technical delivery project. Failures are so commonplace that even the experts don’t have great expectations of 2017: at the recent Gartner Data Continue reading Hadoop: the rise of the modern data lake platform

Apache Flink in Processing Streaming Data

Apache Flink in Processing Streaming Data

Streaming data processing is an emerging area. It means processing the data almost instantly (with very low latency) when it is generated. Until now, most data processing was based on batch systems, where processing, analysis and decision making were a much slower process. Now, as the new technologies and platforms are evolving, organizations are gradually shifting towards Continue reading Apache Flink in Processing Streaming Data