Big Data/Data Analytics

productize

State Januar 2019

The Data Lake project started with the aim of building infrastructure for future big data analytics systems.

Revisions:

productize | November 2017

Definitions

Big data analytics is the process of examining large and varied data sets to discover hidden patterns, unknown correlations, market trends, customer preferences and other useful information that can help organizations make more-informed business decisions.

Why?

  • Cost reduction. Big data technologies such as Hadoop and cloud-based analytics bring significant cost advantages when it comes to storing large amounts of data – plus they can identify more efficient ways of doing business.

  • Faster, better decision making. With the speed of Hadoop and in-memory analytics, combined with the ability to analyze new sources of data, businesses are able to analyze information immediately – and make decisions based on what they’ve learned.

  • New products and services. With the ability to gauge customer needs and satisfaction through analytics comes the power to give customers what they want.

The concept of big data has been around for years; most organizations now understand that if they capture all the data that streams into their businesses, they can apply analytics and get significant value from it.

The new benefits that big data analytics brings to the table, however, are speed and efficiency. Whereas a few years ago a business would have gathered information, run analytics and discover information that could be used for future decisions, today that business can identify insights for immediate decisions. The ability to work faster – and stay agile – gives organizations a competitive edge they didn’t have before.

Technology choices/solutions

  • Classical Hadoop ecosystem distributions e.g. Cloudera, Hortonworks, MapR etc.
  • Apache Spark
  • Cloud-based big data as a service e.g. Azure HDInsights or AWS EMR
  • Commercial analytical platforms e.g. SAS Visual Analytics or Microsoft Power BI

Our projects

  • Customer Analytics team runs Big Data Lab with SAS Visual Analytics and Hadoop ecosystem back-end.
  • Microsoft Power BI is used in various departments of our company.
  • PoCs related to Big Data Analytics are in progress across the company.

Contact