Data Lake Instance

From GM-RKB
Jump to navigation Jump to search

A Data Lake Instance is a large composite heterogeneous data base with data bases in their original data structure.



References

2016

  • (Wikipedia, 2016) ⇒ http://wikipedia.org/wiki/data_lake Retrieved:2016-2-4.
    • A data lake is a large storage repository and processing engine. They provide "massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs".

2016

2016

  • http://blog.zaloni.com/modernizing-your-big-data-architecture-key-considerations
    • QUOTE:
      • Enable the lake: Build the lake and determine how you will ingest, organize and catalog your data.
      • Govern the data: This involves data quality rules, automation workflows, as well as data security.
      • Engage the business: Deliver the data to more end users, including business end users, to maximize its value — “democratizing” access to your data. This involves implementing tools that make data discovery, enrichment and provisioning very intuitive for less-technically savvy business users.

2015

2015

  • https://azure.microsoft.com/en-us/solutions/data-lake/
    • Azure Data Lake includes all the capabilities required to make it easy for developers, data scientists, and analysts to store data of any size, shape and speed, and do all types of processing and analytics across platforms and languages. It removes the complexities of ingesting and storing all of your data while making it faster to get up and running with batch, streaming, and interactive analytics. Azure Data Lake works with existing IT investments for identity, management, and security for simplified data management and governance. It also integrates seamlessly with operational stores and data warehouses so you can extend current data applications. We’ve drawn on the experience of working with enterprise customers and running some of the largest scale processing and analytics in the world for Microsoft businesses like Office 365, Xbox Live, Azure, Windows, Bing and Skype. Azure Data Lake solves many of the productivity and scalability challenges that prevent you from maximizing the value of your data assets with a service that’s ready to meet your current and future business needs.