Data Lake

Technology Radar

As informa??es desta p��gina n?o est?o completamente dispon��veis no seu idioma de escolha. Esperamos disponibiliza-las integralmente em outros idiomas em breve. Para ter acesso ��s informa??es no idioma de sua prefer��ncia, fa?a o download do PDF ��ܨ�.

Atualizado em : Nov 07, 2016

N?O ENTROU NA EDI??O ATUAL

Este blip n?o est�� na edi??o atual do Radar. Se esteve em uma das ��ltimas edi??es, �� prov��vel que ainda seja relevante. Se o blip for mais antigo, pode n?o ser mais relevante e nossa avalia??o pode ser diferente hoje. Infelizmente, n?o conseguimos revisar continuamente todos os blips de edi??es anteriores do Radar. Saiba mais

Nov 2016

Experimente

A is an immutable data store of largely unprocessed "raw" data, acting as a source for data analytics. While the technique can clearly be misused, we have used it successfully at clients, hence motivating its move to trial. We continue to recommend other approaches for operational collaborations, limiting the use of the data lake to reporting, analytics and feeding data into data marts.

Apr 2016

Experimente

Nov 2015

Avalie

A is an immutable data store of largely unprocessed 'raw' data, acting as a source for data analytics. Whereas the more familiar Data Warehouse filters and processes the data before storing it, the lake just captures the raw data, leaving it to the users of that data to carry out the particular analysis that they need. Examples include HDFS or HBase within a , or processing framework. Usually only a small group of data scientists work on the raw data, developing streams of processed data into lakeshore data marts for most users to query. A Data Lake should only be used for analytics and reporting. For collaboration between operational systems we prefer using services designed for that purpose.

May 2015

Avalie

An Enterprise Data Lake is an immutable data store of largely un-processed ��raw�� data, acting as a source for other processing streams but also made directly available to a significant number of internal, technical consumers using some efficient processing engine. Examples include HDFS or HBase within a Hadoop, Spark or Storm processing framework. We can contrast this with a typical system that collects raw data into some highly restricted space that is only made available to these consumers as the end result of a highly controlled ETL process.

Embracing the concept of the data lake is about eliminating bottlenecks due to lack of ETL developer staffing or excessive up front data model design. It is about empowering developers to create their own data processing pipelines in an agile fashion when they need it and how they need it��within reasonable limits��and so has much in common with another model that we think highly of, the DevOps model.

Jan 2015

Avalie

Publicado : Jan 28, 2015

Baixe o PDF

?

English?|?Espa?ol?|?�ʴǰ��ٳܲ��ܨ��?|?��

Inscreva-se para receber a newsletter do Technology Radar

Seja assinante

��Ӱֱ��

Solu??es

��Ի娲��ٰ��

Destaques

Ferramentas e Publica??es Digitais

Todos os Insights

Baixe o PDF

?

?

?

Inscreva-se para receber a newsletter do Technology Radar

?

Visite nosso arquivo para acessar os volumes anteriores

��Ӱֱ��

Solu??es

���Ի娲���ٰ�������

Destaques

Ferramentas e Publica??es Digitais

Todos os Insights

Data Lake

Baixe o PDF

?

?

?

Inscreva-se para receber a newsletter do Technology Radar

?

Visite nosso arquivo para acessar os volumes anteriores

��Ի娲��ٰ��