These errors don’t stop the image from being built but inform you that the installation process tried to open a dialog box, but was unable to. Generally, these errors are safe to ignore. Some people circumvent these errors by changing the DEBIAN_FRONTEND environment variable inside the Dockerfile using: ENV DEBIAN_FRONTEND=noninteractive This prevents the installer from opening dialog… Continue reading unable to initialize frontend: Dialog
They are complementary. VMs are best used to allocate chunks of hardware resources. Containers operate at the process level, which makes them very lightweight and perfect as a unit of software delivery.
Dubbed an “SQL-on-Hadoop” solution, Apache Tajo is used for low-latency and scalable ad-hoc queries, online aggregation, and ETL (extract-transform-load process) on large data sets stored on HDFS (Hadoop Distributed File System) and other data sources. By supporting SQL standards and leveraging advanced database techniques, Tajo allows direct control of distributed execution and data flow across… Continue reading Apache Tajo™: A big data warehouse system on Hadoop
Schema-less is a bit of a misnomer, it’s better to think of it as: SQL = Schema enforced by a RDBMS on Write NoSQL = Partial Schema enforced by the DBMS on Write, PLUS schema fully enforced by the Application on Read (Externalised schema) So while a supposed Schema-less NoSQL data-store will in theory allow… Continue reading what does schema Less means to no-sql ?
Kylin is an open source Distributed Analytics Engine from eBay Inc. that provides SQL interface and multi-dimensional analysis (OLAP) on Hadoop supporting extremely large datasets. http://www.kylin.io/
Zeppelin is data analytics environment Web based notebook style editor. Built-in Apache Spark support http://zeppelin-project.org/