KNIME

We're going to pound the drum a little bit for one of the most overlooked and underrated tools out there.  KNIME (Konstanz Information Miner), so named because it began life as a project at the University of Konstanz, is a GUI-based ETL and ML platform.  The base program is open source and free of charge; which given a broad enough deployment and user base within an organization and combined with the paid (but still cheap!) server component can be used even up to a replacement for Airflow as a scheduling tool (that can implement R as well as Python, and doesn't require a priori knowledge of DAGs).

It connects natively to all the big boys- Azure and AWS varying storage services; runs Python, R, Java; and integrates Weka, h2o.ai, Keras, Spark (and Databricks).

Did we mention the analytics part of it is free.  Just download the damn thing.

Leave a Reply

Your email address will not be published. Required fields are marked *