Apache Flink

Revision as of 11:33, 14 September 2016 by Donsez (→ Installation)

https://flink.apache.org/

Apache Flink® is an open source platform for distributed stream and batch data processing. Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.

Getting started

Installation

# Download the binary release from the Apache archive and unpack it in the home directory
cd ~
wget https://archive.apache.org/dist/flink/flink-1.1.2/flink-1.1.2-bin-hadoop27-scala_2.11.tgz
tar xf flink-1.1.2-bin-hadoop27-scala_2.11.tgz
export FLINK_HOME=~/flink-1.1.2
cd $FLINK_HOME
ls bin
ls examples
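
The installation steps can also be parameterized, so that switching to another release is a one-line change. The following is only a sketch under assumptions: the variable names and the install_flink function are invented for illustration and are not part of Flink, and the archive.apache.org directory layout is assumed to hold for the chosen release.

```shell
#!/bin/sh
# Hypothetical helper: all names below are illustrative, not shipped with Flink.
FLINK_VERSION="1.1.2"
HADOOP_VERSION="27"
SCALA_VERSION="2.11"

# Tarball name follows the pattern used by the release on this page.
FLINK_TGZ="flink-${FLINK_VERSION}-bin-hadoop${HADOOP_VERSION}-scala_${SCALA_VERSION}.tgz"
# Assumed layout of the Apache archive; check the URL before relying on it.
FLINK_URL="https://archive.apache.org/dist/flink/flink-${FLINK_VERSION}/${FLINK_TGZ}"
FLINK_HOME="${HOME}/flink-${FLINK_VERSION}"

install_flink() {
    # Download and unpack in the home directory; stop at the first failure.
    cd "${HOME}" || return 1
    wget "${FLINK_URL}" || return 1
    tar xf "${FLINK_TGZ}" || return 1
    echo "Flink unpacked in ${FLINK_HOME}"
}

# Uncomment to actually download and unpack:
# install_flink
```

Keeping the download inside a function means the file can be sourced to set FLINK_HOME without triggering a new download.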

Local execution

cd $FLINK_HOME
bin/start-local.sh
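
Once start-local.sh has returned, one way to check that the local instance accepts jobs is to submit one of the bundled examples. The sketch below makes several assumptions: run_wordcount is a made-up helper name, and the jar path examples/batch/WordCount.jar is what this release is expected to ship (confirm it with ls examples before relying on it).

```shell
#!/bin/sh
# Hypothetical helper: run_wordcount is not part of the Flink distribution.
FLINK_HOME="${FLINK_HOME:-$HOME/flink-1.1.2}"

run_wordcount() {
    # Assumed location of the bundled batch example; adjust after `ls examples`.
    jar="${FLINK_HOME}/examples/batch/WordCount.jar"
    if [ ! -f "${jar}" ]; then
        echo "example jar not found: ${jar}" >&2
        return 1
    fi
    # bin/flink run submits the jar to the running JobManager.
    "${FLINK_HOME}/bin/flink" run "${jar}"
}

# Uncomment once the local instance is up:
# run_wordcount
```

With the default configuration the JobManager web dashboard should also be reachable at http://localhost:8081, and the local instance can be stopped again with bin/stop-local.sh.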


# Interactive Scala shell with an embedded local Flink cluster
cd $FLINK_HOME
bin/start-scala-shell.sh local


Cluster execution

Amazon AWS EC2