Apache Flink – Flink vs Spark vs Hadoop

The following table compares three of the most popular big data frameworks: Apache Hadoop, Apache Spark, and Apache Flink.

| Feature | Apache Hadoop | Apache Spark | Apache Flink |
|---|---|---|---|
| Year of Origin | 2005 | 2009 | 2009 |
| Place of Origin | MapReduce (Google), Hadoop (Yahoo) | University of California, Berkeley | Technical University of Berlin |
| Data Processing Engine | Batch | Batch | Stream |
| Processing Speed | Slower than Spark and Flink | Up to 100x faster than Hadoop | Faster than Spark |
| Programming Languages | Java, C, C++, Ruby, Groovy, Perl, Python | Java, Scala, Python and R | Java and Scala |
| Programming Model | MapReduce | Resilient Distributed Datasets (RDD) | Cyclic dataflows |
| Data Transfer | Batch | Batch | Pipelined and Batch |
| Memory Management | Disk Based | JVM Managed | Active Managed |
| Latency | High | Medium | Low |
| Throughput | Medium | High | High |
| Optimization | Manual | Manual | Automatic |
| API | Low-level | High-level | High-level |
| Streaming Support | N/A | Spark Streaming | Flink Streaming |
| SQL Support | Hive, Impala | Spark SQL | Table API and SQL |
| Graph Support | N/A | GraphX | Gelly |
| Machine Learning Support | N/A | Spark MLlib | FlinkML |
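
The "Programming Model" and "Data Transfer" rows are easier to picture with a small example. The sketch below is plain Python and does not use the Hadoop, Spark, or Flink APIs. The function names (batch_word_count, streaming_word_count, and the three phase helpers) are hypothetical and chosen only for illustration. It contrasts a MapReduce-style word count, which needs the complete input before it produces any result, with a pipelined word count that updates its state and emits a result for every record as it arrives.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(lines: Iterable[str]) -> List[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in every line."""
    return [(word.lower(), 1) for line in lines for word in line.split()]


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group values by key. This step needs the full map output."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for word, count in pairs:
        groups[word].append(count)
    return groups


def reduce_phase(groups: Dict[str, List[int]]) -> Dict[str, int]:
    """Reduce: sum the grouped values for each key."""
    return {word: sum(counts) for word, counts in groups.items()}


def batch_word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Batch style: results exist only after all three phases finish."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


def streaming_word_count(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Pipelined style: keep running state and emit after every record."""
    counts: Counter = Counter()
    for line in lines:
        for word in line.split():
            key = word.lower()
            counts[key] += 1
            yield key, counts[key]


if __name__ == "__main__":
    data = ["flink spark hadoop", "flink flink spark"]
    print(batch_word_count(data))
    for update in streaming_word_count(data):
        print(update)
```

In the batch version the caller sees one final dictionary of totals. In the pipelined version the caller sees a running count after each word, which is why a streaming engine can report results with lower latency while the input is still arriving.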
