data:image/s3,"s3://crabby-images/b54e0/b54e0544f8f781b2922eb5a58ed5cd05edb9fe20" alt="Apache Flink"
This Chapter is about Apache Flink – Creating a Flink Application. In this chapter, we will learn how to create a Flink application.
Open Eclipse IDE, click on New Project and Select Java Project.
data:image/s3,"s3://crabby-images/428a8/428a8c0aa3995af43237c5aaeabe0a9223d7a7f7" alt="Create Flink Application"
Give Project Name and click on Finish.
data:image/s3,"s3://crabby-images/60560/60560bc42d3151c28f7228db7960848bebff4b8c" alt="Create Flink Application2"
Now, click on Finish as shown in the following screenshot.
data:image/s3,"s3://crabby-images/837c1/837c16f04633cfed418610f8aee924adfb90370d" alt="Create Flink Application3"
Now, right-click on src and go to New >> Class.
data:image/s3,"s3://crabby-images/baea7/baea7aaa570aed3e3443bb7087c1c8d44143eeb7" alt="Create Flink Application4"
Give a class name and click on Finish.
data:image/s3,"s3://crabby-images/cfd6e/cfd6eee6cec3797e0cdd5ad6e7ebe29b336a91e0" alt="Create Flink Application5"
Copy and paste the below code in the Editor.
import org.apache.flink.api.common.functions.FlatMapFunction; import org.apache.flink.api.java.DataSet; import org.apache.flink.api.java.ExecutionEnvironment; import org.apache.flink.api.java.tuple.Tuple2; import org.apache.flink.api.java.utils.ParameterTool; import org.apache.flink.util.Collector; public class WordCount { // ************************************************************************* // PROGRAM // ************************************************************************* public static void main(String[] args) throws Exception { final ParameterTool params = ParameterTool.fromArgs(args); // set up the execution environment final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment(); // make parameters available in the web interface env.getConfig().setGlobalJobParameters(params); // get input data DataSet<String> text = env.readTextFile(params.get("input")); DataSet<Tuple2<String, Integer>> counts = // split up the lines in pairs (2-tuples) containing: (word,1) text.flatMap(new Tokenizer()) // group by the tuple field "0" and sum up tuple field "1" .groupBy(0) .sum(1); // emit result if (params.has("output")) { counts.writeAsCsv(params.get("output"), "\n", " "); // execute program env.execute("WordCount Example"); } else { System.out.println("Printing result to stdout. Use --output to specify output path."); counts.print(); } } // ************************************************************************* // USER FUNCTIONS // ************************************************************************* public static final class Tokenizer implements FlatMapFunction<String, Tuple2<String, Integer>> { public void flatMap(String value, Collector<Tuple2<String, Integer>> out) { // normalize and split the line String[] tokens = value.toLowerCase().split("\\W+"); // emit the pairs for (String token : tokens) { if (token.length() > 0) { out.collect(new Tuple2<>(token, 1)); } } } } }
You will get many errors in the editor, because Flink libraries need to be added to this project.
data:image/s3,"s3://crabby-images/f7ab2/f7ab26311da03d02bfae311cd38b41e0d203503a" alt="Flink libraries Added"
Right-click on the project >> Build Path >> Configure Build Path.
data:image/s3,"s3://crabby-images/23504/2350444d6b8a04b48e4d55a095ec9655a24be22b" alt="Right click Project"
Select the Libraries tab and click on Add External JARs.
data:image/s3,"s3://crabby-images/5006f/5006f9e616a9b7364499943d313ea2ff1a9befed" alt="Select Libraries"
Go to Flink’s lib directory, select all the 4 libraries and click on OK.
data:image/s3,"s3://crabby-images/442bf/442bf0eb92eed6fdd10b6f9c4625ac64d65e2b71" alt="Flinks lib directory"
Go to the Order and Export tab, select all the libraries and click on OK.
data:image/s3,"s3://crabby-images/fbb9d/fbb9ddeedbe7d2009a705d18d8f4aa9a432e5382" alt="Order and Export Tab"
You will see that the errors are no more there.
Now, let us export this application. Right-click on the project and click on Export.
data:image/s3,"s3://crabby-images/dd2bc/dd2bcd8613f28756ce3ab224f85c20b3d1040b54" alt="Export this Application"
Select JAR file and click Next
data:image/s3,"s3://crabby-images/f5a64/f5a64e31e9d3e4bdde22b409e441f622d6cf2afa" alt="Select JAR file"
Give a destination path and click on Next
data:image/s3,"s3://crabby-images/c4136/c413676d17a235ab41b0cebaf773465ddc34aeb0" alt="destination path"
Click on Next>
data:image/s3,"s3://crabby-images/9ecac/9ecacf7ed972f831d053484eae9caa205005ed51" alt="Click Next"
Click on Browse, select the main class (WordCount) and click Finish.
data:image/s3,"s3://crabby-images/7f54f/7f54f928d5e21e23c41504e75b0cd8078c2809a2" alt="Click Finish"
Note − Click OK, in case you get any warning.
Run the below command. It will further run the Flink application you just created.
./bin/flink run /home/ubuntu/wordcount.jar --input README.txt --output /home/ubuntu/output
data:image/s3,"s3://crabby-images/14005/14005674feba7f62348be916378cdc25568437a9" alt="Get Warning"
Next Topic : Click Here
Pingback: Apache Flink - Table API and SQL - Adglob Infosystem Pvt Ltd