http://github.com/GoogleCloudPlatform/DataflowJavaSDK
http://ostatic.com/blog/google-open-sources-cloud-dataflow-sdk-built-to-trump-mapreduce