Bigdata Boilerplate

Motivation

The aim is to create a disposable Hadoop/HBase/Spark/Flink/ML stack where you can test your jobs locally or to submit them to the Yarn resource manager. We are using Docker to build the environment and Docker-Compose to provision it with the required components (Next step using Kubernetes). Along with the infrastructure, We are check that it works with 4 projects that just probes everything is working as expected. The boilerplate is based on a sample search flight web application.

Keywords : Docker, (Kubernetes soon), Apache Spark SQL/Streaming/MLib, (Apache Flink, Kafka Streams soon), Scala, Python, Apache Kafka, Apache Hbase, Apache Avro, (Apache NiFi, Kylo next step), MongoDB, NodeJS (graphql, kafka-node, mongoose, avsc), Angular, Apollo-GraphQL

Prod mode

docker network create vnet
cd docker
docker-compose -f mongo.yml -f zookeeper.yml -f kafka.yml -f hadoop-hbase.yml up -d
docker-compose up -d

Dev mode

docker network create vnet
cd batch && sbt clean package assembly
cd ..
cd streaming && sbt clean package assembly
cd ..
cd docker
docker-compose -f mongo.yml -f zookeeper.yml -f kafka.yml -f hadoop-hbase.yml up -d
docker-compose -f dev/webapp.yml up -d
docker-compose -f dev/batch.yml up -d
docker-compose -f dev/streaming.yml up -d
docker-compose -f dev/ml.yml up -d

Interactions

Contributing

Pull requests are welcome.

Support

Please raise tickets for issues and improvements at https://github.com/Chabane/bigdata-boilerplate/issues

License

This example is released under version 2.0 of the Apache License.

Name		Name	Last commit message	Last commit date
Latest commit History 250 Commits
batch		batch
docker		docker
ml		ml
streaming		streaming
webapp		webapp
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bigdata Boilerplate

Motivation

Prod mode

Dev mode

Interactions

Contributing

Support

License

About

Releases

Packages

Languages

License

msellamiTN/bigdata-boilerplate

Folders and files

Latest commit

History

Repository files navigation

Bigdata Boilerplate

Motivation

Prod mode

Dev mode

Interactions

Contributing

Support

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages