This project runs in a Windows environment and is intended for data analysis; it includes no ETL steps. It depends on the following: Scala 2.12, Java 8u351, Python 3.7, Hadoop 2.7, and Spark 3.0.3.
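
Before running the project, it can help to check that the local machine matches the environment listed above. The following is a minimal sketch, not part of the project itself: the helper names `python_version_ok` and `missing_env_vars` are hypothetical, and the environment variables `JAVA_HOME`, `HADOOP_HOME`, and `SPARK_HOME` are an assumption about how a local Spark install is usually configured, not something this project states.

```python
import os
import sys

# Assumption: these variables point at the local Java, Hadoop, and Spark installs.
# The project does not name them; adjust the tuple to match your setup.
ASSUMED_ENV_VARS = ("JAVA_HOME", "HADOOP_HOME", "SPARK_HOME")


def python_version_ok(version_info=sys.version_info, expected=(3, 7)):
    """Return True when the interpreter's major.minor version equals `expected`."""
    return tuple(version_info[:2]) == expected


def missing_env_vars(env=None, names=ASSUMED_ENV_VARS):
    """Return the names in `names` that are unset or empty in `env`."""
    env = os.environ if env is None else env
    return [name for name in names if not env.get(name)]


if __name__ == "__main__":
    print("Python 3.7:", python_version_ok())
    print("Missing variables:", missing_env_vars() or "none")
```

The script only reports what it finds; it does not verify the Scala, Java, Hadoop, or Spark versions themselves, which would need to be checked with each tool's own version command.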