-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathReadMe.txt
49 lines (36 loc) · 2.47 KB
/
ReadMe.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
DESCRIPTION - Describe the package in a few paragraphs
Green Analytics is a unique and creative platform to analyze, visualize and identify the socio-economic factors
that are most relevant to the changes of Forest cover rate. It is a user-friendly visual tool that determines
correlations against Forest cover changes and socio-economic factors, over the course of past 30 years.
Tech used - Jupyter Notebook, Python 3.7 or higher, Tableau Desktop 2021 or higher
Contents
- CODE - Parent directory for all code and data
| - data - All input and output data is collected here (Unzip data.zip to get unprocessed data)
| - viz - Contains "Tableau" visualization file - "green_analytics_v2.twbx"
| - Correlation.ipynb - Creates Pearson's correlation coefficients for all the countries - Output -> data_relationship.csv
| - data-clustering.ipynb - Does Principal component analysis and clustering for countries - Output -> data-clustering.csv
| - Project_Wrangling.ipynb - Reading and cleaning of downloaded and web scraped data - Output -> data_countries.csv, data_measurements.csv
- DOC
| - Poster - Project Poster
| - Report - Project Report
INSTALLATION - How to install and setup your code
- Install Jupyter notebook - https://jupyter.org/install
- Install Python 3 (3.7 or higher) - https://realpython.com/installing-python/
- Install Tableau Desktop 2021 (or higher) - https://www.tableau.com/products/desktop
EXECUTION - How to run a demo on your code - The input and output files are already included in the "data" folder,
so execution is "Optional"
- Navigate to the "CODE" folder and launch / run Jupyter Notebook
- SAMPLE - CODE> Jupyter Notebook
- Within the Jupyter - Open Project_Wrangling.ipynb
- Go to Cell menu and Run All
- Output files - data_countries.csv, data_measurements.csv would be created in the data folder
- Within the Jupyter - Open Correlation.ipynb
- Go to Cell menu and Run All
- Output files - data_relationship.csv would be created in the data folder
- Within the Jupyter - Open data-clustering.ipynb
- Go to Cell menu and Run All
- Output files - data-clustering.csv would be created in the data folder
- Now open "green_analytics_v2.twbx" which has pre-created dashboards and preloaded data
- If required the data sources can be remapped by going to "Data Source" in the bottom left corner of the tableau
dashboard
- Map the data sources in the "data" folder.