-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.qmd
132 lines (89 loc) · 3.98 KB
/
index.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
---
title: "Data Analyst Job Insights"
subtitle: "Based on 15K+ real job ads"
author: "Rafael Belokurows"
date: "June 2023"
format:
revealjs:
width: 1600
height: 900
transition: slide
transition-speed: fast
slide-number: c/t
footer: "@rafabelokurows - <a href=\"https://github.com/rafabelokurows/data-analyst-job-skills\">Github repository</a> - <a href=\"https://rafabelokurows.medium.com/what-do-companies-want-in-a-data-analyst-and-the-salaries-they-offer-in-return-fba8324101db\">Medium article</a>"
theme: moon
editor: visual
---
## Goal
- Identifying trends in Data Analyst Job Ads
- Are there common characteristics in jobs that offer higher salaries?
## Tools used
- Python to scrape, analyze and plot
- *spacy*: text extraction
- *pandas*: storing, processing, feature engineering
- *plotly*: plots (duh)
- *pycaret*: ML model building and evaluation
- Quarto - using R - to present
## Top Skills

- SQL is the top skill in general
- Python 2 for 1 compared with R
- Tableau \> Power BI
- AWS the top cloud provider
## Salary

## Skills x Salary
::: columns
::: {.column width="50%"}
{alt="Python means more $"}
:::
::: {.column width="50%"}
{alt="Excel means less $"}
:::
:::
## Salary x Skills (cont.)
::: columns
::: {.column width="50%"}
{alt="Cloud and Engineer stuff👍🏻"}
:::
::: {.column width="50%"}
{alt="Cliches 👎🏻"}
:::
:::
## How prevalent are AI buzzwords?\*
::: columns
::: {.column width="50%"}

:::
::: {.column width="50%"}
:::
:::
## Prediction 🔮
::: columns
::: {.column width="30%"}
{alt="% of jobs with salary larger than a few thresholds — choosing my goal"}
:::
::: {.column width="70%"}
{alt="ML model results"}
:::
:::
## ML Results - Feature Importance
{alt="Skills are good for you (but not just any skills)"}
## Verifying one prediction
::: {.fragment fragment-index="1"}
{alt="My model says this job doesn’t pay so well" width="542"}
:::
::: {.fragment fragment-index="2"}
{alt="And my model was right 🥳" width="453"}
:::
## Takeaways
- If training to be a Data Analyst, make sure you are proficient in at least:
- SQL
- Tableau or Power BI
- Python (or R for some specific fields or skills)
- One cloud platform, especially AWS
- Be on lookout for positions with the following cliches on the job description. Those are probably workplaces that will challenge your mental health and you won't even earn enough for it to be worth it:
- "ability to work under pressure"
- "multitasking"
- "fast-paced environment"
- European companies, please start disclosing salaries for your positions, thank you in advance!