Data Science Salaries¶

Problem:¶

Do higher levels of experience in data science jobs actually lead to higher salaries?

Solution:¶

My project compares how salary varies with the level of experience a data science job requires. I will compare entry-level, mid-level, senior-level, and executive-level jobs to see how much their salaries differ. See https://www.interviewquery.com/salaries/entry-level-data-scientist-salary for more information on comparing salaries across experience levels.

Data Set¶

In [2]:
import pandas as pd

df_data_science = pd.read_csv('Data Science Jobs Salaries.csv')
In [3]:
df_data_science
Out[3]:
     work_year  experience_level  employment_type  job_title                  salary   salary_currency  salary_in_usd  employee_residence  remote_ratio  company_location  company_size
0    2021e      EN                FT               Data Science Consultant    54000    EUR              64369          DE                  50            DE                L
1    2020       SE                FT               Data Scientist             60000    EUR              68428          GR                  100           US                L
2    2021e      EX                FT               Head of Data Science       85000    USD              85000          RU                  0             RU                M
3    2021e      EX                FT               Head of Data               230000   USD              230000         RU                  50            RU                L
4    2021e      EN                FT               Machine Learning Engineer  125000   USD              125000         US                  100           US                S
...  ...        ...               ...              ...                        ...      ...              ...            ...                 ...           ...               ...
240  2020       SE                FT               Data Scientist             412000   USD              412000         US                  100           US                L
241  2021e      MI                FT               Principal Data Scientist   151000   USD              151000         US                  100           US                L
242  2020       EN                FT               Data Scientist             105000   USD              105000         US                  100           US                S
243  2020       EN                CT               Business Data Analyst      100000   USD              100000         US                  100           US                L
244  2021e      SE                FT               Data Science Manager       7000000  INR              94917          IN                  50            IN                L

245 rows × 11 columns

Data Dictionary:¶

Column              Data Type  Description
work_year           String     Year the salary was paid (some values carry a suffix, e.g. 2021e)
experience_level    String     Experience level of the job (EN, MI, SE, or EX)
employment_type     String     Type of employment (e.g. FT, CT)
job_title           String     Title of the job
salary              Integer    Salary paid, in the currency given by salary_currency
salary_currency     String     Currency the salary is paid in
salary_in_usd       Integer    Salary converted to US dollars
employee_residence  String     Employee's country of residence
remote_ratio        Integer    Share of work done remotely (e.g. 0, 50, 100)
company_location    String     Country where the company is located
company_size        String     Size of the company (S, M, or L)
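
The displayed rows include work_year values such as 2021e, so pandas will not load that column as an integer as-is. A minimal sketch of one way to recover a numeric year is shown here. The names year_number and has_suffix are hypothetical helpers, not part of the data set, and the sketch does not assume any particular meaning for the "e" suffix.

```python
import pandas as pd

# work_year values as they appear in the displayed rows.
work_year = pd.Series(["2021e", "2020", "2021e", "2021e", "2020"], name="work_year")

# Pull out the four-digit year and convert it to an integer.
year_number = work_year.str.extract(r"(\d{4})", expand=False).astype(int)

# Keep track of which rows carried the suffix, so no information is lost.
has_suffix = work_year.str.endswith("e")

print(year_number.tolist())
print(has_suffix.tolist())
```

On the full data set the same two lines could be applied to df_data_science['work_year'].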

How Will I Use My Code:¶

The data contains the salary and the experience level for each job, so I can check whether a higher experience level is associated with higher pay. To determine this, I can group the jobs by experience level, find the average salary in each group, and check whether the salaries in each group are skewed. I can also use clustering to group jobs by experience level and job title and estimate the typical pay for each group.
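
The grouping step described above could be sketched roughly as follows. This is only an illustration: it uses the ten rows visible in the output above instead of the full CSV, it compares salary_in_usd so that different currencies are on the same scale, and it assumes the codes EN, MI, SE, and EX stand for entry, mid, senior, and executive level. The function name summarize_by_experience is hypothetical.

```python
import pandas as pd

# Sample copied from the rows displayed above; the full analysis
# would use df_data_science loaded from the CSV instead.
sample = pd.DataFrame({
    "experience_level": ["EN", "SE", "EX", "EX", "EN", "SE", "MI", "EN", "EN", "SE"],
    "salary_in_usd": [64369, 68428, 85000, 230000, 125000,
                      412000, 151000, 105000, 100000, 94917],
})

# Assumed ordering of the codes, from entry level to executive level.
level_order = ["EN", "MI", "SE", "EX"]


def summarize_by_experience(df):
    """Return count, mean, median, and skewness of USD salary per experience level."""
    summary = (
        df.groupby("experience_level")["salary_in_usd"]
        .agg(["count", "mean", "median", "skew"])
    )
    # Reorder the rows so they read from entry level to executive level.
    return summary.reindex(level_order)


summary = summarize_by_experience(sample)
print(summary)
```

Comparing the mean with the median gives a quick hint of skew: a mean well above the median suggests a few very high salaries are pulling the average up. Skewness is undefined for groups with fewer than three rows, so it shows as NaN for those groups in this small sample.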