Do different levels of experience in data science jobs actually lead to a higher pay in salary
My project compares how different levels of experience for a job compare to the amount of pay that an employee is getting. I will be comparing entry level, mid-level, senior level, and executive level jobs to see how different the salaries can be. https://www.interviewquery.com/salaries/entry-level-data-scientist-salary here is a link to more information on comparing the different level experience for jobs.
import pandas as pd
df_data_science = pd.read_csv('Data Science Jobs Salaries.csv')
df_data_science
work_year | experience_level | employment_type | job_title | salary | salary_currency | salary_in_usd | employee_residence | remote_ratio | company_location | company_size | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2021e | EN | FT | Data Science Consultant | 54000 | EUR | 64369 | DE | 50 | DE | L |
1 | 2020 | SE | FT | Data Scientist | 60000 | EUR | 68428 | GR | 100 | US | L |
2 | 2021e | EX | FT | Head of Data Science | 85000 | USD | 85000 | RU | 0 | RU | M |
3 | 2021e | EX | FT | Head of Data | 230000 | USD | 230000 | RU | 50 | RU | L |
4 | 2021e | EN | FT | Machine Learning Engineer | 125000 | USD | 125000 | US | 100 | US | S |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
240 | 2020 | SE | FT | Data Scientist | 412000 | USD | 412000 | US | 100 | US | L |
241 | 2021e | MI | FT | Principal Data Scientist | 151000 | USD | 151000 | US | 100 | US | L |
242 | 2020 | EN | FT | Data Scientist | 105000 | USD | 105000 | US | 100 | US | S |
243 | 2020 | EN | CT | Business Data Analyst | 100000 | USD | 100000 | US | 100 | US | L |
244 | 2021e | SE | FT | Data Science Manager | 7000000 | INR | 94917 | IN | 50 | IN | L |
245 rows × 11 columns
Column | Data Type | Description |
---|---|---|
work_year | Integer | The year the salary was paid |
experience_level | String | Experience level needed for job |
employment_type | String | What kind of employment the job is |
job_title | String | Title of job |
salary | Integer | Salary paid for job |
salary_currency | String | The type of currency |
salary_in_usd | String | Currency in USD |
employee_residence | String | Country of Residence |
remote_ratio | Integer | How much remote work is done |
company_location | String | location of company |
company_size | String | Size of company |
The data contains the salary and level of experience needed for each job so I can compare to see if getting a higher level job relates to a higher pay. To determine this, I can sort each type of job and find the average salary as well as whether the data is skewed. I can also use clustering to determine what the pay of a job would be based on the level of experience and title of job.