# does money buy happiness
import pandas as pd
import numpy as np
https://www.kaggle.com/datasets/jahaidulislam/world-happiness-report-2005-2021
df = pd.read_csv('World Happiness Report 2005-2021.csv')
df.head()
Country name | Year | Life Ladder | Log GDP per capita | Social support | Healthy life expectancy at birth | Freedom to make life choices | Generosity | Perceptions of corruption | Positive affect | Negative affect | Confidence in national government | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | Afghanistan | 2008 | 3.723590 | 7.302574 | 0.450662 | 50.500000 | 0.718114 | 0.173169 | 0.881686 | 0.414297 | 0.258195 | 0.612072 |
1 | Afghanistan | 2009 | 4.401778 | 7.472446 | 0.552308 | 50.799999 | 0.678896 | 0.195469 | 0.850035 | 0.481421 | 0.237092 | 0.611545 |
2 | Afghanistan | 2010 | 4.758381 | 7.579183 | 0.539075 | 51.099998 | 0.600127 | 0.125859 | 0.706766 | 0.516907 | 0.275324 | 0.299357 |
3 | Afghanistan | 2011 | 3.831719 | 7.552006 | 0.521104 | 51.400002 | 0.495901 | 0.167723 | 0.731109 | 0.479835 | 0.267175 | 0.307386 |
4 | Afghanistan | 2012 | 3.782938 | 7.637953 | 0.520637 | 51.700001 | 0.530935 | 0.241247 | 0.775620 | 0.613513 | 0.267919 | 0.435440 |
(1%) Describes and motivates a real-world problem where data science may provide helpful insights. Your description should be easily understood by a casual reader and include citations to motivating sources or relevant information (e.g. news articles, further reading links … Wikipedia makes for a poor reference but the links it cites are usually promising).
(1%) Explicitly load and show your dataset. Provide a data dictionary which explains the meaning of each feature present. Demonstrate that this data is sufficient to make progress on your real-world problem described above.
(1%) Write one or two sentences about how the data will be used to solve the problem. Earlier in the semester, we won’t have studied the Machine Learning methods just yet but you should have a general idea of what the ML will set out to do. For example:
“We’ll cluster the movies into sets of movies which are often watched by the same users. Doing so allows us to discover if there is a more natural grouping of movies rather than the traditional genres: horror, comedy, romantic-comedy, etc”.