Correlation Between Gray Matter Volume, Cardiorespiratory Fitness, and Adiposity.¶

Alzheimer's Disease is a highly progressive neurological disorder characterized by accumulation of amyloid plaques and Tau protein in the brain. One of the most notable effects of Alzheimer's Disease is the loss of gray matter volume in the brain, contributing to decline in memory processing and cognitive function.

Many modifiable and nonmodifiable environmental factors have been studied in recent years, that are hypothesized to contribute to the onset and prevalence of Alzheimer's Disease. Amongst these factors include cardiorespiratory fitness, and adiposity. Research has shown that lack of physical activity, and increase in adiposity affect oxygen delivery to the brain, and causes increased risk of cognitive impairment. However, increased levels of physical activity and lower adiposity is reported to preserve gray matter volume in the brain.

In 2012, the United States alone presented with approximately 4.5-5.4 million cases of Alzheimer's Disease, with a worldwide prevalence of approximately 26.6 million cases. It is projected that the prevalence of Alzheimer's Disease will only increase. Therefore, it is important to study which factors may impact the development of such a disease, in order to find preventative measures. As individuals are also becoming increasingly sedentary, and obesity rates are increasing, studying the correlation between physical activity, adiposity, and Alzheimer's Disease would be vital in determining ways to prevent this devastating neurological disease.

Further information about these impacts of Alzheimer's Disease and cardiorespiratory fitness can be found through the following literature:

  • MayoClinic - Alzheimer's Disease: Can exercise prevent memory loss?
  • Beckett, Ardern, & Rotondi: A meta-analysis of prospective studies on the role of physical activity and the prevention of Alzheimer's Disease in older adults
  • Hildreth, Pelt & Schwartz: Obesity, Insulin Resistance, and Alzheimer's Disease

This project utilizes a dataset found on Harvard Dataverse, that includes data from the "Alzheimer's Disease Exercise Program Trial", a clinical trial attempting to find how exercise impacts Alzheimer's Disease. Using Python, the data of interest will be extracted from the dataset, and will be used to complete a correlational study.

In [6]:
# Loading and Showing Dataset

import pandas as pd

alzheimers_df = pd.read_csv('ADEPT_Data.csv')

alzheimers_df
Out[6]:
IDn Timepoint Arm E4carrier CDR_Sum_BL GLOBALCDR_BL Age Sex Edu DurationPct ... Stroop_Interfer_Task Letter_Number SRT_Free_Total Verb_Animal Verb_Vegetable Max_VO2_mL MMSE TotalGrayVol B_Hippocampus Six_Min_Walk
0 11969-001 Baseline ST Carrier 3.5 0.5 75.6 Female 18 0.96 ... 40.0 8.0 5.0 14.0 8.0 15.8 25.0 493356.6549 4569.2 581.0
1 11969-001 Week_13 ST Carrier NaN NaN 75.6 Female 18 0.96 ... 44.0 9.0 4.0 16.0 7.0 NaN 24.0 NaN NaN NaN
2 11969-001 Week_26 ST Carrier NaN NaN 75.6 Female 18 0.96 ... 49.0 9.0 7.0 19.0 10.0 19.4 25.0 514883.3306 4603.4 587.0
3 11969-003 Baseline ST Carrier 5.0 1.0 83.6 Female 18 0.79 ... 15.0 2.0 12.0 9.0 5.0 16.9 23.0 464126.6173 5041.4 395.0
4 11969-003 Week_13 ST Carrier NaN NaN 83.6 Female 18 0.79 ... 18.0 1.0 8.0 16.0 9.0 NaN 26.0 NaN NaN NaN
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
223 11969-109 Week_13 ST Carrier NaN NaN 84.5 Male 18 0.40 ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
224 11969-109 Week_26 ST Carrier NaN NaN 84.5 Male 18 0.40 ... NaN NaN NaN NaN NaN 16.8 NaN 601744.6056 7102.0 217.0
225 11969-112 Baseline AEx No_E4 8.0 1.0 77.9 Male 16 0.44 ... 22.0 7.0 7.0 14.0 10.0 13.9 27.0 562787.6559 4972.0 342.0
226 11969-112 Week_13 AEx No_E4 NaN NaN 77.9 Male 16 0.44 ... 22.0 6.0 10.0 13.0 7.0 NaN 26.0 NaN NaN NaN
227 11969-112 Week_26 AEx No_E4 NaN NaN 77.9 Male 16 0.44 ... 2.0 6.0 5.0 14.0 8.0 14.9 25.0 559959.8920 4834.4 400.0

228 rows × 31 columns

Data Dictionary¶

The Data Dictionary below shows a brief description of what each column in the Dataset means. For the purposes of this project, not all columns will be utilized. However, columns such as TotalGrayVol, MaxVO2_ml, BodyMass, and Lean_Mass will be of most importance. This is because TotalGrayVol is representative of each participant's total gray matter as they progress through the study; BodyMass and Lean_Mass are measures of adiposity, similar to Body Mass Index; MaxVO2_ml is representative of each participant's VO2 max score, which is a measure of cardiorespiratory fitness. These columns will mainly be used to find correlational data between each other. For instance, determining of VO2 max scores are proportional to the amount of gray matter volume.

Column Data Type Description
IDn String Participant ID Number
Timepoint String Timepoint of Measurement
Arms String Intervention Arm
E4 Carrier String APOE e4 Carrier Status
CDR_Sum_BL Float Clinical Dementia Rating Sum of Boxes at Baseline
GLOBALCDR_BL Float Clinical Dementia Rating Global Score at Baselin
Age Float Age in years
Sex String Sex
Edu Integer Education in years
DurationPct Float Percentage of prescribed exercise completed by participant
BodyMass Float Total body mass in kg
Lean_Mass Float Lean body mass in kg
CSD Integer Cornell Scale for Depression in Dementia
DAD_Assess_Total Integer Disability Assessment for Dementia Total Score
DAD.NA Integer Disability Assessment for Dementia not available items
Log_Mem Integer Logical Memory test
Delay_Memr Integer Delayed Logical Memory test
Digit_Back Integer WAIS Digit Span Backward task
Digit_Forw Integer WAIS Digit Span Forward task
DKEFS_CP Integer DKEFS Confirmed Perceptual Card Sorts sum of card set 1 and 2
DKEFS_FS Integer DKEFS Free Card Sorts sum of card set 1 and 2
Stroop_Interfer_Task Integer Stroop Interference task
Letter_Number Integer Letter Number sequencing task
SRT_Free_Total Integer Serial Reminding Task sum of 3 free recall attempts
Verb_Animal Integer Verbal Fluency - Animal Naming
Verb_Vegetable Integer Verbal Fluency - Vegetable Naming
Max_VO2_ml Float Maximal oxygen consumption during a graded exercise test in mL/kg body mass/minute
MMSE Integer Mini-mental State Exam
TotalGrayVol Float Total gray matter volume in mL
B_Hippocampus Float Bilateral hippocampal volume in mL
Six_Min_Walk Integer Six-minute Walk Test in yards

How Will The Data Be Used to Solve The Problem?¶

We will cluster the data by participant, as this dataset dedicates three rows per participant. Doing so allows us to track how the participant's volume of grey matter may have improved or depreciated over the course of the study. Then, we can use this to discover how exercise has impacted each individual participant, while also being able to take a "grand mean" of all of the data to see how exercise has impacted Alzheimer's Disease throughout all participants.