91261: 0/12

AS91264 Analysis from PPDAC Cycle

LICENSE

All of these features must be linked to the context for a MRT/EXC, including what the feature from your sample means (often comes from common sense), and if they make sense with regards to the context, and extended thinking (which also often comes from common sense).
(pretty sure you only have to talk about 3~4)

  • Symmetry - whether the shape is symmetrical or not (center is at median). Could be found by looking at the whisker.
    -> From this, the skew (right/left skewed) in terms of shape can be found.
  • Modelity - whether the group has 1, 2, or more modes (spikes) in the graph. Recommended that the group can be viewed distinctively.
  • Center (median) - where the median of the group is. Justify their median in numeric values, then differences between each group numerically.
  • Range - the interquartile range representing the middle 50% of the graph, or the whole spread (100% of graph) including the whisker section of the graph.
    -> Show calculations for IQR: UQ-LQ.
  • Cluster - show the cluster from the graph. Similar to modelity, since clusters have a mode most of the time.
  • Outliers (if any) - identify an obvious outlier (eg: “height is 20 meters”), and justify whether if you should keep them or not.

NOTE: Confidence Intervals (“C-I”) & (sampling) variabilities go in the conclusion.

My sample (not graded yet):
(context being age of sleep hours of the people who have their devices in bed or not)

In my sample, I notice that the graph representing the people who don’t have their 
devices in bed is symmetrical, while the graph of the people who don’t have their 
devices in bed does not have a symmetrical shape. Both graphs are skewed to the right, 
meaning that the data is more distributed towards the right hand side of the graph. 
This might mean that for the bottom graph, since it is asymmetrical, this means that 
there are people who can have high sleep hours even with their devices in bed, and 
people who cannot have high sleep hours with their devices in bed. Additionally, this 
means that for the top graph, a majority of people have a sleep time around the median 
of the graph, meaning that the people who don’t have their devices in bed have more of 
a consistent sleep time.

From this sample, I notice that for the people who don’t have their devices in bed (top 
graph), they have a unimodal graph, while for the people who have their devices in bed 
(bottom graph) have a bimodal graph with two groups – one near the median, and the 
other near the 18 hour point. This might mean, and makes sense that the people who do 
not have their devices in bed have a higher sleep hour compared to the students who do 
have their devices in bed. Moreover, this makes sense for the top bottom sample as well 
since people who have their devices in bed are likely to go on it during the night, and 
have less sleep, resulting in the people oversleeping, which may have led to a cluster 
near the 18 hour point. 

The center of my sample’s top graph representing the people who do not have their 
devices in bed has its median at 9, while for the people who do have their devices in 
bed have their median at 8.25, making it a difference of 0.75 hours, equivalent to 45 
minutes of sleep time. This indicates that people who don’t have their devices in bed 
sleep around 45 minutes more than the people who have their devices in bed. It makes 
sense for the bottom graph as using your devices a few hours before bed, (including 
taking your device to bed) can disrupt your sleep quality and time, resulting in a 
relatively lower sleep hour, while giving you the chance to sleep in, which the 
evidence is provided by the cluster near the 18 hour group. It also makes sense for the 
top graph as people who do not take their devices have a more consistent sleep hours 
compared to the people who do.

The interquartile range (“IQR”) of the people who do not have their devices in bed is 
10-8.5=1.5, while for the people who have their devices in bed they have an IQR of 
9.5-7.25=2.25. This means that the people who do not have their devices in bed have a 
more consistent sleep schedule, while the people who take their devices to bed have a 
more distributed sleep graph meaning that the sleep time is more random compared to the 
other group. The range of the sample of those who don’t have their devices in bed is 
20-4=16, while for the people who have their devices in bed, they have a range of 
20.5-3=17.5. This means that the people who have their devices in bed have a more 
randomized sleep time compared to the people who do not have their devices in bed. These 
make sense since people who have their devices in bed have more chances of sleeping in, 
and not sleeping at all, leading to a wider range than the people who do not have their 
devices in bed. 

As stated before in the modelity section, for the people who do not have their devices 
in bed have a unimodal graph, while for the other group, they have a bimodal group. This 
means that the first group who do not have their devices in bed only has one cluster, 
while the other group has 2. This makes sense as people who take their devices to bed 
usually have inconsistent sleep hours, leading to the two schedules, while for the 
people who do not have their devices in bed have a consistent sleep hours between 
themselves, leading to only one cluster in the sample. 

Author: 영
Source: wndnotes
Link: https://discord.gg/yHdQrahnAJ