This document presents an automated and fully reproducible R
pipeline designed for data profiling and exploratory data
analysis (EDA) on the Bellabeat datasets.
It establishes a foundation for all subsequent analytical, modelling,
and reporting activities.
The pipeline performs a comprehensive assessment of the raw data to
validate their structure, quality, and
temporal coherence prior to any advanced analysis. It
covers eleven datasets of
Fitabase Data 3.12.16–4.11.16 sourced from
activity-tracking sensors and examines:
A central function, profile_csv(),
standardizes all quality checks across datasets: automated type
detection, missing-value assessment, cardinality analysis, descriptive
statistics, temporal auditing, duplicate identification, and enhanced
summaries via skimr.
Batch execution through purrr::map()
ensures homogeneity, reproducibility, and scalability
across the entire processing workflow.
The function auto_parse_datetime()
automatically handles the detection and conversion of temporal columns,
ensuring reliable interpretation of dates and timestamps despite the
variety of formats present in the raw data. It provides consistent
timestamp handling and delivers the uniformity required for all
time-based analyses in the pipeline. By centralizing this logic, the
pipeline enforces coherent normalization and eliminates ambiguities
associated with textual time formats.
Together, these elements provide a technical backbone for subsequent phases of data cleaning, feature engineering, aggregation, and analytical modelling.
File name : dailyActivity_merged.csv
File size : 0.05 Mo
Number of observations : 457
Number of columns : 15
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 35 |
| ActivityDate | POSIXct | 0 | 0 | 32 |
| TotalSteps | numeric | 0 | 0 | 389 |
| TotalDistance | numeric | 0 | 0 | 334 |
| TrackerDistance | numeric | 0 | 0 | 336 |
| LoggedActivitiesDistance | numeric | 0 | 0 | 20 |
| VeryActiveDistance | numeric | 0 | 0 | 170 |
| ModeratelyActiveDistance | numeric | 0 | 0 | 140 |
| LightActiveDistance | numeric | 0 | 0 | 295 |
| SedentaryActiveDistance | numeric | 0 | 0 | 8 |
| VeryActiveMinutes | numeric | 0 | 0 | 85 |
| FairlyActiveMinutes | numeric | 0 | 0 | 62 |
| LightlyActiveMinutes | numeric | 0 | 0 | 251 |
| SedentaryMinutes | numeric | 0 | 0 | 315 |
| Calories | numeric | 0 | 0 | 383 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityDate | 2016-03-12 | 2016-04-12 | 2016-04-04 04:40:26 | 2016-04-05 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| TotalSteps | 0 | 28497.000000 | 6546.5623632 | 5986.00 |
| TotalDistance | 0 | 27.530001 | 4.6635230 | 4.09 |
| TrackerDistance | 0 | 27.530001 | 4.6098468 | 4.09 |
| LoggedActivitiesDistance | 0 | 6.727057 | 0.1794274 | 0.00 |
| VeryActiveDistance | 0 | 21.920000 | 1.1808972 | 0.00 |
| ModeratelyActiveDistance | 0 | 6.400000 | 0.4786433 | 0.02 |
| LightActiveDistance | 0 | 12.510000 | 2.8901969 | 2.93 |
| SedentaryActiveDistance | 0 | 0.100000 | 0.0019037 | 0.00 |
| VeryActiveMinutes | 0 | 202.000000 | 16.6236324 | 0.00 |
| FairlyActiveMinutes | 0 | 660.000000 | 13.0700219 | 1.00 |
| LightlyActiveMinutes | 0 | 720.000000 | 170.0700219 | 181.00 |
| SedentaryMinutes | 32 | 1440.000000 | 995.2822757 | 1057.00 |
| Calories | 0 | 4562.000000 | 2189.4529540 | 2062.00 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | TotalSteps | 0 | 1 | 6546.5623632 | 5398.4930639 | 0 | 1988.00 | 5986.00 | 10198.00 | 28497.000000 |
| numeric | TotalDistance | 0 | 1 | 4.6635230 | 4.0820723 | 0 | 1.41 | 4.09 | 7.16 | 27.530001 |
| numeric | TrackerDistance | 0 | 1 | 4.6098468 | 4.0685399 | 0 | 1.28 | 4.09 | 7.11 | 27.530001 |
| numeric | LoggedActivitiesDistance | 0 | 1 | 0.1794274 | 0.8492318 | 0 | 0.00 | 0.00 | 0.00 | 6.727057 |
| numeric | VeryActiveDistance | 0 | 1 | 1.1808972 | 2.4871586 | 0 | 0.00 | 0.00 | 1.31 | 21.920000 |
| numeric | ModeratelyActiveDistance | 0 | 1 | 0.4786433 | 0.8309952 | 0 | 0.00 | 0.02 | 0.67 | 6.400000 |
| numeric | LightActiveDistance | 0 | 1 | 2.8901969 | 2.2375233 | 0 | 0.87 | 2.93 | 4.46 | 12.510000 |
| numeric | SedentaryActiveDistance | 0 | 1 | 0.0019037 | 0.0084868 | 0 | 0.00 | 0.00 | 0.00 | 0.100000 |
| numeric | VeryActiveMinutes | 0 | 1 | 16.6236324 | 28.9197037 | 0 | 0.00 | 0.00 | 25.00 | 202.000000 |
| numeric | FairlyActiveMinutes | 0 | 1 | 13.0700219 | 36.2086352 | 0 | 0.00 | 1.00 | 16.00 | 660.000000 |
| numeric | LightlyActiveMinutes | 0 | 1 | 170.0700219 | 122.2053721 | 0 | 64.00 | 181.00 | 257.00 | 720.000000 |
| numeric | SedentaryMinutes | 0 | 1 | 995.2822757 | 337.0214040 | 32 | 728.00 | 1057.00 | 1285.00 | 1440.000000 |
| numeric | Calories | 0 | 1 | 2189.4529540 | 815.4845229 | 0 | 1776.00 | 2062.00 | 2667.00 | 4562.000000 |
| Id | ActivityDate | TotalSteps | TotalDistance | TrackerDistance | LoggedActivitiesDistance | VeryActiveDistance | ModeratelyActiveDistance | LightActiveDistance | SedentaryActiveDistance | VeryActiveMinutes | FairlyActiveMinutes | LightlyActiveMinutes | SedentaryMinutes | Calories |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1503960366 | 2016-03-25 | 11004 | 7.11 | 7.11 | 0 | 2.57 | 0.46 | 4.07 | 0 | 33 | 12 | 205 | 804 | 1819 |
| 1503960366 | 2016-03-26 | 17609 | 11.55 | 11.55 | 0 | 6.92 | 0.73 | 3.91 | 0 | 89 | 17 | 274 | 588 | 2154 |
| 1503960366 | 2016-03-27 | 12736 | 8.53 | 8.53 | 0 | 4.66 | 0.16 | 3.71 | 0 | 56 | 5 | 268 | 605 | 1944 |
| 1503960366 | 2016-03-28 | 13231 | 8.93 | 8.93 | 0 | 3.19 | 0.79 | 4.95 | 0 | 39 | 20 | 224 | 1080 | 1932 |
| 1503960366 | 2016-03-29 | 12041 | 7.85 | 7.85 | 0 | 2.16 | 1.09 | 4.61 | 0 | 28 | 28 | 243 | 763 | 1886 |
| 1503960366 | 2016-03-30 | 10970 | 7.16 | 7.16 | 0 | 2.36 | 0.51 | 4.29 | 0 | 30 | 13 | 223 | 1174 | 1820 |
| 1503960366 | 2016-03-31 | 12256 | 7.86 | 7.86 | 0 | 2.29 | 0.49 | 5.04 | 0 | 33 | 12 | 239 | 820 | 1889 |
| 1503960366 | 2016-04-01 | 12262 | 7.87 | 7.87 | 0 | 3.32 | 0.83 | 3.64 | 0 | 47 | 21 | 200 | 866 | 1868 |
| 1503960366 | 2016-04-02 | 11248 | 7.25 | 7.25 | 0 | 3.00 | 0.45 | 3.74 | 0 | 40 | 11 | 244 | 636 | 1843 |
| 1503960366 | 2016-04-03 | 10016 | 6.37 | 6.37 | 0 | 0.91 | 1.28 | 4.18 | 0 | 15 | 30 | 314 | 655 | 1850 |
File name : heartrate_seconds_merged.csv
File size : 39.17 Mo
Number of observations : 1154681
Number of columns : 3
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 14 |
| Time | POSIXct | 0 | 0 | 510597 |
| Value | numeric | 0 | 0 | 148 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| Time | 2016-03-29 00:00:05 | 2016-04-12 11:03:15 | 2016-04-06 03:14:15 | 2016-04-06 03:55:07 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| Value | 36 | 185 | 79.75687 | 77 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | Value | 0 | 1 | 79.75687 | 18.72988 | 36 | 66 | 77 | 90 | 185 |
| Id | Time | Value |
|---|---|---|
| 2022484408 | 2016-04-01 07:54:00 | 93 |
| 2022484408 | 2016-04-01 07:54:05 | 91 |
| 2022484408 | 2016-04-01 07:54:10 | 96 |
| 2022484408 | 2016-04-01 07:54:15 | 98 |
| 2022484408 | 2016-04-01 07:54:20 | 100 |
| 2022484408 | 2016-04-01 07:54:25 | 101 |
| 2022484408 | 2016-04-01 07:54:30 | 104 |
| 2022484408 | 2016-04-01 07:54:35 | 105 |
| 2022484408 | 2016-04-01 07:54:45 | 102 |
| 2022484408 | 2016-04-01 07:54:55 | 106 |
File name : hourlyCalories_merged.csv
File size : 0.83 Mo
Number of observations : 24084
Number of columns : 3
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 34 |
| ActivityHour | POSIXct | 0 | 0 | 755 |
| Calories | numeric | 0 | 0 | 444 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityHour | 2016-03-12 | 2016-04-12 10:00:00 | 2016-03-27 05:47:23 | 2016-03-27 04:00:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| Calories | 42 | 933 | 94.26669 | 77 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | Calories | 0 | 1 | 94.26669 | 59.39571 | 42 | 61 | 77 | 104 | 933 |
| Id | ActivityHour | Calories |
|---|---|---|
| 1503960366 | 2016-03-12 00:00:00 | 48 |
| 1503960366 | 2016-03-12 01:00:00 | 48 |
| 1503960366 | 2016-03-12 02:00:00 | 48 |
| 1503960366 | 2016-03-12 03:00:00 | 48 |
| 1503960366 | 2016-03-12 04:00:00 | 48 |
| 1503960366 | 2016-03-12 05:00:00 | 48 |
| 1503960366 | 2016-03-12 06:00:00 | 48 |
| 1503960366 | 2016-03-12 07:00:00 | 48 |
| 1503960366 | 2016-03-12 08:00:00 | 48 |
| 1503960366 | 2016-03-12 09:00:00 | 49 |
File name : hourlyIntensities_merged.csv
File size : 0.93 Mo
Number of observations : 24084
Number of columns : 4
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 34 |
| ActivityHour | POSIXct | 0 | 0 | 755 |
| TotalIntensity | numeric | 0 | 0 | 173 |
| AverageIntensity | numeric | 0 | 0 | 173 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityHour | 2016-03-12 | 2016-04-12 10:00:00 | 2016-03-27 05:47:23 | 2016-03-27 04:00:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| TotalIntensity | 0 | 180 | 10.8263162 | 1.000000 |
| AverageIntensity | 0 | 3 | 0.1804386 | 0.016667 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | TotalIntensity | 0 | 1 | 10.8263162 | 20.3143122 | 0 | 0 | 1.000000 | 14.000000 | 180 |
| numeric | AverageIntensity | 0 | 1 | 0.1804386 | 0.3385719 | 0 | 0 | 0.016667 | 0.233333 | 3 |
| Id | ActivityHour | TotalIntensity | AverageIntensity |
|---|---|---|---|
| 1503960366 | 2016-03-12 00:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 01:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 02:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 03:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 04:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 05:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 06:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 07:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 08:00:00 | 0 | 0.000000 |
| 1503960366 | 2016-03-12 09:00:00 | 1 | 0.016667 |
File name : hourlySteps_merged.csv
File size : 0.83 Mo
Number of observations : 24084
Number of columns : 3
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 34 |
| ActivityHour | POSIXct | 0 | 0 | 755 |
| StepTotal | numeric | 0 | 0 | 2216 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityHour | 2016-03-12 | 2016-04-12 10:00:00 | 2016-03-27 05:47:23 | 2016-03-27 04:00:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| StepTotal | 0 | 10565 | 286.2187 | 10 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | StepTotal | 0 | 1 | 286.2187 | 664.9242 | 0 | 0 | 10 | 289 | 10565 |
| Id | ActivityHour | StepTotal |
|---|---|---|
| 1503960366 | 2016-03-12 00:00:00 | 0 |
| 1503960366 | 2016-03-12 01:00:00 | 0 |
| 1503960366 | 2016-03-12 02:00:00 | 0 |
| 1503960366 | 2016-03-12 03:00:00 | 0 |
| 1503960366 | 2016-03-12 04:00:00 | 0 |
| 1503960366 | 2016-03-12 05:00:00 | 0 |
| 1503960366 | 2016-03-12 06:00:00 | 0 |
| 1503960366 | 2016-03-12 07:00:00 | 0 |
| 1503960366 | 2016-03-12 08:00:00 | 0 |
| 1503960366 | 2016-03-12 09:00:00 | 8 |
File name : minuteCaloriesNarrow_merged.csv
File size : 69.11 Mo
Number of observations : 1445040
Number of columns : 3
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 34 |
| ActivityMinute | POSIXct | 0 | 0 | 45300 |
| Calories | numeric | 0 | 0 | 6345 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityMinute | 2016-03-12 | 2016-04-12 10:59:00 | 2016-03-27 06:16:53 | 2016-03-27 04:54:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| Calories | 0 | 23.01264 | 1.571299 | 1.2176 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | Calories | 0 | 1 | 1.571299 | 1.361955 | 0 | 0.9357 | 1.2176 | 1.406 | 23.01264 |
| Id | ActivityMinute | Calories |
|---|---|---|
| 1503960366 | 2016-03-12 00:00:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:01:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:02:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:03:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:04:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:05:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:06:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:07:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:08:00 | 0.7973 |
| 1503960366 | 2016-03-12 00:09:00 | 0.7973 |
File name : minuteIntensitiesNarrow_merged.csv
File size : 48.18 Mo
Number of observations : 1445040
Number of columns : 3
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 34 |
| ActivityMinute | POSIXct | 0 | 0 | 45300 |
| Intensity | numeric | 0 | 0 | 4 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityMinute | 2016-03-12 | 2016-04-12 10:59:00 | 2016-03-27 06:16:53 | 2016-03-27 04:54:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| Intensity | 0 | 3 | 0.1804386 | 0 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | Intensity | 0 | 1 | 0.1804386 | 0.4942197 | 0 | 0 | 0 | 0 | 3 |
| Id | ActivityMinute | Intensity |
|---|---|---|
| 1503960366 | 2016-03-12 00:00:00 | 0 |
| 1503960366 | 2016-03-12 00:01:00 | 0 |
| 1503960366 | 2016-03-12 00:02:00 | 0 |
| 1503960366 | 2016-03-12 00:03:00 | 0 |
| 1503960366 | 2016-03-12 00:04:00 | 0 |
| 1503960366 | 2016-03-12 00:05:00 | 0 |
| 1503960366 | 2016-03-12 00:06:00 | 0 |
| 1503960366 | 2016-03-12 00:07:00 | 0 |
| 1503960366 | 2016-03-12 00:08:00 | 0 |
| 1503960366 | 2016-03-12 00:09:00 | 0 |
File name : minuteMETsNarrow_merged.csv
File size : 49.56 Mo
Number of observations : 1445040
Number of columns : 3
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 34 |
| ActivityMinute | POSIXct | 0 | 0 | 45300 |
| METs | numeric | 0 | 0 | 141 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityMinute | 2016-03-12 | 2016-04-12 10:59:00 | 2016-03-27 06:16:53 | 2016-03-27 04:54:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| METs | 0 | 189 | 14.23876 | 10 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | METs | 0 | 1 | 14.23876 | 11.54079 | 0 | 10 | 10 | 11 | 189 |
| Id | ActivityMinute | METs |
|---|---|---|
| 1503960366 | 2016-03-12 00:00:00 | 10 |
| 1503960366 | 2016-03-12 00:01:00 | 10 |
| 1503960366 | 2016-03-12 00:02:00 | 10 |
| 1503960366 | 2016-03-12 00:03:00 | 10 |
| 1503960366 | 2016-03-12 00:04:00 | 10 |
| 1503960366 | 2016-03-12 00:05:00 | 10 |
| 1503960366 | 2016-03-12 00:06:00 | 10 |
| 1503960366 | 2016-03-12 00:07:00 | 10 |
| 1503960366 | 2016-03-12 00:08:00 | 10 |
| 1503960366 | 2016-03-12 00:09:00 | 10 |
File name : minuteSleep_merged.csv
File size : 8.88 Mo
Number of observations : 198559
Number of columns : 4
Duplicates detecteds : Oui
Number of duplicated lines : 525
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 23 |
| date | POSIXct | 0 | 0 | 54523 |
| value | numeric | 0 | 0 | 3 |
| logId | numeric | 0 | 0 | 556 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| date | 2016-03-11 21:19:30 | 2016-04-12 08:35:00 | 2016-03-27 12:39:41 | 2016-03-28 03:21:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| value | 1 | 3 | 1.085899 | 1 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | value | 0 | 1 | 1.085899 | 0.3144799 | 1 | 1 | 1 | 1 | 3 |
| Id | date | value | logId |
|---|---|---|---|
| 1503960366 | 2016-03-13 02:39:30 | 1 | 11114919637 |
| 1503960366 | 2016-03-13 02:40:30 | 1 | 11114919637 |
| 1503960366 | 2016-03-13 02:41:30 | 1 | 11114919637 |
| 1503960366 | 2016-03-13 02:42:30 | 1 | 11114919637 |
| 1503960366 | 2016-03-13 02:43:30 | 1 | 11114919637 |
| 1503960366 | 2016-03-13 02:44:30 | 1 | 11114919637 |
| 1503960366 | 2016-03-13 02:45:30 | 2 | 11114919637 |
| 1503960366 | 2016-03-13 02:46:30 | 2 | 11114919637 |
| 1503960366 | 2016-03-13 02:47:30 | 1 | 11114919637 |
| 1503960366 | 2016-03-13 02:48:30 | 1 | 11114919637 |
| Id | date | value | logId |
|---|---|---|---|
| 4319703577 | 2016-04-05 22:50:00 | 3 | 11344563687 |
| 4319703577 | 2016-04-05 22:51:00 | 3 | 11344563687 |
| 4319703577 | 2016-04-05 22:52:00 | 2 | 11344563687 |
| 4319703577 | 2016-04-05 22:53:00 | 2 | 11344563687 |
| 4319703577 | 2016-04-05 22:54:00 | 2 | 11344563687 |
| 4319703577 | 2016-04-05 22:55:00 | 1 | 11344563687 |
| 4319703577 | 2016-04-05 22:56:00 | 1 | 11344563687 |
| 4319703577 | 2016-04-05 22:57:00 | 2 | 11344563687 |
| 4319703577 | 2016-04-05 22:58:00 | 2 | 11344563687 |
| 4319703577 | 2016-04-05 22:59:00 | 1 | 11344563687 |
File name : minuteStepsNarrow_merged.csv
File size : 48.34 Mo
Number of observations : 1445040
Number of columns : 3
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0 | 34 |
| ActivityMinute | POSIXct | 0 | 0 | 45300 |
| Steps | numeric | 0 | 0 | 190 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| ActivityMinute | 2016-03-12 | 2016-04-12 10:59:00 | 2016-03-27 06:16:53 | 2016-03-27 04:54:00 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| Steps | 0 | 204 | 4.770312 | 0 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | Steps | 0 | 1 | 4.770312 | 17.21853 | 0 | 0 | 0 | 0 | 204 |
| Id | ActivityMinute | Steps |
|---|---|---|
| 1503960366 | 2016-03-12 00:00:00 | 0 |
| 1503960366 | 2016-03-12 00:01:00 | 0 |
| 1503960366 | 2016-03-12 00:02:00 | 0 |
| 1503960366 | 2016-03-12 00:03:00 | 0 |
| 1503960366 | 2016-03-12 00:04:00 | 0 |
| 1503960366 | 2016-03-12 00:05:00 | 0 |
| 1503960366 | 2016-03-12 00:06:00 | 0 |
| 1503960366 | 2016-03-12 00:07:00 | 0 |
| 1503960366 | 2016-03-12 00:08:00 | 0 |
| 1503960366 | 2016-03-12 00:09:00 | 0 |
File name : weightLogInfo_merged.csv
File size : 0.00 Mo
Number of observations : 33
Number of columns : 8
Duplicates detecteds : Non
Number of duplicated lines : 0
| column | type | n_missing | pct_missing | n_distinct |
|---|---|---|---|---|
| Id | numeric | 0 | 0.0000000 | 11 |
| Date | POSIXct | 0 | 0.0000000 | 24 |
| WeightKg | numeric | 0 | 0.0000000 | 27 |
| WeightPounds | numeric | 0 | 0.0000000 | 27 |
| Fat | numeric | 31 | 0.9393939 | 3 |
| BMI | numeric | 0 | 0.0000000 | 27 |
| IsManualReport | logical | 0 | 0.0000000 | 2 |
| LogId | numeric | 0 | 0.0000000 | 24 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| Date | 2016-03-30 23:59:59 | 2016-04-12 23:59:59 | 2016-04-06 16:15:32 | 2016-04-06 23:59:59 |
| variable | min | max | mean | median |
|---|---|---|---|---|
| WeightKg | 53.3000 | 129.6000 | 73.44242 | 62.5000 |
| WeightPounds | 117.5064 | 285.7191 | 161.91283 | 137.7889 |
| Fat | 10.0000 | 22.0000 | 16.00000 | 16.0000 |
| BMI | 21.4500 | 46.1700 | 25.73091 | 24.3900 |
| skim_type | skim_variable | n_missing | complete_rate | numeric.mean | numeric.sd | numeric.p0 | numeric.p25 | numeric.p50 | numeric.p75 | numeric.p100 |
|---|---|---|---|---|---|---|---|---|---|---|
| numeric | WeightKg | 0 | 1.0000000 | 73.44242 | 16.531127 | 53.3000 | 61.7000 | 62.5000 | 85.8000 | 129.6000 |
| numeric | WeightPounds | 0 | 1.0000000 | 161.91283 | 36.444896 | 117.5064 | 136.0252 | 137.7889 | 189.1566 | 285.7191 |
| numeric | Fat | 31 | 0.0606061 | 16.00000 | 8.485281 | 10.0000 | 13.0000 | 16.0000 | 19.0000 | 22.0000 |
| numeric | BMI | 0 | 1.0000000 | 25.73091 | 4.330387 | 21.4500 | 24.1000 | 24.3900 | 25.7600 | 46.1700 |
| Id | Date | WeightKg | WeightPounds | Fat | BMI | IsManualReport | LogId |
|---|---|---|---|---|---|---|---|
| 1503960366 | 2016-04-05 23:59:59 | 53.3 | 117.5064 | 22 | 22.97 | TRUE | 1459900799000 |
| 1927972279 | 2016-04-10 18:33:26 | 129.6 | 285.7191 | NA | 46.17 | FALSE | 1460313206000 |
| 2347167796 | 2016-04-03 23:59:59 | 63.4 | 139.7731 | 10 | 24.77 | TRUE | 1459727999000 |
| 2873212765 | 2016-04-06 23:59:59 | 56.7 | 125.0021 | NA | 21.45 | TRUE | 1459987199000 |
| 2873212765 | 2016-04-07 23:59:59 | 57.2 | 126.1044 | NA | 21.65 | TRUE | 1460073599000 |
| 2891001357 | 2016-04-05 23:59:59 | 88.4 | 194.8886 | NA | 25.03 | TRUE | 1459900799000 |
| 4445114986 | 2016-03-30 23:59:59 | 92.4 | 203.7071 | NA | 35.01 | TRUE | 1459382399000 |
| 4558609924 | 2016-04-08 23:59:59 | 69.4 | 153.0008 | NA | 27.14 | TRUE | 1460159999000 |
| 4702921684 | 2016-04-04 23:59:59 | 99.7 | 219.8009 | NA | 26.11 | TRUE | 1459814399000 |
| 6962181067 | 2016-03-30 23:59:59 | 61.5 | 135.5843 | NA | 24.03 | TRUE | 1459382399000 |