€17.03 – €21.09
1. Data Type
- Type: {Numerical / Categorical / Textual / Date-Time}
- Example: If numerical, specify continuous or discrete.
2. Summary Statistics
For Numerical Data:
- Count: Total number of non-missing values in the column.
- Mean: Average value of the column.
- Median: Middle value when sorted.
- Standard Deviation: Measure of data spread.
- Minimum and Maximum Values: Range of data.
- Quartiles:
- 25th Percentile (Q1): Lower quartile.
- 50th Percentile (Q2): Median.
- 75th Percentile (Q3): Upper quartile.
For Categorical Data:
- Count: Total number of non-missing values.
- Unique Values: Number of distinct categories.
- Mode: Most frequently occurring value.
- Frequency Distribution: Breakdown of occurrences per category.
For Date-Time Data:
- Count: Total number of non-missing values.
- Earliest Date: Minimum value in the column.
- Latest Date: Maximum value in the column.
- Range: Time span covered by the data.
3. Missing Values
- Count of Missing Values: Total number of missing entries.
- Percentage of Missing Values: Proportion of missing data compared to total records.
4. Outlier Detection (Optional for Numerical Data)
- Z-Score Method: Values with Z-scores greater than 3 or less than -3.
- Interquartile Range (IQR):
- Outliers are defined as values < Q1 – 1.5 * IQR or > Q3 + 1.5 * IQR.