Overview

Dataset statistics

Number of variables14
Number of observations25000
Missing cells45602
Missing cells (%)13.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 MiB
Average record size in memory112.0 B

Variable types

NUM10
CAT2
BOOL2

Warnings

first_order_day has a high cardinality: 412 distinct values High cardinality
cnt_orders_60d_fwd is highly correlated with cnt_orders_30d_fwd and 1 other fieldsHigh correlation
cnt_orders_30d_fwd is highly correlated with cnt_orders_60d_fwdHigh correlation
cnt_orders_90d_fwd is highly correlated with cnt_orders_60d_fwd and 1 other fieldsHigh correlation
cnt_orders_6m_fwd is highly correlated with cnt_orders_90d_fwdHigh correlation
voucher_amount has 22801 (91.2%) missing values Missing
member_get_member_viral has 22801 (91.2%) missing values Missing
user_id has unique values Unique

Reproduction

Analysis started2020-10-06 14:14:03.172326
Analysis finished2020-10-06 14:14:53.666148
Duration50.49 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

user_id
Real number (ℝ≥0)

UNIQUE

Distinct25000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2135903.17
Minimum1988
Maximum5220624
Zeros0
Zeros (%)0.0%
Memory size195.3 KiB

Quantile statistics

Minimum1988
5-th percentile335317.3
Q1828961.5
median1865581
Q33280115.5
95-th percentile4753594.7
Maximum5220624
Range5218636
Interquartile range (IQR)2451154

Descriptive statistics

Standard deviation1443230.416
Coefficient of variation (CV)0.6757003019
Kurtosis-0.9984336108
Mean2135903.17
Median Absolute Deviation (MAD)1159394
Skewness0.4726336348
Sum5.339757924e+10
Variance2.082914035e+12
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
5519501< 0.1%
 
5546541< 0.1%
 
20916881< 0.1%
 
7358781< 0.1%
 
10472101< 0.1%
 
6979461< 0.1%
 
12992661< 0.1%
 
28084941< 0.1%
 
48096841< 0.1%
 
3713761< 0.1%
 
Other values (24990)24990> 99.9%
 
ValueCountFrequency (%) 
19881< 0.1%
 
81721< 0.1%
 
97461< 0.1%
 
355801< 0.1%
 
375141< 0.1%
 
ValueCountFrequency (%) 
52206241< 0.1%
 
52201421< 0.1%
 
52198241< 0.1%
 
52194521< 0.1%
 
52193521< 0.1%
 

first_order_day
Categorical

HIGH CARDINALITY

Distinct412
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size195.3 KiB
05/03/2016
 
162
27/02/2016
 
150
28/02/2016
 
143
14/02/2016
 
142
06/03/2016
 
139
Other values (407)
24264 
ValueCountFrequency (%) 
05/03/20161620.6%
 
27/02/20161500.6%
 
28/02/20161430.6%
 
14/02/20161420.6%
 
06/03/20161390.6%
 
20/02/20161380.6%
 
22/05/20161300.5%
 
19/03/20161290.5%
 
18/03/20161280.5%
 
06/02/20161270.5%
 
Other values (402)2361294.4%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%