Overview

Dataset statistics

Number of variables: 25
Number of observations: 30000
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 5.7 MiB
Average record size in memory: 200.0 B
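
The memory figures above are consistent with every cell being stored as an 8-byte number. The short check below is only a sketch under that assumption: the report does not state the column dtypes, and `BYTES_PER_VALUE` is a hypothetical constant introduced for illustration.

```python
# Sanity check of the reported memory figures.
# Assumption: each cell occupies 8 bytes (e.g. int64 or float64);
# the report itself does not state the dtypes.
N_VARIABLES = 25
N_OBSERVATIONS = 30_000
BYTES_PER_VALUE = 8  # assumed, not taken from the report

record_bytes = N_VARIABLES * BYTES_PER_VALUE
total_mib = N_VARIABLES * N_OBSERVATIONS * BYTES_PER_VALUE / 2**20

print(f"average record size: {record_bytes} B")
print(f"total size: {total_mib:.1f} MiB")
```

Under this assumption both numbers agree with the reported 200.0 B per record and 5.7 MiB in total.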

Variable types

Numeric: 25

Alerts

PAY_AMT2 is highly skewed (γ1 = 30.45381745) Skewed
ID has unique values Unique
PAY_0 has 14737 (49.1%) zeros Zeros
PAY_2 has 15730 (52.4%) zeros Zeros
PAY_3 has 15764 (52.5%) zeros Zeros
PAY_4 has 16455 (54.9%) zeros Zeros
PAY_5 has 16947 (56.5%) zeros Zeros
PAY_6 has 16286 (54.3%) zeros Zeros
BILL_AMT1 has 2008 (6.7%) zeros Zeros
BILL_AMT2 has 2506 (8.4%) zeros Zeros
BILL_AMT3 has 2870 (9.6%) zeros Zeros
BILL_AMT4 has 3195 (10.7%) zeros Zeros
BILL_AMT5 has 3506 (11.7%) zeros Zeros
BILL_AMT6 has 4020 (13.4%) zeros Zeros
PAY_AMT1 has 5249 (17.5%) zeros Zeros
PAY_AMT2 has 5396 (18.0%) zeros Zeros
PAY_AMT3 has 5968 (19.9%) zeros Zeros
PAY_AMT4 has 6408 (21.4%) zeros Zeros
PAY_AMT5 has 6703 (22.3%) zeros Zeros
PAY_AMT6 has 7173 (23.9%) zeros Zeros
default.payment.next.month has 23364 (77.9%) zeros Zeros
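
The percentages in the Zeros alerts are the zero count divided by the number of observations. A minimal sketch that re-derives three of them from the counts listed above (the helper name `zero_share` is hypothetical):

```python
def zero_share(zero_count: int, n_rows: int) -> float:
    """Return the percentage of rows equal to zero, rounded to one decimal."""
    return round(100 * zero_count / n_rows, 1)


N_ROWS = 30_000  # number of observations from the overview

# Zero counts copied from the alerts list.
zero_counts = {
    "PAY_0": 14737,
    "BILL_AMT1": 2008,
    "default.payment.next.month": 23364,
}

for name, count in zero_counts.items():
    print(f"{name}: {zero_share(count, N_ROWS)}% zeros")
```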

Reproduction

Analysis started: 2022-06-21 11:17:38.406427
Analysis finished: 2022-06-21 11:17:38.562026
Duration: 0.16 seconds
Software version: pandas-profiling v3.2.0
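
The reported duration can be checked from the two timestamps. A small sketch using only the standard library:

```python
from datetime import datetime

# Timestamp layout used in the Reproduction section.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2022-06-21 11:17:38.406427", FMT)
finished = datetime.strptime("2022-06-21 11:17:38.562026", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

The difference is 0.155599 s, which rounds to the reported 0.16 seconds.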

Variables

ID
Real number (ℝ≥0)

UNIQUE

Distinct: 30000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15000.5
Minimum: 1
Maximum: 30000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 234.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1500.95
Q1: 7500.75
median: 15000.5
Q3: 22500.25
95-th percentile: 28500.05
Maximum: 30000
Range: 29999
Interquartile range (IQR): 14999.5
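
The fractional quantiles for ID (for example 1500.95 for the 5-th percentile) are what linear interpolation between neighbouring ranks produces when ID takes the values 1 to 30000. The sketch below assumes that interpolation rule, which is the default in pandas; `quantile_linear` is a hypothetical helper, not part of the report.

```python
import math


def quantile_linear(sorted_values, q):
    """Quantile with linear interpolation between the two closest ranks."""
    pos = q * (len(sorted_values) - 1)          # fractional rank, 0-based
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Assumption: ID is the sequence 1..30000 (Minimum 1, Maximum 30000,
# all values distinct, strictly increasing).
ids = list(range(1, 30_001))

p05 = quantile_linear(ids, 0.05)
q1 = quantile_linear(ids, 0.25)
q3 = quantile_linear(ids, 0.75)
iqr = q3 - q1
print(p05, q1, q3, iqr)
```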

Descriptive statistics

Standard deviation: 8660.398374
Coefficient of variation (CV): 0.5773406469
Kurtosis: -1.2
Mean: 15000.5
Median Absolute Deviation (MAD): 7500
Skewness: 0
Sum: 450015000
Variance: 75002500
Monotonicity: Strictly increasing
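
Because ID behaves like the sequence 1..n with n = 30000, its descriptive statistics can be re-derived directly. The sketch below assumes exactly that sequence and the sample variance (division by n - 1), which reproduces the reported value of 75002500 = n(n + 1)/12.

```python
import math

# Assumption: ID is the sequence 1..n.
n = 30_000
ids = range(1, n + 1)

total = sum(ids)                  # equals n * (n + 1) / 2
mean = total / n
# Sample variance: squared deviations divided by n - 1.
variance = sum((x - mean) ** 2 for x in ids) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                   # coefficient of variation

print(total, mean, variance, round(std, 6), round(cv, 10))
```

The population variance (division by n) would give (n² - 1)/12 instead, which does not match the report, so the n - 1 convention appears to be the one in use here.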
Histogram with fixed size bins (bins=50)
Common values
Value                   Count   Frequency (%)
1                       1       < 0.1%
19997                   1       < 0.1%
20009                   1       < 0.1%
20008                   1       < 0.1%
20007                   1       < 0.1%
20006                   1       < 0.1%
20005                   1       < 0.1%
20004                   1       < 0.1%
20003                   1       < 0.1%
20002                   1       < 0.1%
Other values (29990)    29990   > 99.9%

Minimum 5 values
Value                   Count   Frequency (%)
1                       1       < 0.1%
2                       1       < 0.1%
3                       1       < 0.1%
4                       1       < 0.1%
5                       1       < 0.1%

Maximum 5 values
Value                   Count   Frequency (%)
30000                   1       < 0.1%
29999                   1       < 0.1%
29998                   1       < 0.1%
29997                   1       < 0.1%
29996                   1       < 0.1%
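
The histograms in this report use fixed-size bins (bins=50 for ID). The plot itself is not reproduced in the text, so the sketch below only illustrates the binning idea under the assumption that the bins are equal-width intervals spanning the minimum to the maximum; `fixed_width_histogram` is a hypothetical helper.

```python
def fixed_width_histogram(values, bins):
    """Count values in `bins` equal-width intervals spanning [min, max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Degenerate case: every value is identical.
        return [len(values)] + [0] * (bins - 1)
    counts = [0] * bins
    for x in values:
        # Scale x to a bin index; the maximum is clipped into the last bin.
        idx = min(int((x - lo) * bins / (hi - lo)), bins - 1)
        counts[idx] += 1
    return counts


# Assumption: ID is the sequence 1..30000.
counts = fixed_width_histogram(range(1, 30_001), bins=50)
print(len(counts), min(counts), max(counts))
```

For an evenly spaced identifier every bin receives the same number of rows, so a flat histogram is the expected shape for ID.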

LIMIT_BAL
Real number (ℝ≥0)

Distinct81
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean167484.3227
Minimum10000
Maximum1000000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum10000
5-th percentile20000
Q150000
median140000
Q3240000
95-th percentile430000
Maximum1000000
Range990000
Interquartile range (IQR)190000

Descriptive statistics

Standard deviation129747.6616
Coefficient of variation (CV)0.7746854124
Kurtosis0.5362628964
Mean167484.3227
Median Absolute Deviation (MAD)90000
Skewness0.9928669605
Sum5024529680
Variance1.683445568 × 10^10
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Common values
Value                   Count   Frequency (%)
50000                   3365    11.2%
20000                   1976    6.6%
30000                   1610    5.4%
80000                   1567    5.2%
200000                  1528    5.1%
150000                  1110    3.7%
100000                  1048    3.5%
180000                  995     3.3%
360000                  881     2.9%
60000                   825     2.8%
Other values (71)       15095   50.3%

Minimum 5 values
Value                   Count   Frequency (%)
10000                   493     1.6%
16000                   2       < 0.1%
20000                   1976    6.6%
30000                   1610    5.4%
40000                   230     0.8%

Maximum 5 values
Value                   Count   Frequency (%)
1000000                 1       < 0.1%
800000                  2       < 0.1%
780000                  2       < 0.1%
760000                  1       < 0.1%
750000                  4       < 0.1%
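
The common-values table lists the ten most frequent values and aggregates the rest into an "Other values" row. A minimal sketch of that summary follows; `common_values` is a hypothetical helper, not the profiler's own code. The second part checks the LIMIT_BAL row arithmetic from the counts reported above.

```python
from collections import Counter


def common_values(values, k=10):
    """Return the k most frequent (value, count) pairs plus an 'other' summary.

    The summary is (number of remaining distinct values, their total count).
    """
    counts = Counter(values)
    top = counts.most_common(k)
    shown = sum(count for _, count in top)
    other = (len(counts) - len(top), len(values) - shown)
    return top, other


# Check of the LIMIT_BAL "Other values" row using the reported counts.
top10_counts = [3365, 1976, 1610, 1567, 1528, 1110, 1048, 995, 881, 825]
other_count = 30_000 - sum(top10_counts)   # rows outside the top ten
other_distinct = 81 - len(top10_counts)    # distinct values outside the top ten
print(other_distinct, other_count)
```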

SEX
Real number (ℝ≥0)

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.603733333
Minimum1
Maximum2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q32
95-th percentile2
Maximum2
Range1
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.4891291961
Coefficient of variation (CV)0.3049940947
Kurtosis-1.820189771
Mean1.603733333
Median Absolute Deviation (MAD)0
Skewness-0.4241834271
Sum48112
Variance0.2392473705
MonotonicityNot monotonic
Histogram with fixed size bins (bins=2)
Common values
Value                   Count   Frequency (%)
2                       18112   60.4%
1                       11888   39.6%

Minimum 5 values
Value                   Count   Frequency (%)
1                       11888   39.6%
2                       18112   60.4%

Maximum 5 values
Value                   Count   Frequency (%)
2                       18112   60.4%
1                       11888   39.6%

EDUCATION
Real number (ℝ≥0)

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.853133333
Minimum0
Maximum6
Zeros14
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile3
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.7903486597
Coefficient of variation (CV)0.426493143
Kurtosis2.078621603
Mean1.853133333
Median Absolute Deviation (MAD)1
Skewness0.9709720486
Sum55594
Variance0.6246510039
MonotonicityNot monotonic
Histogram with fixed size bins (bins=7)
Common values
Value                   Count   Frequency (%)
2                       14030   46.8%
1                       10585   35.3%
3                       4917    16.4%
5                       280     0.9%
4                       123     0.4%
6                       51      0.2%
0                       14      < 0.1%

Minimum 5 values
Value                   Count   Frequency (%)
0                       14      < 0.1%
1                       10585   35.3%
2                       14030   46.8%
3                       4917    16.4%
4                       123     0.4%

Maximum 5 values
Value                   Count   Frequency (%)
6                       51      0.2%
5                       280     0.9%
4                       123     0.4%
3                       4917    16.4%
2                       14030   46.8%

MARRIAGE
Real number (ℝ≥0)

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.551866667
Minimum0
Maximum3
Zeros54
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile2
Maximum3
Range3
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5219696006
Coefficient of variation (CV)0.336349515
Kurtosis-1.363367756
Mean1.551866667
Median Absolute Deviation (MAD)0
Skewness-0.01874168101
Sum46556
Variance0.272452264
MonotonicityNot monotonic
Histogram with fixed size bins (bins=4)
Common values
Value                   Count   Frequency (%)
2                       15964   53.2%
1                       13659   45.5%
3                       323     1.1%
0                       54      0.2%

Minimum 5 values
Value                   Count   Frequency (%)
0                       54      0.2%
1                       13659   45.5%
2                       15964   53.2%
3                       323     1.1%

Maximum 5 values
Value                   Count   Frequency (%)
3                       323     1.1%
2                       15964   53.2%
1                       13659   45.5%
0                       54      0.2%

AGE
Real number (ℝ≥0)

Distinct56
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.4855
Minimum21
Maximum79
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum21
5-th percentile23
Q128
median34
Q341
95-th percentile53
Maximum79
Range58
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.217904068
Coefficient of variation (CV)0.2597653709
Kurtosis0.04430337824
Mean35.4855
Median Absolute Deviation (MAD)6
Skewness0.7322458688
Sum1064565
Variance84.96975541
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Common values
Value                   Count   Frequency (%)
29                      1605    5.3%
27                      1477    4.9%
28                      1409    4.7%
30                      1395    4.7%
26                      1256    4.2%
31                      1217    4.1%
25                      1186    4.0%
34                      1162    3.9%
32                      1158    3.9%
33                      1146    3.8%
Other values (46)       16989   56.6%

Minimum 5 values
Value                   Count   Frequency (%)
21                      67      0.2%
22                      560     1.9%
23                      931     3.1%
24                      1127    3.8%
25                      1186    4.0%

Maximum 5 values
Value                   Count   Frequency (%)
79                      1       < 0.1%
75                      3       < 0.1%
74                      1       < 0.1%
73                      4       < 0.1%
72                      3       < 0.1%

PAY_0
Real number (ℝ)

ZEROS

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.0167
Minimum-2
Maximum8
Zeros14737
Zeros (%)49.1%
Negative8445
Negative (%)28.1%
Memory size234.5 KiB

Quantile statistics

Minimum-2
5-th percentile-2
Q1-1
median0
Q30
95-th percentile2
Maximum8
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.123801528
Coefficient of variation (CV)-67.29350467
Kurtosis2.720715042
Mean-0.0167
Median Absolute Deviation (MAD)1
Skewness0.7319749269
Sum-501
Variance1.262929874
MonotonicityNot monotonic
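
The large negative coefficient of variation for PAY_0 (-67.29) follows from dividing the standard deviation by a mean that is very close to zero, so it says little about spread. A short sketch of the arithmetic using the reported figures (`coefficient_of_variation` is a hypothetical helper):

```python
def coefficient_of_variation(std: float, mean: float) -> float:
    """Return std / mean; unstable (and sign-flipping) when mean is near zero."""
    if mean == 0:
        raise ZeroDivisionError("CV is undefined for a zero mean")
    return std / mean


# Reported values for PAY_0: Sum = -501 over 30000 rows, std = 1.123801528.
pay0_mean = -501 / 30_000
pay0_cv = coefficient_of_variation(1.123801528, pay0_mean)
print(round(pay0_mean, 4), round(pay0_cv, 4))
```

For columns like the PAY_* status codes, whose values straddle zero, the standard deviation or the IQR is usually the more informative measure of spread.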
Histogram with fixed size bins (bins=11)
Common values
Value                   Count   Frequency (%)
0                       14737   49.1%
-1                      5686    19.0%
1                       3688    12.3%
-2                      2759    9.2%
2                       2667    8.9%
3                       322     1.1%
4                       76      0.3%
5                       26      0.1%
8                       19      0.1%
6                       11      < 0.1%

Minimum 5 values
Value                   Count   Frequency (%)
-2                      2759    9.2%
-1                      5686    19.0%
0                       14737   49.1%
1                       3688    12.3%
2                       2667    8.9%

Maximum 5 values
Value                   Count   Frequency (%)
8                       19      0.1%
7                       9       < 0.1%
6                       11      < 0.1%
5                       26      0.1%
4                       76      0.3%

PAY_2
Real number (ℝ)

ZEROS

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.1337666667
Minimum-2
Maximum8
Zeros15730
Zeros (%)52.4%
Negative9832
Negative (%)32.8%
Memory size234.5 KiB

Quantile statistics

Minimum-2
5-th percentile-2
Q1-1
median0
Q30
95-th percentile2
Maximum8
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.197185973
Coefficient of variation (CV)-8.949807922
Kurtosis1.57041773
Mean-0.1337666667
Median Absolute Deviation (MAD)0
Skewness0.7905650222
Sum-4013
Variance1.433254254
MonotonicityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
015730
52.4%
-16050
 
20.2%
23927
 
13.1%
-23782
 
12.6%
3326
 
1.1%
499
 
0.3%
128
 
0.1%
525
 
0.1%
720
 
0.1%
612
 
< 0.1%
ValueCountFrequency (%)
-23782
 
12.6%
-16050
 
20.2%
015730
52.4%
128
 
0.1%
23927
 
13.1%
ValueCountFrequency (%)
81
 
< 0.1%
720
 
0.1%
612
 
< 0.1%
525
 
0.1%
499
0.3%

PAY_3
Real number (ℝ)

ZEROS

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.1662
Minimum-2
Maximum8
Zeros15764
Zeros (%)52.5%
Negative10023
Negative (%)33.4%
Memory size234.5 KiB

Quantile statistics

Minimum-2
5-th percentile-2
Q1-1
median0
Q30
95-th percentile2
Maximum8
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.196867568
Coefficient of variation (CV)-7.201369245
Kurtosis2.084435875
Mean-0.1662
Median Absolute Deviation (MAD)0
Skewness0.8406818269
Sum-4986
Variance1.432491976
MonotonicityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
015764
52.5%
-15938
 
19.8%
-24085
 
13.6%
23819
 
12.7%
3240
 
0.8%
476
 
0.3%
727
 
0.1%
623
 
0.1%
521
 
0.1%
14
 
< 0.1%
ValueCountFrequency (%)
-24085
 
13.6%
-15938
 
19.8%
015764
52.5%
14
 
< 0.1%
23819
 
12.7%
ValueCountFrequency (%)
83
 
< 0.1%
727
 
0.1%
623
 
0.1%
521
 
0.1%
476
0.3%

PAY_4
Real number (ℝ)

ZEROS

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.2206666667
Minimum-2
Maximum8
Zeros16455
Zeros (%)54.9%
Negative10035
Negative (%)33.5%
Memory size234.5 KiB

Quantile statistics

Minimum-2
5-th percentile-2
Q1-1
median0
Q30
95-th percentile2
Maximum8
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.169138622
Coefficient of variation (CV)-5.29821128
Kurtosis3.496983496
Mean-0.2206666667
Median Absolute Deviation (MAD)0
Skewness0.9996294133
Sum-6620
Variance1.366885118
MonotonicityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
016455
54.9%
-15687
 
19.0%
-24348
 
14.5%
23159
 
10.5%
3180
 
0.6%
469
 
0.2%
758
 
0.2%
535
 
0.1%
65
 
< 0.1%
12
 
< 0.1%
ValueCountFrequency (%)
-24348
 
14.5%
-15687
 
19.0%
016455
54.9%
12
 
< 0.1%
23159
 
10.5%
ValueCountFrequency (%)
82
 
< 0.1%
758
0.2%
65
 
< 0.1%
535
0.1%
469
0.2%

PAY_5
Real number (ℝ)

ZEROS

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.2662
Minimum-2
Maximum8
Zeros16947
Zeros (%)56.5%
Negative10085
Negative (%)33.6%
Memory size234.5 KiB

Quantile statistics

Minimum-2
5-th percentile-2
Q1-1
median0
Q30
95-th percentile2
Maximum8
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.133187406
Coefficient of variation (CV)-4.256902352
Kurtosis3.989748144
Mean-0.2662
Median Absolute Deviation (MAD)0
Skewness1.008197025
Sum-7986
Variance1.284113697
MonotonicityNot monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
016947
56.5%
-15539
 
18.5%
-24546
 
15.2%
22626
 
8.8%
3178
 
0.6%
484
 
0.3%
758
 
0.2%
517
 
0.1%
64
 
< 0.1%
81
 
< 0.1%
ValueCountFrequency (%)
-24546
 
15.2%
-15539
 
18.5%
016947
56.5%
22626
 
8.8%
3178
 
0.6%
ValueCountFrequency (%)
81
 
< 0.1%
758
0.2%
64
 
< 0.1%
517
 
0.1%
484
0.3%

PAY_6
Real number (ℝ)

ZEROS

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.2911
Minimum-2
Maximum8
Zeros16286
Zeros (%)54.3%
Negative10635
Negative (%)35.4%
Memory size234.5 KiB

Quantile statistics

Minimum-2
5-th percentile-2
Q1-1
median0
Q30
95-th percentile2
Maximum8
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.149987626
Coefficient of variation (CV)-3.950489954
Kurtosis3.42653413
Mean-0.2911
Median Absolute Deviation (MAD)0
Skewness0.9480293916
Sum-8733
Variance1.322471539
MonotonicityNot monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
016286
54.3%
-15740
 
19.1%
-24895
 
16.3%
22766
 
9.2%
3184
 
0.6%
449
 
0.2%
746
 
0.2%
619
 
0.1%
513
 
< 0.1%
82
 
< 0.1%
ValueCountFrequency (%)
-24895
 
16.3%
-15740
 
19.1%
016286
54.3%
22766
 
9.2%
3184
 
0.6%
ValueCountFrequency (%)
82
 
< 0.1%
746
0.2%
619
 
0.1%
513
 
< 0.1%
449
0.2%

BILL_AMT1
Real number (ℝ)

ZEROS

Distinct22723
Distinct (%)75.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51223.3309
Minimum-165580
Maximum964511
Zeros2008
Zeros (%)6.7%
Negative590
Negative (%)2.0%
Memory size234.5 KiB

Quantile statistics

Minimum-165580
5-th percentile0
Q13558.75
median22381.5
Q367091
95-th percentile201203.05
Maximum964511
Range1130091
Interquartile range (IQR)63532.25

Descriptive statistics

Standard deviation73635.86058
Coefficient of variation (CV)1.437545339
Kurtosis9.806289341
Mean51223.3309
Median Absolute Deviation (MAD)21800.5
Skewness2.663861022
Sum1536699927
Variance5422239963
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
02008
 
6.7%
390244
 
0.8%
78076
 
0.3%
32672
 
0.2%
31663
 
0.2%
250059
 
0.2%
39649
 
0.2%
240039
 
0.1%
41629
 
0.1%
50025
 
0.1%
Other values (22713)27336
91.1%
ValueCountFrequency (%)
-1655801
< 0.1%
-1549731
< 0.1%
-153081
< 0.1%
-143861
< 0.1%
-115451
< 0.1%
ValueCountFrequency (%)
9645111
< 0.1%
7468141
< 0.1%
6530621
< 0.1%
6304581
< 0.1%
6266481
< 0.1%

BILL_AMT2
Real number (ℝ)

ZEROS

Distinct22346
Distinct (%)74.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49179.07517
Minimum-69777
Maximum983931
Zeros2506
Zeros (%)8.4%
Negative669
Negative (%)2.2%
Memory size234.5 KiB

Quantile statistics

Minimum-69777
5-th percentile0
Q12984.75
median21200
Q364006.25
95-th percentile194792.2
Maximum983931
Range1053708
Interquartile range (IQR)61021.5

Descriptive statistics

Standard deviation71173.76878
Coefficient of variation (CV)1.447236829
Kurtosis10.30294592
Mean49179.07517
Median Absolute Deviation (MAD)20810
Skewness2.705220853
Sum1475372255
Variance5065705363
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
02506
 
8.4%
390231
 
0.8%
32675
 
0.2%
78075
 
0.2%
31672
 
0.2%
39651
 
0.2%
250051
 
0.2%
240042
 
0.1%
-20029
 
0.1%
41628
 
0.1%
Other values (22336)26840
89.5%
ValueCountFrequency (%)
-697771
< 0.1%
-675261
< 0.1%
-333501
< 0.1%
-300001
< 0.1%
-262141
< 0.1%
ValueCountFrequency (%)
9839311
< 0.1%
7439701
< 0.1%
6715631
< 0.1%
6467701
< 0.1%
6244751
< 0.1%

BILL_AMT3
Real number (ℝ)

ZEROS

Distinct22026
Distinct (%)73.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47013.1548
Minimum-157264
Maximum1664089
Zeros2870
Zeros (%)9.6%
Negative655
Negative (%)2.2%
Memory size234.5 KiB

Quantile statistics

Minimum-157264
5-th percentile0
Q12666.25
median20088.5
Q360164.75
95-th percentile187821.05
Maximum1664089
Range1821353
Interquartile range (IQR)57498.5

Descriptive statistics

Standard deviation69349.38743
Coefficient of variation (CV)1.475106015
Kurtosis19.78325514
Mean47013.1548
Median Absolute Deviation (MAD)19708.5
Skewness3.087830046
Sum1410394644
Variance4809337537
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
02870
 
9.6%
390275
 
0.9%
78074
 
0.2%
32663
 
0.2%
31662
 
0.2%
39648
 
0.2%
250040
 
0.1%
240039
 
0.1%
41629
 
0.1%
20027
 
0.1%
Other values (22016)26473
88.2%
ValueCountFrequency (%)
-1572641
< 0.1%
-615061
< 0.1%
-461271
< 0.1%
-340411
< 0.1%
-254431
< 0.1%
ValueCountFrequency (%)
16640891
< 0.1%
8550861
< 0.1%
6931311
< 0.1%
6896431
< 0.1%
6896271
< 0.1%

BILL_AMT4
Real number (ℝ)

ZEROS

Distinct21548
Distinct (%)71.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43262.94897
Minimum-170000
Maximum891586
Zeros3195
Zeros (%)10.7%
Negative675
Negative (%)2.2%
Memory size234.5 KiB

Quantile statistics

Minimum-170000
5-th percentile0
Q12326.75
median19052
Q354506
95-th percentile174333.35
Maximum891586
Range1061586
Interquartile range (IQR)52179.25

Descriptive statistics

Standard deviation64332.85613
Coefficient of variation (CV)1.487019671
Kurtosis11.30932483
Mean43262.94897
Median Absolute Deviation (MAD)18656
Skewness2.821965291
Sum1297888469
Variance4138716378
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
03195
 
10.7%
390246
 
0.8%
780101
 
0.3%
31668
 
0.2%
32662
 
0.2%
39644
 
0.1%
240039
 
0.1%
15039
 
0.1%
250034
 
0.1%
41633
 
0.1%
Other values (21538)26139
87.1%
ValueCountFrequency (%)
-1700001
< 0.1%
-813341
< 0.1%
-651671
< 0.1%
-506161
< 0.1%
-466271
< 0.1%
ValueCountFrequency (%)
8915861
< 0.1%
7068641
< 0.1%
6286991
< 0.1%
6168361
< 0.1%
5728051
< 0.1%

BILL_AMT5
Real number (ℝ)

ZEROS

Distinct21010
Distinct (%)70.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40311.40097
Minimum-81334
Maximum927171
Zeros3506
Zeros (%)11.7%
Negative655
Negative (%)2.2%
Memory size234.5 KiB

Quantile statistics

Minimum-81334
5-th percentile0
Q11763
median18104.5
Q350190.5
95-th percentile165794.3
Maximum927171
Range1008505
Interquartile range (IQR)48427.5

Descriptive statistics

Standard deviation60797.15577
Coefficient of variation (CV)1.508187617
Kurtosis12.30588129
Mean40311.40097
Median Absolute Deviation (MAD)17688.5
Skewness2.876379867
Sum1209342029
Variance3696294150
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
03506
 
11.7%
390235
 
0.8%
78094
 
0.3%
31679
 
0.3%
32662
 
0.2%
15058
 
0.2%
39647
 
0.2%
240039
 
0.1%
250037
 
0.1%
41636
 
0.1%
Other values (21000)25807
86.0%
ValueCountFrequency (%)
-813341
< 0.1%
-613721
< 0.1%
-530071
< 0.1%
-466271
< 0.1%
-375941
< 0.1%
ValueCountFrequency (%)
9271711
< 0.1%
8235401
< 0.1%
5870671
< 0.1%
5517021
< 0.1%
5478801
< 0.1%

BILL_AMT6
Real number (ℝ)

ZEROS

Distinct20604
Distinct (%)68.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38871.7604
Minimum-339603
Maximum961664
Zeros4020
Zeros (%)13.4%
Negative688
Negative (%)2.3%
Memory size234.5 KiB

Quantile statistics

Minimum-339603
5-th percentile0
Q11256
median17071
Q349198.25
95-th percentile161912
Maximum961664
Range1301267
Interquartile range (IQR)47942.25

Descriptive statistics

Standard deviation59554.10754
Coefficient of variation (CV)1.53206613
Kurtosis12.27070529
Mean38871.7604
Median Absolute Deviation (MAD)16755
Skewness2.846644576
Sum1166152812
Variance3546691724
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
04020
 
13.4%
390207
 
0.7%
78086
 
0.3%
15078
 
0.3%
31677
 
0.3%
32656
 
0.2%
39645
 
0.1%
41636
 
0.1%
-1833
 
0.1%
240032
 
0.1%
Other values (20594)25330
84.4%
ValueCountFrequency (%)
-3396031
< 0.1%
-2090511
< 0.1%
-1509531
< 0.1%
-946251
< 0.1%
-738951
< 0.1%
ValueCountFrequency (%)
9616641
< 0.1%
6999441
< 0.1%
5686381
< 0.1%
5277111
< 0.1%
5275661
< 0.1%

PAY_AMT1
Real number (ℝ≥0)

ZEROS

Distinct7943
Distinct (%)26.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5663.5805
Minimum0
Maximum873552
Zeros5249
Zeros (%)17.5%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11000
median2100
Q35006
95-th percentile18428.2
Maximum873552
Range873552
Interquartile range (IQR)4006

Descriptive statistics

Standard deviation16563.28035
Coefficient of variation (CV)2.924524575
Kurtosis415.2547427
Mean5663.5805
Median Absolute Deviation (MAD)1932
Skewness14.66836433
Sum169907415
Variance274342256.1
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
05249
 
17.5%
20001363
 
4.5%
3000891
 
3.0%
5000698
 
2.3%
1500507
 
1.7%
4000426
 
1.4%
10000401
 
1.3%
1000365
 
1.2%
2500298
 
1.0%
6000294
 
1.0%
Other values (7933)19508
65.0%
ValueCountFrequency (%)
05249
17.5%
19
 
< 0.1%
214
 
< 0.1%
315
 
0.1%
418
 
0.1%
ValueCountFrequency (%)
8735521
< 0.1%
5050001
< 0.1%
4933581
< 0.1%
4239031
< 0.1%
4050161
< 0.1%

PAY_AMT2
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct7899
Distinct (%)26.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5921.1635
Minimum0
Maximum1684259
Zeros5396
Zeros (%)18.0%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1833
median2009
Q35000
95-th percentile19004.35
Maximum1684259
Range1684259
Interquartile range (IQR)4167

Descriptive statistics

Standard deviation23040.8704
Coefficient of variation (CV)3.891274139
Kurtosis1641.631911
Mean5921.1635
Median Absolute Deviation (MAD)1991
Skewness30.45381745
Sum177634905
Variance530881708.9
MonotonicityNot monotonic
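
PAY_AMT2 is the only variable flagged as highly skewed (skewness 30.45). The sketch below shows one way such a flag could be computed. It uses the bias-adjusted Fisher-Pearson estimator, which is believed to match what pandas reports, and a threshold of 20. Both the threshold value and the use of the absolute value are assumptions for illustration, not settings confirmed by this report.

```python
import math


def sample_skewness(values):
    """Bias-adjusted Fisher-Pearson skewness coefficient (G1)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n   # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n   # third central moment
    g1 = m3 / m2 ** 1.5
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


SKEW_THRESHOLD = 20  # assumed alert threshold, not taken from the report


def is_highly_skewed(values, threshold=SKEW_THRESHOLD):
    """Flag a column whose absolute skewness exceeds the threshold."""
    return abs(sample_skewness(values)) > threshold


# Toy data: one extreme payment among many zeros gives a long right tail.
toy_payments = [0] * 999 + [1000]
print(is_highly_skewed(toy_payments), is_highly_skewed([1, 2, 3, 4, 5]))
```

With a tail this heavy, a log transform such as log(1 + x) is a common preprocessing step before modelling, although the report itself does not prescribe one.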
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
05396
 
18.0%
20001290
 
4.3%
3000857
 
2.9%
5000717
 
2.4%
1000594
 
2.0%
1500521
 
1.7%
4000410
 
1.4%
10000318
 
1.1%
6000283
 
0.9%
2500251
 
0.8%
Other values (7889)19363
64.5%
ValueCountFrequency (%)
05396
18.0%
115
 
0.1%
220
 
0.1%
318
 
0.1%
411
 
< 0.1%
ValueCountFrequency (%)
16842591
< 0.1%
12270821
< 0.1%
12154711
< 0.1%
10245161
< 0.1%
5804641
< 0.1%

PAY_AMT3
Real number (ℝ≥0)

ZEROS

Distinct7518
Distinct (%)25.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5225.6815
Minimum0
Maximum896040
Zeros5968
Zeros (%)19.9%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1390
median1800
Q34505
95-th percentile17589.4
Maximum896040
Range896040
Interquartile range (IQR)4115

Descriptive statistics

Standard deviation17606.96147
Coefficient of variation (CV)3.36931393
Kurtosis564.3112295
Mean5225.6815
Median Absolute Deviation (MAD)1795
Skewness17.21663544
Sum156770445
Variance310005092.2
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
05968
 
19.9%
20001285
 
4.3%
10001103
 
3.7%
3000870
 
2.9%
5000721
 
2.4%
1500490
 
1.6%
4000381
 
1.3%
10000312
 
1.0%
1200243
 
0.8%
6000241
 
0.8%
Other values (7508)18386
61.3%
ValueCountFrequency (%)
05968
19.9%
113
 
< 0.1%
219
 
0.1%
314
 
< 0.1%
415
 
0.1%
ValueCountFrequency (%)
8960401
< 0.1%
8890431
< 0.1%
5082291
< 0.1%
4175881
< 0.1%
4009721
< 0.1%

PAY_AMT4
Real number (ℝ≥0)

ZEROS

Distinct6937
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4826.076867
Minimum0
Maximum621000
Zeros6408
Zeros (%)21.4%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1296
median1500
Q34013.25
95-th percentile16014.95
Maximum621000
Range621000
Interquartile range (IQR)3717.25

Descriptive statistics

Standard deviation15666.15974
Coefficient of variation (CV)3.246147995
Kurtosis277.3337677
Mean4826.076867
Median Absolute Deviation (MAD)1500
Skewness12.90498482
Sum144782306
Variance245428561.1
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
06408
 
21.4%
10001394
 
4.6%
20001214
 
4.0%
3000887
 
3.0%
5000810
 
2.7%
1500441
 
1.5%
4000402
 
1.3%
10000341
 
1.1%
2500259
 
0.9%
500258
 
0.9%
Other values (6927)17586
58.6%
ValueCountFrequency (%)
06408
21.4%
122
 
0.1%
222
 
0.1%
313
 
< 0.1%
420
 
0.1%
ValueCountFrequency (%)
6210001
< 0.1%
5288971
< 0.1%
4970001
< 0.1%
4321301
< 0.1%
4000461
< 0.1%

PAY_AMT5
Real number (ℝ≥0)

ZEROS

Distinct6897
Distinct (%)23.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4799.387633
Minimum0
Maximum426529
Zeros6703
Zeros (%)22.3%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1252.5
median1500
Q34031.5
95-th percentile16000
Maximum426529
Range426529
Interquartile range (IQR)3779

Descriptive statistics

Standard deviation15278.30568
Coefficient of variation (CV)3.183386475
Kurtosis180.0639402
Mean4799.387633
Median Absolute Deviation (MAD)1500
Skewness11.12741705
Sum143981629
Variance233426624.4
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
06703
 
22.3%
10001340
 
4.5%
20001323
 
4.4%
3000947
 
3.2%
5000814
 
2.7%
1500426
 
1.4%
4000401
 
1.3%
10000343
 
1.1%
500250
 
0.8%
6000247
 
0.8%
Other values (6887)17206
57.4%
ValueCountFrequency (%)
06703
22.3%
121
 
0.1%
213
 
< 0.1%
313
 
< 0.1%
412
 
< 0.1%
ValueCountFrequency (%)
4265291
< 0.1%
4179901
< 0.1%
3880711
< 0.1%
3792671
< 0.1%
3320001
< 0.1%

PAY_AMT6
Real number (ℝ≥0)

ZEROS

Distinct6939
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5215.502567
Minimum0
Maximum528666
Zeros7173
Zeros (%)23.9%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1117.75
median1500
Q34000
95-th percentile17343.8
Maximum528666
Range528666
Interquartile range (IQR)3882.25

Descriptive statistics

Standard deviation17777.46578
Coefficient of variation (CV)3.408581541
Kurtosis167.1614296
Mean5215.502567
Median Absolute Deviation (MAD)1500
Skewness10.64072733
Sum156465077
Variance316038289.4
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
07173
23.9%
10001299
 
4.3%
20001295
 
4.3%
3000914
 
3.0%
5000808
 
2.7%
1500439
 
1.5%
4000411
 
1.4%
10000356
 
1.2%
500247
 
0.8%
6000220
 
0.7%
Other values (6929)16838
56.1%
ValueCountFrequency (%)
07173
23.9%
120
 
0.1%
29
 
< 0.1%
314
 
< 0.1%
412
 
< 0.1%
ValueCountFrequency (%)
5286661
< 0.1%
5271431
< 0.1%
4430011
< 0.1%
4220001
< 0.1%
4035001
< 0.1%

default.payment.next.month
Real number (ℝ≥0)

ZEROS

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2212
Minimum0
Maximum1
Zeros23364
Zeros (%)77.9%
Negative0
Negative (%)0.0%
Memory size234.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4150618057
Coefficient of variation (CV)1.87640961
Kurtosis-0.195010139
Mean0.2212
Median Absolute Deviation (MAD)0
Skewness1.343503951
Sum6636
Variance0.1722763025
MonotonicityNot monotonic
Histogram with fixed size bins (bins=2)
Common values
Value                   Count   Frequency (%)
0                       23364   77.9%
1                       6636    22.1%

Minimum 5 values
Value                   Count   Frequency (%)
0                       23364   77.9%
1                       6636    22.1%

Maximum 5 values
Value                   Count   Frequency (%)
1                       6636    22.1%
0                       23364   77.9%
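
For a 0/1 column such as default.payment.next.month, the mean, the variance, and the class balance all follow from the two counts. A small sketch that re-derives the reported figures (sample variance with division by n - 1 is assumed, as it reproduces the reported value):

```python
# Value counts copied from the frequency table.
counts = {0: 23364, 1: 6636}

n = sum(counts.values())
# Mean of a 0/1 column equals the share of ones (the default rate).
mean = sum(value * count for value, count in counts.items()) / n
# Sample variance computed from the frequency table.
variance = sum(count * (value - mean) ** 2 for value, count in counts.items()) / (n - 1)
# Share of the majority class; a constant predictor would score this accuracy.
majority_share = max(counts.values()) / n

print(n, mean, round(variance, 10), round(100 * majority_share, 1))
```

Since about 77.9% of rows are non-defaults, plain accuracy is a weak yardstick for any model trained on this target. Metrics that account for the imbalance are usually a better fit.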