Dataset statistics
Number of variables | 32 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 14.8 MiB |
Average record size in memory | 1.5 KiB |
Variable types
CAT | 22 |
---|---|
NUM | 7 |
BOOL | 2 |
DATE | 1 |
Reproduction
Analysis started | 2020-07-28 16:54:02.455401 |
---|---|
Analysis finished | 2020-07-28 16:54:14.638727 |
Duration | 12.18 seconds |
Version | pandas-profiling v2.8.0 |
Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Download configuration | config.yaml |
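
The reported duration follows from the two timestamps in the table above. A minimal sketch using only the Python standard library (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2020-07-28 16:54:02.455401", FMT)
finished = datetime.strptime("2020-07-28 16:54:14.638727", FMT)

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```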
Warnings
_FILE has constant value "dim_customer.csv" | Constant |
NAME_STYLE has constant value "False" | Constant |
FIRST_NAME has a high cardinality: 626 distinct values | High cardinality |
LAST_NAME has a high cardinality: 318 distinct values | High cardinality |
BIRTH_DATE has a high cardinality: 6196 distinct values | High cardinality |
ADDRESS_LINE_1 has a high cardinality: 8300 distinct values | High cardinality |
ADDRESS_LINE_2 has a high cardinality: 104 distinct values | High cardinality |
PHONE has a high cardinality: 4855 distinct values | High cardinality |
DATE_FIRST_PURCHASE has a high cardinality: 1097 distinct values | High cardinality |
BIRTH_DATE is uniformly distributed | Uniform |
ADDRESS_LINE_1 is uniformly distributed | Uniform |
_LINE has unique values | Unique |
CUSTOMER_KEY has unique values | Unique |
CUSTOMER_ALTERNATE_KEY has unique values | Unique |
EMAIL_ADDRESS has unique values | Unique |
TOTAL_CHILDREN has 2758 (27.6%) zeros | Zeros |
NUMBER_CHILDREN_AT_HOME has 6021 (60.2%) zeros | Zeros |
NUMBER_CARS_OWNED has 2354 (23.5%) zeros | Zeros |
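
The percentages in the Zeros warnings can be recomputed from the zero counts and the number of observations. The sketch below is only an illustration of that arithmetic; `zero_share` is a hypothetical helper, not a pandas-profiling function:

```python
N_ROWS = 10000  # "Number of observations" from the dataset statistics

# Zero counts copied from the Zeros warnings above.
zero_counts = {
    "TOTAL_CHILDREN": 2758,
    "NUMBER_CHILDREN_AT_HOME": 6021,
    "NUMBER_CARS_OWNED": 2354,
}

def zero_share(count, n_rows=N_ROWS):
    """Return the percentage of rows whose value is zero."""
    return 100.0 * count / n_rows

for name, count in zero_counts.items():
    print(f"{name} has {count} ({zero_share(count):.1f}%) zeros")
```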
_FILE
Categorical
Distinct count | 1 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
dim_customer.csv | 10000 | 100.0% |
Length
Max length | 16 |
---|---|
Median length | 16 |
Mean length | 16 |
Min length | 16 |
Most occurring characters
Value | Count | Frequency (%) | |
m | 20000 | 12.5% | |
c | 20000 | 12.5% | |
s | 20000 | 12.5% | |
d | 10000 | 6.2% | |
i | 10000 | 6.2% | |
_ | 10000 | 6.2% | |
u | 10000 | 6.2% | |
t | 10000 | 6.2% | |
o | 10000 | 6.2% | |
e | 10000 | 6.2% | |
r | 10000 | 6.2% | |
. | 10000 | 6.2% | |
v | 10000 | 6.2% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 140000 | 87.5% | |
Connector Punctuation | 10000 | 6.2% | |
Other Punctuation | 10000 | 6.2% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
m | 20000 | 14.3% | |
c | 20000 | 14.3% | |
s | 20000 | 14.3% | |
d | 10000 | 7.1% | |
i | 10000 | 7.1% | |
u | 10000 | 7.1% | |
t | 10000 | 7.1% | |
o | 10000 | 7.1% | |
e | 10000 | 7.1% | |
r | 10000 | 7.1% | |
v | 10000 | 7.1% |
Most frequent Connector Punctuation characters
Value | Count | Frequency (%) | |
_ | 10000 | 100.0% |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
. | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 140000 | 87.5% | |
Common | 20000 | 12.5% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
m | 20000 | 14.3% | |
c | 20000 | 14.3% | |
s | 20000 | 14.3% | |
d | 10000 | 7.1% | |
i | 10000 | 7.1% | |
u | 10000 | 7.1% | |
t | 10000 | 7.1% | |
o | 10000 | 7.1% | |
e | 10000 | 7.1% | |
r | 10000 | 7.1% | |
v | 10000 | 7.1% |
Most frequent Common characters
Value | Count | Frequency (%) | |
_ | 10000 | 50.0% | |
. | 10000 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 160000 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
m | 20000 | 12.5% | |
c | 20000 | 12.5% | |
s | 20000 | 12.5% | |
d | 10000 | 6.2% | |
i | 10000 | 6.2% | |
_ | 10000 | 6.2% | |
u | 10000 | 6.2% | |
t | 10000 | 6.2% | |
o | 10000 | 6.2% | |
e | 10000 | 6.2% | |
r | 10000 | 6.2% | |
. | 10000 | 6.2% | |
v | 10000 | 6.2% |
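
Because this column holds one constant string, the character tables above can be reproduced by counting the characters of that string once and scaling by the number of rows. The sketch below assumes the report's category names (Lowercase Letter, Connector Punctuation, Other Punctuation) correspond to the Unicode general categories `Ll`, `Pc`, and `Po`; it illustrates the counting and is not the profiler's own code:

```python
from collections import Counter
import unicodedata

VALUE = "dim_customer.csv"  # the single value of the constant column
N_ROWS = 10000              # every row holds the same value

# Per-character counts across all rows.
char_counts = Counter()
for ch in VALUE:
    char_counts[ch] += N_ROWS

# Counts grouped by Unicode general category ("Ll", "Pc", "Po").
category_counts = Counter()
for ch, count in char_counts.items():
    category_counts[unicodedata.category(ch)] += count

total = sum(char_counts.values())
for cat, count in category_counts.most_common():
    print(cat, count, f"{100 * count / total:.1f}%")
```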
_LINE
Real number (ℝ≥0)
Distinct count | 10000 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12313.6977 |
---|---|
Minimum | 0 |
Maximum | 18483 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Memory size | 19.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1345.95 |
Q1 | 10983.75 |
median | 13483.5 |
Q3 | 15983.25 |
95-th percentile | 17983.05 |
Maximum | 18483 |
Range | 18483 |
Interquartile range (IQR) | 4999.5 |
Descriptive statistics
Standard deviation | 5093.443851 |
---|---|
Coefficient of variation (CV) | 0.4136404819 |
Kurtosis | 0.3780766517 |
Mean | 12313.6977 |
Median Absolute Deviation (MAD) | 2500 |
Skewness | -1.185599209 |
Sum | 123136977 |
Variance | 25943170.26 |
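
Several of the figures above are derived from others in the same tables: the range from the extremes, the IQR from the quartiles, the coefficient of variation from the standard deviation and the mean, and the variance from the standard deviation. A short consistency check in Python (variable names are illustrative):

```python
# Values copied from the statistics tables above.
minimum, maximum = 0, 18483
q1, q3 = 10983.75, 15983.25
mean = 12313.6977
std = 5093.443851

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(value_range, iqr, round(cv, 10), round(variance, 2))
```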
Histogram with fixed size bins (bins=10)
Common values
Value | Count | Frequency (%) | |
18431 | 1 | < 0.1% | |
11583 | 1 | < 0.1% | |
1354 | 1 | < 0.1% | |
11591 | 1 | < 0.1% | |
18023 | 1 | < 0.1% | |
15685 | 1 | < 0.1% | |
13636 | 1 | < 0.1% | |
18050 | 1 | < 0.1% | |
1346 | 1 | < 0.1% | |
15677 | 1 | < 0.1% | |
15693 | 1 | < 0.1% | |
13628 | 1 | < 0.1% | |
1338 | 1 | < 0.1% | |
11575 | 1 | < 0.1% | |
18168 | 1 | < 0.1% | |
15669 | 1 | < 0.1% | |
13620 | 1 | < 0.1% | |
1330 | 1 | < 0.1% | |
13644 | 1 | < 0.1% | |
1970 | 1 | < 0.1% | |
11767 | 1 | < 0.1% | |
1378 | 1 | < 0.1% | |
13676 | 1 | < 0.1% | |
16536 | 1 | < 0.1% | |
1386 | 1 | < 0.1% | |
Other values (9975) | 9975 | 99.8% |
Minimum 10 values
Value | Count | Frequency (%) | |
0 | 1 | < 0.1% | |
1 | 1 | < 0.1% | |
2 | 1 | < 0.1% | |
10 | 1 | < 0.1% | |
11 | 1 | < 0.1% | |
12 | 1 | < 0.1% | |
13 | 1 | < 0.1% | |
14 | 1 | < 0.1% | |
15 | 1 | < 0.1% | |
16 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) | |
18483 | 1 | < 0.1% | |
18482 | 1 | < 0.1% | |
18481 | 1 | < 0.1% | |
18480 | 1 | < 0.1% | |
18479 | 1 | < 0.1% | |
18478 | 1 | < 0.1% | |
18477 | 1 | < 0.1% | |
18476 | 1 | < 0.1% | |
18475 | 1 | < 0.1% | |
18474 | 1 | < 0.1% |
CUSTOMER_KEY
Real number (ℝ≥0)
Distinct count | 10000 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20241.5513 |
---|---|
Minimum | 11000 |
Maximum | 29388 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 19.7 KiB |
Quantile statistics
Minimum | 11000 |
---|---|
5-th percentile | 12151.95 |
Q1 | 15132.75 |
median | 20715.5 |
Q3 | 24746.25 |
95-th percentile | 28201.05 |
Maximum | 29388 |
Range | 18388 |
Interquartile range (IQR) | 9613.5 |
Descriptive statistics
Standard deviation | 5211.229748 |
---|---|
Coefficient of variation (CV) | 0.257452093 |
Kurtosis | -1.172990799 |
Mean | 20241.5513 |
Median Absolute Deviation (MAD) | 4334 |
Skewness | -0.0528301809 |
Sum | 202415513 |
Variance | 27156915.49 |
Histogram with fixed size bins (bins=10)
Common values
Value | Count | Frequency (%) | |
18431 | 1 | < 0.1% | |
13404 | 1 | < 0.1% | |
13420 | 1 | < 0.1% | |
19563 | 1 | < 0.1% | |
17514 | 1 | < 0.1% | |
23657 | 1 | < 0.1% | |
23649 | 1 | < 0.1% | |
25694 | 1 | < 0.1% | |
29276 | 1 | < 0.1% | |
17498 | 1 | < 0.1% | |
17522 | 1 | < 0.1% | |
25686 | 1 | < 0.1% | |
13396 | 1 | < 0.1% | |
17490 | 1 | < 0.1% | |
23633 | 1 | < 0.1% | |
27727 | 1 | < 0.1% | |
25678 | 1 | < 0.1% | |
13388 | 1 | < 0.1% | |
13363 | 1 | < 0.1% | |
19571 | 1 | < 0.1% | |
13620 | 1 | < 0.1% | |
17546 | 1 | < 0.1% | |
17562 | 1 | < 0.1% | |
23705 | 1 | < 0.1% | |
13460 | 1 | < 0.1% | |
Other values (9975) | 9975 | 99.8% |
Minimum 10 values
Value | Count | Frequency (%) | |
11000 | 1 | < 0.1% | |
11002 | 1 | < 0.1% | |
11006 | 1 | < 0.1% | |
11015 | 1 | < 0.1% | |
11019 | 1 | < 0.1% | |
11023 | 1 | < 0.1% | |
11024 | 1 | < 0.1% | |
11025 | 1 | < 0.1% | |
11026 | 1 | < 0.1% | |
11027 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) | |
29388 | 1 | < 0.1% | |
29374 | 1 | < 0.1% | |
29373 | 1 | < 0.1% | |
29372 | 1 | < 0.1% | |
29369 | 1 | < 0.1% | |
29368 | 1 | < 0.1% | |
29367 | 1 | < 0.1% | |
29366 | 1 | < 0.1% | |
29365 | 1 | < 0.1% | |
29363 | 1 | < 0.1% |
GEOGRAPHY_KEY
Real number (ℝ≥0)
Distinct count | 314 |
---|---|
Unique (%) | 3.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 255.531 |
---|---|
Minimum | 2 |
Maximum | 653 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 19.7 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 13 |
Q1 | 66 |
median | 237 |
Q3 | 339 |
95-th percentile | 631 |
Maximum | 653 |
Range | 651 |
Interquartile range (IQR) | 273 |
Descriptive statistics
Standard deviation | 192.6080004 |
---|---|
Coefficient of variation (CV) | 0.753755906 |
Kurtosis | -0.5996480352 |
Mean | 255.531 |
Median Absolute Deviation (MAD) | 118 |
Skewness | 0.5855306408 |
Sum | 2555310 |
Variance | 37097.84182 |
Histogram with fixed size bins (bins=10)
Common values
Value | Count | Frequency (%) | |
311 | 129 | 1.3% | |
612 | 110 | 1.1% | |
299 | 107 | 1.1% | |
298 | 106 | 1.1% | |
609 | 106 | 1.1% | |
49 | 104 | 1.0% | |
302 | 103 | 1.0% | |
307 | 101 | 1.0% | |
536 | 101 | 1.0% | |
301 | 93 | 0.9% | |
300 | 93 | 0.9% | |
611 | 92 | 0.9% | |
310 | 82 | 0.8% | |
312 | 71 | 0.7% | |
316 | 63 | 0.6% | |
539 | 62 | 0.6% | |
543 | 61 | 0.6% | |
315 | 61 | 0.6% | |
343 | 61 | 0.6% | |
51 | 61 | 0.6% | |
20 | 61 | 0.6% | |
4 | 60 | 0.6% | |
335 | 60 | 0.6% | |
25 | 60 | 0.6% | |
32 | 59 | 0.6% | |
Other values (289) | 7933 | 79.3% |
Minimum 10 values
Value | Count | Frequency (%) | |
2 | 58 | 0.6% | |
3 | 41 | 0.4% | |
4 | 60 | 0.6% | |
5 | 39 | 0.4% | |
6 | 48 | 0.5% | |
7 | 38 | 0.4% | |
8 | 44 | 0.4% | |
9 | 41 | 0.4% | |
10 | 34 | 0.3% | |
11 | 42 | 0.4% |
Maximum 10 values
Value | Count | Frequency (%) | |
653 | 1 | < 0.1% | |
648 | 48 | 0.5% | |
644 | 54 | 0.5% | |
642 | 47 | 0.5% | |
641 | 41 | 0.4% | |
638 | 49 | 0.5% | |
637 | 47 | 0.5% | |
635 | 50 | 0.5% | |
634 | 46 | 0.5% | |
633 | 57 | 0.6% |
CUSTOMER_ALTERNATE_KEY
Categorical
Distinct count | 10000 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
AW00017662 | 1 | < 0.1% | |
AW00020303 | 1 | < 0.1% | |
AW00022525 | 1 | < 0.1% | |
AW00022628 | 1 | < 0.1% | |
AW00016404 | 1 | < 0.1% | |
AW00017003 | 1 | < 0.1% | |
AW00023314 | 1 | < 0.1% | |
AW00012544 | 1 | < 0.1% | |
AW00016262 | 1 | < 0.1% | |
AW00026926 | 1 | < 0.1% | |
AW00017114 | 1 | < 0.1% | |
AW00028904 | 1 | < 0.1% | |
AW00020024 | 1 | < 0.1% | |
AW00020647 | 1 | < 0.1% | |
AW00023786 | 1 | < 0.1% | |
AW00022519 | 1 | < 0.1% | |
AW00029238 | 1 | < 0.1% | |
AW00014696 | 1 | < 0.1% | |
AW00020545 | 1 | < 0.1% | |
AW00027451 | 1 | < 0.1% | |
AW00012796 | 1 | < 0.1% | |
AW00021258 | 1 | < 0.1% | |
AW00012102 | 1 | < 0.1% | |
AW00017231 | 1 | < 0.1% | |
AW00025512 | 1 | < 0.1% | |
Other values (9975) | 9975 | 99.8% |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Most occurring characters
Value | Count | Frequency (%) | |
0 | 34030 | 34.0% | |
2 | 10186 | 10.2% | |
A | 10000 | 10.0% | |
W | 10000 | 10.0% | |
1 | 8380 | 8.4% | |
4 | 4290 | 4.3% | |
7 | 4161 | 4.2% | |
3 | 4041 | 4.0% | |
5 | 3938 | 3.9% | |
6 | 3867 | 3.9% | |
8 | 3680 | 3.7% | |
9 | 3427 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) | |
Decimal Number | 80000 | 80.0% | |
Uppercase Letter | 20000 | 20.0% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
A | 10000 | 50.0% | |
W | 10000 | 50.0% |
Most frequent Decimal Number characters
Value | Count | Frequency (%) | |
0 | 34030 | 42.5% | |
2 | 10186 | 12.7% | |
1 | 8380 | 10.5% | |
4 | 4290 | 5.4% | |
7 | 4161 | 5.2% | |
3 | 4041 | 5.1% | |
5 | 3938 | 4.9% | |
6 | 3867 | 4.8% | |
8 | 3680 | 4.6% | |
9 | 3427 | 4.3% |
Most occurring scripts
Value | Count | Frequency (%) | |
Common | 80000 | 80.0% | |
Latin | 20000 | 20.0% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
A | 10000 | 50.0% | |
W | 10000 | 50.0% |
Most frequent Common characters
Value | Count | Frequency (%) | |
0 | 34030 | 42.5% | |
2 | 10186 | 12.7% | |
1 | 8380 | 10.5% | |
4 | 4290 | 5.4% | |
7 | 4161 | 5.2% | |
3 | 4041 | 5.1% | |
5 | 3938 | 4.9% | |
6 | 3867 | 4.8% | |
8 | 3680 | 4.6% | |
9 | 3427 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 100000 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
0 | 34030 | 34.0% | |
2 | 10186 | 10.2% | |
A | 10000 | 10.0% | |
W | 10000 | 10.0% | |
1 | 8380 | 8.4% | |
4 | 4290 | 4.3% | |
7 | 4161 | 4.2% | |
3 | 4041 | 4.0% | |
5 | 3938 | 3.9% | |
6 | 3867 | 3.9% | |
8 | 3680 | 3.7% | |
9 | 3427 | 3.4% |
TITLE
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
null | 9946 | 99.5% | |
Mr. | 30 | 0.3% | |
Ms. | 20 | 0.2% | |
Sr. | 3 | < 0.1% | |
Ms | 1 | < 0.1% |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9945 |
Min length | 2 |
Most occurring characters
Value | Count | Frequency (%) | |
l | 19892 | 49.8% | |
n | 9946 | 24.9% | |
u | 9946 | 24.9% | |
. | 53 | 0.1% | |
M | 51 | 0.1% | |
r | 33 | 0.1% | |
s | 21 | 0.1% | |
S | 3 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 39838 | 99.7% | |
Uppercase Letter | 54 | 0.1% | |
Other Punctuation | 53 | 0.1% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
l | 19892 | 49.9% | |
n | 9946 | 25.0% | |
u | 9946 | 25.0% | |
r | 33 | 0.1% | |
s | 21 | 0.1% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
M | 51 | 94.4% | |
S | 3 | 5.6% |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
. | 53 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 39892 | 99.9% | |
Common | 53 | 0.1% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
l | 19892 | 49.9% | |
n | 9946 | 24.9% | |
u | 9946 | 24.9% | |
M | 51 | 0.1% | |
r | 33 | 0.1% | |
s | 21 | 0.1% | |
S | 3 | < 0.1% |
Most frequent Common characters
Value | Count | Frequency (%) | |
. | 53 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 39945 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
l | 19892 | 49.8% | |
n | 9946 | 24.9% | |
u | 9946 | 24.9% | |
. | 53 | 0.1% | |
M | 51 | 0.1% | |
r | 33 | 0.1% | |
s | 21 | 0.1% | |
S | 3 | < 0.1% |
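
The report shows 0 missing values for TITLE, yet 9946 rows contain the literal string `null`, which is counted as a four-character category. The sketch below recomputes the mean length from the frequency table to confirm this reading. Treating `null` as missing is an assumption about the source data, not something the report states:

```python
# Frequency table for TITLE, copied from the report above.
title_counts = {"null": 9946, "Mr.": 30, "Ms.": 20, "Sr.": 3, "Ms": 1}

n_rows = sum(title_counts.values())
total_chars = sum(len(value) * count for value, count in title_counts.items())
mean_length = total_chars / n_rows

# Share of rows that would be missing if "null" were treated as a
# missing-value marker (assumption).
effective_missing = title_counts["null"] / n_rows

print(n_rows, total_chars, mean_length, f"{effective_missing:.1%}")
```

The recomputed character total and mean length agree with the ASCII block count (39945) and the mean length (3.9945) reported above, which is consistent with `null` being stored as text.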
FIRST_NAME
Categorical
Distinct count | 626 |
---|---|
Unique (%) | 6.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Isabella | 52 | 0.5% | |
Marcus | 52 | 0.5% | |
Julia | 52 | 0.5% | |
Seth | 51 | 0.5% | |
Xavier | 51 | 0.5% | |
Eduardo | 50 | 0.5% | |
Sydney | 50 | 0.5% | |
Kaitlyn | 50 | 0.5% | |
Natalie | 49 | 0.5% | |
Chloe | 48 | 0.5% | |
Rachel | 47 | 0.5% | |
Lucas | 47 | 0.5% | |
Katherine | 47 | 0.5% | |
Devin | 45 | 0.4% | |
Amanda | 45 | 0.4% | |
Jonathan | 45 | 0.4% | |
Olivia | 44 | 0.4% | |
Alexandra | 44 | 0.4% | |
Richard | 44 | 0.4% | |
James | 44 | 0.4% | |
Dalton | 43 | 0.4% | |
Charles | 43 | 0.4% | |
Morgan | 43 | 0.4% | |
Wyatt | 43 | 0.4% | |
Jennifer | 42 | 0.4% | |
Other values (601) | 8829 | 88.3% |
Length
Max length | 11 |
---|---|
Median length | 6 |
Mean length | 5.9349 |
Min length | 2 |
Most occurring characters
Value | Count | Frequency (%) | |
a | 7903 | 13.3% | |
e | 5987 | 10.1% | |
n | 5101 | 8.6% | |
i | 4464 | 7.5% | |
r | 4204 | 7.1% | |
l | 3576 | 6.0% | |
o | 2542 | 4.3% | |
s | 2141 | 3.6% | |
t | 2044 | 3.4% | |
y | 2006 | 3.4% | |
h | 1813 | 3.1% | |
d | 1560 | 2.6% | |
c | 1287 | 2.2% | |
J | 1241 | 2.1% | |
A | 1052 | 1.8% | |
u | 1032 | 1.7% | |
m | 904 | 1.5% | |
C | 883 | 1.5% | |
M | 876 | 1.5% | |
K | 644 | 1.1% | |
S | 610 | 1.0% | |
b | 608 | 1.0% | |
D | 607 | 1.0% | |
R | 574 | 1.0% | |
g | 488 | 0.8% | |
Other values (31) | 5202 | 8.8% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 49344 | 83.1% | |
Uppercase Letter | 10002 | 16.9% | |
Dash Punctuation | 1 | < 0.1% | |
Space Separator | 1 | < 0.1% | |
Other Punctuation | 1 | < 0.1% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
J | 1241 | 12.4% | |
A | 1052 | 10.5% | |
C | 883 | 8.8% | |
M | 876 | 8.8% | |
K | 644 | 6.4% | |
S | 610 | 6.1% | |
D | 607 | 6.1% | |
R | 574 | 5.7% | |
B | 470 | 4.7% | |
E | 455 | 4.5% | |
T | 442 | 4.4% | |
L | 429 | 4.3% | |
G | 297 | 3.0% | |
N | 282 | 2.8% | |
H | 208 | 2.1% | |
I | 169 | 1.7% | |
P | 163 | 1.6% | |
W | 163 | 1.6% | |
F | 136 | 1.4% | |
V | 109 | 1.1% | |
O | 92 | 0.9% | |
X | 51 | 0.5% | |
Z | 41 | 0.4% | |
Y | 8 | 0.1% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
a | 7903 | 16.0% | |
e | 5987 | 12.1% | |
n | 5101 | 10.3% | |
i | 4464 | 9.0% | |
r | 4204 | 8.5% | |
l | 3576 | 7.2% | |
o | 2542 | 5.2% | |
s | 2141 | 4.3% | |
t | 2044 | 4.1% | |
y | 2006 | 4.1% | |
h | 1813 | 3.7% | |
d | 1560 | 3.2% | |
c | 1287 | 2.6% | |
u | 1032 | 2.1% | |
m | 904 | 1.8% | |
b | 608 | 1.2% | |
g | 488 | 1.0% | |
v | 442 | 0.9% | |
k | 354 | 0.7% | |
w | 189 | 0.4% | |
f | 189 | 0.4% | |
x | 161 | 0.3% | |
p | 122 | 0.2% | |
j | 77 | 0.2% | |
z | 66 | 0.1% | |
Other values (4) | 84 | 0.2% |
Most frequent Dash Punctuation characters
Value | Count | Frequency (%) | |
- | 1 | 100.0% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 1 | 100.0% |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
. | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 59346 | > 99.9% | |
Common | 3 | < 0.1% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
a | 7903 | 13.3% | |
e | 5987 | 10.1% | |
n | 5101 | 8.6% | |
i | 4464 | 7.5% | |
r | 4204 | 7.1% | |
l | 3576 | 6.0% | |
o | 2542 | 4.3% | |
s | 2141 | 3.6% | |
t | 2044 | 3.4% | |
y | 2006 | 3.4% | |
h | 1813 | 3.1% | |
d | 1560 | 2.6% | |
c | 1287 | 2.2% | |
J | 1241 | 2.1% | |
A | 1052 | 1.8% | |
u | 1032 | 1.7% | |
m | 904 | 1.5% | |
C | 883 | 1.5% | |
M | 876 | 1.5% | |
K | 644 | 1.1% | |
S | 610 | 1.0% | |
b | 608 | 1.0% | |
D | 607 | 1.0% | |
R | 574 | 1.0% | |
g | 488 | 0.8% | |
Other values (28) | 5199 | 8.8% |
Most frequent Common characters
Value | Count | Frequency (%) | |
- | 1 | 33.3% | |
(space) | 1 | 33.3% | |
. | 1 | 33.3% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 59328 | > 99.9% | |
None | 21 | < 0.1% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
a | 7903 | 13.3% | |
e | 5987 | 10.1% | |
n | 5101 | 8.6% | |
i | 4464 | 7.5% | |
r | 4204 | 7.1% | |
l | 3576 | 6.0% | |
o | 2542 | 4.3% | |
s | 2141 | 3.6% | |
t | 2044 | 3.4% | |
y | 2006 | 3.4% | |
h | 1813 | 3.1% | |
d | 1560 | 2.6% | |
c | 1287 | 2.2% | |
J | 1241 | 2.1% | |
A | 1052 | 1.8% | |
u | 1032 | 1.7% | |
m | 904 | 1.5% | |
C | 883 | 1.5% | |
M | 876 | 1.5% | |
K | 644 | 1.1% | |
S | 610 | 1.0% | |
b | 608 | 1.0% | |
D | 607 | 1.0% | |
R | 574 | 1.0% | |
g | 488 | 0.8% | |
Other values (28) | 5181 | 8.7% |
Most frequent None characters
Value | Count | Frequency (%) | |
é | 19 | 90.5% | |
í | 1 | 4.8% | |
ñ | 1 | 4.8% |
MIDDLE_NAME
Categorical
Distinct count | 40 |
---|---|
Unique (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
null | 4213 | 42.1% | |
L | 696 | 7.0% | |
A | 684 | 6.8% | |
M | 619 | 6.2% | |
C | 521 | 5.2% | |
J | 517 | 5.2% | |
E | 387 | 3.9% | |
R | 368 | 3.7% | |
D | 300 | 3.0% | |
S | 252 | 2.5% | |
K | 199 | 2.0% | |
W | 191 | 1.9% | |
G | 154 | 1.5% | |
B | 145 | 1.5% | |
H | 132 | 1.3% | |
T | 127 | 1.3% | |
P | 115 | 1.1% | |
F | 113 | 1.1% | |
V | 76 | 0.8% | |
N | 53 | 0.5% | |
I | 52 | 0.5% | |
O | 35 | 0.4% | |
Y | 10 | 0.1% | |
Z | 10 | 0.1% | |
Q | 5 | 0.1% | |
Other values (15) | 26 | 0.3% |
Length
Max length | 6 |
---|---|
Median length | 1 |
Mean length | 2.2668 |
Min length | 1 |
Most occurring characters
Value | Count | Frequency (%) | |
l | 8429 | 37.2% | |
n | 4214 | 18.6% | |
u | 4214 | 18.6% | |
L | 697 | 3.1% | |
A | 685 | 3.0% | |
M | 622 | 2.7% | |
J | 523 | 2.3% | |
C | 521 | 2.3% | |
E | 387 | 1.7% | |
R | 373 | 1.6% | |
D | 300 | 1.3% | |
S | 252 | 1.1% | |
K | 200 | 0.9% | |
W | 191 | 0.8% | |
G | 155 | 0.7% | |
B | 146 | 0.6% | |
H | 133 | 0.6% | |
T | 127 | 0.6% | |
P | 115 | 0.5% | |
F | 114 | 0.5% | |
V | 76 | 0.3% | |
N | 54 | 0.2% | |
I | 52 | 0.2% | |
O | 35 | 0.2% | |
. | 19 | 0.1% | |
Other values (10) | 34 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 16865 | 74.4% | |
Uppercase Letter | 5784 | 25.5% | |
Other Punctuation | 19 | 0.1% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
L | 697 | 12.1% | |
A | 685 | 11.8% | |
M | 622 | 10.8% | |
J | 523 | 9.0% | |
C | 521 | 9.0% | |
E | 387 | 6.7% | |
R | 373 | 6.4% | |
D | 300 | 5.2% | |
S | 252 | 4.4% | |
K | 200 | 3.5% | |
W | 191 | 3.3% | |
G | 155 | 2.7% | |
B | 146 | 2.5% | |
H | 133 | 2.3% | |
T | 127 | 2.2% | |
P | 115 | 2.0% | |
F | 114 | 2.0% | |
V | 76 | 1.3% | |
N | 54 | 0.9% | |
I | 52 | 0.9% | |
O | 35 | 0.6% | |
Z | 10 | 0.2% | |
Y | 10 | 0.2% | |
Q | 5 | 0.1% | |
X | 1 | < 0.1% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
l | 8429 | 50.0% | |
n | 4214 | 25.0% | |
u | 4214 | 25.0% | |
a | 2 | < 0.1% | |
r | 2 | < 0.1% | |
d | 1 | < 0.1% | |
i | 1 | < 0.1% | |
e | 1 | < 0.1% | |
o | 1 | < 0.1% |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
. | 19 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 22649 | 99.9% | |
Common | 19 | 0.1% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
l | 8429 | 37.2% | |
n | 4214 | 18.6% | |
u | 4214 | 18.6% | |
L | 697 | 3.1% | |
A | 685 | 3.0% | |
M | 622 | 2.7% | |
J | 523 | 2.3% | |
C | 521 | 2.3% | |
E | 387 | 1.7% | |
R | 373 | 1.6% | |
D | 300 | 1.3% | |
S | 252 | 1.1% | |
K | 200 | 0.9% | |
W | 191 | 0.8% | |
G | 155 | 0.7% | |
B | 146 | 0.6% | |
H | 133 | 0.6% | |
T | 127 | 0.6% | |
P | 115 | 0.5% | |
F | 114 | 0.5% | |
V | 76 | 0.3% | |
N | 54 | 0.2% | |
I | 52 | 0.2% | |
O | 35 | 0.2% | |
Z | 10 | < 0.1% | |
Other values (9) | 24 | 0.1% |
Most frequent Common characters
Value | Count | Frequency (%) | |
. | 19 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 22668 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
l | 8429 | 37.2% | |
n | 4214 | 18.6% | |
u | 4214 | 18.6% | |
L | 697 | 3.1% | |
A | 685 | 3.0% | |
M | 622 | 2.7% | |
J | 523 | 2.3% | |
C | 521 | 2.3% | |
E | 387 | 1.7% | |
R | 373 | 1.6% | |
D | 300 | 1.3% | |
S | 252 | 1.1% | |
K | 200 | 0.9% | |
W | 191 | 0.8% | |
G | 155 | 0.7% | |
B | 146 | 0.6% | |
H | 133 | 0.6% | |
T | 127 | 0.6% | |
P | 115 | 0.5% | |
F | 114 | 0.5% | |
V | 76 | 0.3% | |
N | 54 | 0.2% | |
I | 52 | 0.2% | |
O | 35 | 0.2% | |
. | 19 | 0.1% | |
Other values (10) | 34 | 0.1% |
LAST_NAME
Categorical
Distinct count | 318 |
---|---|
Unique (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Diaz | 118 | 1.2% | |
Martin | 101 | 1.0% | |
Hernandez | 100 | 1.0% | |
Sanchez | 97 | 1.0% | |
Xu | 94 | 0.9% | |
Torres | 93 | 0.9% | |
Martinez | 91 | 0.9% | |
Lopez | 89 | 0.9% | |
Perez | 88 | 0.9% | |
Rodriguez | 87 | 0.9% | |
Gonzalez | 80 | 0.8% | |
Garcia | 79 | 0.8% | |
Shan | 77 | 0.8% | |
Kumar | 75 | 0.8% | |
Jai | 74 | 0.7% | |
Perry | 74 | 0.7% | |
Hughes | 72 | 0.7% | |
Russell | 70 | 0.7% | |
Lal | 69 | 0.7% | |
Washington | 68 | 0.7% | |
Ross | 68 | 0.7% | |
Patterson | 66 | 0.7% | |
Butler | 66 | 0.7% | |
Carlson | 66 | 0.7% | |
Romero | 65 | 0.7% | |
Other values (293) | 7973 | 79.7% |
Length
Max length | 16 |
---|---|
Median length | 6 |
Mean length | 5.5231 |
Min length | 2 |
Most occurring characters
Value | Count | Frequency (%) | |
a | 5642 | 10.2% | |
e | 5529 | 10.0% | |
r | 4700 | 8.5% | |
n | 4572 | 8.3% | |
o | 3985 | 7.2% | |
i | 2800 | 5.1% | |
s | 2548 | 4.6% | |
l | 2530 | 4.6% | |
z | 1627 | 2.9% | |
u | 1627 | 2.9% | |
t | 1579 | 2.9% | |
h | 1534 | 2.8% | |
d | 1231 | 2.2% | |
S | 1095 | 2.0% | |
g | 1091 | 2.0% | |
m | 1083 | 2.0% | |
R | 1068 | 1.9% | |
M | 776 | 1.4% | |
C | 702 | 1.3% | |
G | 648 | 1.2% | |
H | 605 | 1.1% | |
P | 588 | 1.1% | |
L | 577 | 1.0% | |
c | 571 | 1.0% | |
W | 539 | 1.0% | |
Other values (33) | 5984 | 10.8% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 45216 | 81.9% | |
Uppercase Letter | 10011 | 18.1% | |
Space Separator | 3 | < 0.1% | |
Dash Punctuation | 1 | < 0.1% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
S | 1095 | 10.9% | |
R | 1068 | 10.7% | |
M | 776 | 7.8% | |
C | 702 | 7.0% | |
G | 648 | 6.5% | |
H | 605 | 6.0% | |
P | 588 | 5.9% | |
L | 577 | 5.8% | |
W | 539 | 5.4% | |
B | 529 | 5.3% | |
A | 461 | 4.6% | |
J | 377 | 3.8% | |
T | 330 | 3.3% | |
D | 289 | 2.9% | |
Z | 228 | 2.3% | |
K | 223 | 2.2% | |
Y | 203 | 2.0% | |
N | 202 | 2.0% | |
F | 184 | 1.8% | |
X | 153 | 1.5% | |
V | 97 | 1.0% | |
E | 89 | 0.9% | |
O | 45 | 0.4% | |
U | 3 | < 0.1% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
a | 5642 | 12.5% | |
e | 5529 | 12.2% | |
r | 4700 | 10.4% | |
n | 4572 | 10.1% | |
o | 3985 | 8.8% | |
i | 2800 | 6.2% | |
s | 2548 | 5.6% | |
l | 2530 | 5.6% | |
z | 1627 | 3.6% | |
u | 1627 | 3.6% | |
t | 1579 | 3.5% | |
h | 1534 | 3.4% | |
d | 1231 | 2.7% | |
g | 1091 | 2.4% | |
m | 1083 | 2.4% | |
c | 571 | 1.3% | |
k | 450 | 1.0% | |
y | 448 | 1.0% | |
p | 364 | 0.8% | |
w | 332 | 0.7% | |
v | 296 | 0.7% | |
b | 242 | 0.5% | |
x | 118 | 0.3% | |
f | 112 | 0.2% | |
j | 111 | 0.2% | |
Other values (7) | 94 | 0.2% |
Most frequent Dash Punctuation characters
Value | Count | Frequency (%) | |
- | 1 | 100.0% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 55227 | > 99.9% | |
Common | 4 | < 0.1% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
a | 5642 | 10.2% | |
e | 5529 | 10.0% | |
r | 4700 | 8.5% | |
n | 4572 | 8.3% | |
o | 3985 | 7.2% | |
i | 2800 | 5.1% | |
s | 2548 | 4.6% | |
l | 2530 | 4.6% | |
z | 1627 | 2.9% | |
u | 1627 | 2.9% | |
t | 1579 | 2.9% | |
h | 1534 | 2.8% | |
d | 1231 | 2.2% | |
S | 1095 | 2.0% | |
g | 1091 | 2.0% | |
m | 1083 | 2.0% | |
R | 1068 | 1.9% | |
M | 776 | 1.4% | |
C | 702 | 1.3% | |
G | 648 | 1.2% | |
H | 605 | 1.1% | |
P | 588 | 1.1% | |
L | 577 | 1.0% | |
c | 571 | 1.0% | |
W | 539 | 1.0% | |
Other values (31) | 5980 | 10.8% |
Most frequent Common characters
Value | Count | Frequency (%) | |
(space) | 3 | 75.0% | |
- | 1 | 25.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 55202 | 99.9% | |
None | 29 | 0.1% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
a | 5642 | 10.2% | |
e | 5529 | 10.0% | |
r | 4700 | 8.5% | |
n | 4572 | 8.3% | |
o | 3985 | 7.2% | |
i | 2800 | 5.1% | |
s | 2548 | 4.6% | |
l | 2530 | 4.6% | |
z | 1627 | 2.9% | |
u | 1627 | 2.9% | |
t | 1579 | 2.9% | |
h | 1534 | 2.8% | |
d | 1231 | 2.2% | |
S | 1095 | 2.0% | |
g | 1091 | 2.0% | |
m | 1083 | 2.0% | |
R | 1068 | 1.9% | |
M | 776 | 1.4% | |
C | 702 | 1.3% | |
G | 648 | 1.2% | |
H | 605 | 1.1% | |
P | 588 | 1.1% | |
L | 577 | 1.0% | |
c | 571 | 1.0% | |
W | 539 | 1.0% | |
Other values (27) | 5955 | 10.8% |
Most frequent None characters
Value | Count | Frequency (%) | |
é | 19 | 65.5% | |
á | 5 | 17.2% | |
ñ | 2 | 6.9% | |
ø | 1 | 3.4% | |
ó | 1 | 3.4% | |
ã | 1 | 3.4% |
NAME_STYLE
Boolean
Distinct count | 1 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
Value | Count | Frequency (%) | |
False | 10000 | 100.0% |
BIRTH_DATE
Categorical
Distinct count | 6196 |
---|---|
Unique (%) | 62.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
1967-06-11T00:00:00.0000000 | 7 | 0.1% | |
1972-03-13T00:00:00.0000000 | 7 | 0.1% | |
1973-08-06T00:00:00.0000000 | 6 | 0.1% | |
1963-09-03T00:00:00.0000000 | 6 | 0.1% | |
1966-07-26T00:00:00.0000000 | 6 | 0.1% | |
1970-03-05T00:00:00.0000000 | 6 | 0.1% | |
1962-05-04T00:00:00.0000000 | 6 | 0.1% | |
1972-01-15T00:00:00.0000000 | 6 | 0.1% | |
1960-05-14T00:00:00.0000000 | 6 | 0.1% | |
1955-09-23T00:00:00.0000000 | 6 | 0.1% | |
1952-07-11T00:00:00.0000000 | 6 | 0.1% | |
1964-05-12T00:00:00.0000000 | 6 | 0.1% | |
1961-02-04T00:00:00.0000000 | 6 | 0.1% | |
1960-05-22T00:00:00.0000000 | 6 | 0.1% | |
1965-10-04T00:00:00.0000000 | 6 | 0.1% | |
1979-08-20T00:00:00.0000000 | 6 | 0.1% | |
1962-06-24T00:00:00.0000000 | 6 | 0.1% | |
1971-06-15T00:00:00.0000000 | 5 | 0.1% | |
1957-07-06T00:00:00.0000000 | 5 | 0.1% | |
1960-07-27T00:00:00.0000000 | 5 | 0.1% | |
1979-08-23T00:00:00.0000000 | 5 | 0.1% | |
1964-07-14T00:00:00.0000000 | 5 | 0.1% | |
1965-06-23T00:00:00.0000000 | 5 | 0.1% | |
1962-04-02T00:00:00.0000000 | 5 | 0.1% | |
1965-08-27T00:00:00.0000000 | 5 | 0.1% | |
Other values (6171) | 9856 | 98.6% |
Length
Max length | 27 |
---|---|
Median length | 27 |
Mean length | 27 |
Min length | 27 |
Most occurring characters
Value | Count | Frequency (%) | |
0 | 143575 | 53.2% | |
- | 20000 | 7.4% | |
: | 20000 | 7.4% | |
1 | 19232 | 7.1% | |
9 | 12644 | 4.7% | |
T | 10000 | 3.7% | |
. | 10000 | 3.7% | |
2 | 6641 | 2.5% | |
6 | 6447 | 2.4% | |
7 | 5497 | 2.0% | |
5 | 5250 | 1.9% | |
4 | 4239 | 1.6% | |
3 | 3463 | 1.3% | |
8 | 3012 | 1.1% |
Most occurring categories
Value | Count | Frequency (%) | |
Decimal Number | 210000 | 77.8% | |
Other Punctuation | 30000 | 11.1% | |
Dash Punctuation | 20000 | 7.4% | |
Uppercase Letter | 10000 | 3.7% |
Most frequent Decimal Number characters
Value | Count | Frequency (%) | |
0 | 143575 | 68.4% | |
1 | 19232 | 9.2% | |
9 | 12644 | 6.0% | |
2 | 6641 | 3.2% | |
6 | 6447 | 3.1% | |
7 | 5497 | 2.6% | |
5 | 5250 | 2.5% | |
4 | 4239 | 2.0% | |
3 | 3463 | 1.6% | |
8 | 3012 | 1.4% |
Most frequent Dash Punctuation characters
Value | Count | Frequency (%) | |
- | 20000 | 100.0% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
T | 10000 | 100.0% |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
: | 20000 | 66.7% | |
. | 10000 | 33.3% |
Most occurring scripts
Value | Count | Frequency (%) | |
Common | 260000 | 96.3% | |
Latin | 10000 | 3.7% |
Most frequent Common characters
Value | Count | Frequency (%) | |
0 | 143575 | 55.2% | |
- | 20000 | 7.7% | |
: | 20000 | 7.7% | |
1 | 19232 | 7.4% | |
9 | 12644 | 4.9% | |
. | 10000 | 3.8% | |
2 | 6641 | 2.6% | |
6 | 6447 | 2.5% | |
7 | 5497 | 2.1% | |
5 | 5250 | 2.0% | |
4 | 4239 | 1.6% | |
3 | 3463 | 1.3% | |
8 | 3012 | 1.2% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
T | 10000 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 270000 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
0 | 143575 | 53.2% | |
- | 20000 | 7.4% | |
: | 20000 | 7.4% | |
1 | 19232 | 7.1% | |
9 | 12644 | 4.7% | |
T | 10000 | 3.7% | |
. | 10000 | 3.7% | |
2 | 6641 | 2.5% | |
6 | 6447 | 2.4% | |
7 | 5497 | 2.0% | |
5 | 5250 | 1.9% | |
4 | 4239 | 1.6% | |
3 | 3463 | 1.3% | |
8 | 3012 | 1.1% |
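
The values in this column are 27-character timestamp strings, which is why the report profiles them by length and character rather than as dates. A minimal sketch of extracting the date part with the Python standard library follows. `parse_birth_date` is a hypothetical helper, and the slice assumes every value follows the `YYYY-MM-DDThh:mm:ss.fffffff` layout seen in the common values above:

```python
from datetime import date

def parse_birth_date(raw):
    """Parse the date part of a value such as '1967-06-11T00:00:00.0000000'."""
    # The first 10 characters hold the ISO date (YYYY-MM-DD); the rest is
    # the time-of-day part, which is dropped here.
    return date.fromisoformat(raw[:10])

sample = "1967-06-11T00:00:00.0000000"
parsed = parse_birth_date(sample)
print(len(sample), parsed.isoformat())
```

Slicing avoids depending on how a given parser handles the seven fractional digits.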
MARITAL_STATUS
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
M | 5437 | 54.4% | |
S | 4563 | 45.6% |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Most occurring characters
Value | Count | Frequency (%) | |
M | 5437 | 54.4% | |
S | 4563 | 45.6% |
Most occurring categories
Value | Count | Frequency (%) | |
Uppercase Letter | 10000 | 100.0% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
M | 5437 | 54.4% | |
S | 4563 | 45.6% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 10000 | 100.0% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
M | 5437 | 54.4% | |
S | 4563 | 45.6% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 10000 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
M | 5437 | 54.4% | |
S | 4563 | 45.6% |
SUFFIX
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
null | 9999 | > 99.9% | |
Jr. | 1 | < 0.1% |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9999 |
Min length | 3 |
Most occurring characters
Value | Count | Frequency (%) | |
l | 19998 | 50.0% | |
n | 9999 | 25.0% | |
u | 9999 | 25.0% | |
J | 1 | < 0.1% | |
r | 1 | < 0.1% | |
. | 1 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 39997 | > 99.9% | |
Uppercase Letter | 1 | < 0.1% | |
Other Punctuation | 1 | < 0.1% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
l | 19998 | 50.0% | |
n | 9999 | 25.0% | |
u | 9999 | 25.0% | |
r | 1 | < 0.1% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
J | 1 | 100.0% |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
. | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 39998 | > 99.9% | |
Common | 1 | < 0.1% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
l | 19998 | 50.0% | |
n | 9999 | 25.0% | |
u | 9999 | 25.0% | |
J | 1 | < 0.1% | |
r | 1 | < 0.1% |
Most frequent Common characters
Value | Count | Frequency (%) | |
. | 1 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 39999 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
l | 19998 | 50.0% | |
n | 9999 | 25.0% | |
u | 9999 | 25.0% | |
J | 1 | < 0.1% | |
r | 1 | < 0.1% | |
. | 1 | < 0.1% |
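The category breakdown above can be reproduced from the two SUFFIX values alone. The sketch below is a minimal illustration using Python's standard `unicodedata` module; it is not the profiler's own code, and the `CATEGORY_NAMES` mapping and variable names are hypothetical helpers written for this example. Note that `null` is treated as a literal four-character string, which is consistent with the report showing 0 missing values and a minimum length of 3.

```python
import unicodedata
from collections import Counter

# Frequency table for SUFFIX as shown in the report: value -> row count.
suffix_counts = {"null": 9999, "Jr.": 1}

# Readable names for the general-category codes that occur here.
# Hand-written subset for this example, not taken from pandas-profiling.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Po": "Other Punctuation",
}

category_totals = Counter()
for value, rows in suffix_counts.items():
    for ch in value:
        code = unicodedata.category(ch)  # e.g. "Ll" for "n", "Po" for "."
        # Each character is counted once per row that holds this value.
        category_totals[CATEGORY_NAMES.get(code, code)] += rows

for name, total in category_totals.most_common():
    print(f"{name} | {total}")
```

Weighting each character by the row count avoids expanding the column to 10000 strings, which is enough when only the value counts are available.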
GENDER
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
M | 5112 | 51.1% | |
F | 4888 | 48.9% |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Most occurring characters
Value | Count | Frequency (%) | |
M | 5112 | 51.1% | |
F | 4888 | 48.9% |
Most occurring categories
Value | Count | Frequency (%) | |
Uppercase Letter | 10000 | 100.0% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
M | 5112 | 51.1% | |
F | 4888 | 48.9% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 10000 | 100.0% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
M | 5112 | 51.1% | |
F | 4888 | 48.9% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 10000 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
M | 5112 | 51.1% | |
F | 4888 | 48.9% |
EMAIL_ADDRESS
Categorical
Distinct count | 10000 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
[email protected] (25 rows, one per distinct address; addresses masked in the source) | 1 each | < 0.1% each | |
Other values (9975) | 9975 | 99.8% |
Length
Max length | 33 |
---|---|
Median length | 28 |
Mean length | 27.6572 |
Min length | 22 |
Most occurring characters
Value | Count | Frequency (%) | |
e | 26442 | 9.6% | |
r | 24778 | 9.0% | |
o | 22634 | 8.2% | |
a | 18955 | 6.9% | |
n | 15383 | 5.6% | |
s | 12751 | 4.6% | |
t | 12486 | 4.5% | |
c | 12170 | 4.4% | |
d | 12167 | 4.4% | |
m | 11780 | 4.3% | |
u | 11032 | 4.0% | |
k | 10998 | 4.0% | |
v | 10551 | 3.8% | |
w | 10352 | 3.7% | |
- | 10001 | 3.6% | |
@ | 10000 | 3.6% | |
. | 10000 | 3.6% | |
i | 4633 | 1.7% | |
1 | 4028 | 1.5% | |
l | 4005 | 1.4% | |
2 | 2675 | 1.0% | |
h | 2021 | 0.7% | |
y | 2014 | 0.7% | |
3 | 1992 | 0.7% | |
4 | 1725 | 0.6% | |
Other values (17) | 10999 | 4.0% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 229346 | 82.9% | |
Other Punctuation | 20000 | 7.2% | |
Decimal Number | 17225 | 6.2% | |
Dash Punctuation | 10001 | 3.6% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
e | 26442 | 11.5% | |
r | 24778 | 10.8% | |
o | 22634 | 9.9% | |
a | 18955 | 8.3% | |
n | 15383 | 6.7% | |
s | 12751 | 5.6% | |
t | 12486 | 5.4% | |
c | 12170 | 5.3% | |
d | 12167 | 5.3% | |
m | 11780 | 5.1% | |
u | 11032 | 4.8% | |
k | 10998 | 4.8% | |
v | 10551 | 4.6% | |
w | 10352 | 4.5% | |
i | 4633 | 2.0% | |
l | 4005 | 1.7% | |
h | 2021 | 0.9% | |
y | 2014 | 0.9% | |
j | 1318 | 0.6% | |
b | 1078 | 0.5% | |
g | 785 | 0.3% | |
f | 325 | 0.1% | |
p | 285 | 0.1% | |
x | 212 | 0.1% | |
z | 107 | < 0.1% | |
Other values (4) | 84 | < 0.1% |
Most frequent Decimal Number characters
Value | Count | Frequency (%) | |
1 | 4028 | 23.4% | |
2 | 2675 | 15.5% | |
3 | 1992 | 11.6% | |
4 | 1725 | 10.0% | |
5 | 1375 | 8.0% | |
6 | 1234 | 7.2% | |
0 | 1107 | 6.4% | |
7 | 1093 | 6.3% | |
8 | 1061 | 6.2% | |
9 | 935 | 5.4% |
Most frequent Other Punctuation characters
Value | Count | Frequency (%) | |
@ | 10000 | 50.0% | |
. | 10000 | 50.0% |
Most frequent Dash Punctuation characters
Value | Count | Frequency (%) | |
- | 10001 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 229346 | 82.9% | |
Common | 47226 | 17.1% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
e | 26442 | 11.5% | |
r | 24778 | 10.8% | |
o | 22634 | 9.9% | |
a | 18955 | 8.3% | |
n | 15383 | 6.7% | |
s | 12751 | 5.6% | |
t | 12486 | 5.4% | |
c | 12170 | 5.3% | |
d | 12167 | 5.3% | |
m | 11780 | 5.1% | |
u | 11032 | 4.8% | |
k | 10998 | 4.8% | |
v | 10551 | 4.6% | |
w | 10352 | 4.5% | |
i | 4633 | 2.0% | |
l | 4005 | 1.7% | |
h | 2021 | 0.9% | |
y | 2014 | 0.9% | |
j | 1318 | 0.6% | |
b | 1078 | 0.5% | |
g | 785 | 0.3% | |
f | 325 | 0.1% | |
p | 285 | 0.1% | |
x | 212 | 0.1% | |
z | 107 | < 0.1% | |
Other values (4) | 84 | < 0.1% |
Most frequent Common characters
Value | Count | Frequency (%) | |
- | 10001 | 21.2% | |
@ | 10000 | 21.2% | |
. | 10000 | 21.2% | |
1 | 4028 | 8.5% | |
2 | 2675 | 5.7% | |
3 | 1992 | 4.2% | |
4 | 1725 | 3.7% | |
5 | 1375 | 2.9% | |
6 | 1234 | 2.6% | |
0 | 1107 | 2.3% | |
7 | 1093 | 2.3% | |
8 | 1061 | 2.2% | |
9 | 935 | 2.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 276551 | > 99.9% | |
None | 21 | < 0.1% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
e | 26442 | 9.6% | |
r | 24778 | 9.0% | |
o | 22634 | 8.2% | |
a | 18955 | 6.9% | |
n | 15383 | 5.6% | |
s | 12751 | 4.6% | |
t | 12486 | 4.5% | |
c | 12170 | 4.4% | |
d | 12167 | 4.4% | |
m | 11780 | 4.3% | |
u | 11032 | 4.0% | |
k | 10998 | 4.0% | |
v | 10551 | 3.8% | |
w | 10352 | 3.7% | |
- | 10001 | 3.6% | |
@ | 10000 | 3.6% | |
. | 10000 | 3.6% | |
i | 4633 | 1.7% | |
1 | 4028 | 1.5% | |
l | 4005 | 1.4% | |
2 | 2675 | 1.0% | |
h | 2021 | 0.7% | |
y | 2014 | 0.7% | |
3 | 1992 | 0.7% | |
4 | 1725 | 0.6% | |
Other values (14) | 10978 | 4.0% |
Most frequent None characters
Value | Count | Frequency (%) | |
é | 19 | 90.5% | |
í | 1 | 4.8% | |
ñ | 1 | 4.8% |
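The category, script, and block tables for this variable are three partitions of the same characters, so their totals should agree with each other and with mean length times row count. The following sketch checks that with figures copied from the tables above; the variable names are illustrative only.

```python
# Totals copied from the EMAIL_ADDRESS tables in the report.
n_rows = 10000
mean_length = 27.6572

by_category = {
    "Lowercase Letter": 229346,
    "Other Punctuation": 20000,
    "Decimal Number": 17225,
    "Dash Punctuation": 10001,
}
by_script = {"Latin": 229346, "Common": 47226}
by_block = {"ASCII": 276551, "None": 21}

# Total number of characters across all rows.
total_chars = round(mean_length * n_rows)

subtotals = {
    "category": sum(by_category.values()),
    "script": sum(by_script.values()),
    "block": sum(by_block.values()),
}
for label, subtotal in subtotals.items():
    status = "ok" if subtotal == total_chars else "MISMATCH"
    print(f"{label}: {subtotal} ({status})")
```

A mismatch in any of the three would point to a truncated table or a character the tool failed to classify.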
YEARLY_INCOME
Real number (ℝ≥0)
Distinct count | 16 |
---|---|
Unique (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 56410.0 |
---|---|
Minimum | 10000.0 |
Maximum | 170000.0 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 10000 |
---|---|
5-th percentile | 10000 |
Q1 | 30000 |
median | 60000 |
Q3 | 70000 |
95-th percentile | 120000 |
Maximum | 170000 |
Range | 160000 |
Interquartile range (IQR) | 40000 |
Descriptive statistics
Standard deviation | 32681.47347 |
---|---|
Coefficient of variation (CV) | 0.5793560267 |
Kurtosis | 0.6254303313 |
Mean | 56410 |
Median Absolute Deviation (MAD) | 20000 |
Skewness | 0.8421766808 |
Sum | 564100000 |
Variance | 1068078708 |
Histogram with fixed size bins (bins=10)
Common values
Value | Count | Frequency (%) | |
60000 | 1564 | 15.6% | |
40000 | 1492 | 14.9% | |
30000 | 1301 | 13.0% | |
70000 | 1240 | 12.4% | |
20000 | 1028 | 10.3% | |
80000 | 734 | 7.3% | |
10000 | 690 | 6.9% | |
90000 | 432 | 4.3% | |
50000 | 335 | 3.4% | |
100000 | 310 | 3.1% | |
130000 | 281 | 2.8% | |
110000 | 253 | 2.5% | |
120000 | 174 | 1.7% | |
170000 | 64 | 0.6% | |
160000 | 52 | 0.5% | |
150000 | 50 | 0.5% |
Extreme values (smallest)
Value | Count | Frequency (%) | |
10000 | 690 | 6.9% | |
20000 | 1028 | 10.3% | |
30000 | 1301 | 13.0% | |
40000 | 1492 | 14.9% | |
50000 | 335 | 3.4% | |
60000 | 1564 | 15.6% | |
70000 | 1240 | 12.4% | |
80000 | 734 | 7.3% | |
90000 | 432 | 4.3% | |
100000 | 310 | 3.1% |
Extreme values (largest)
Value | Count | Frequency (%) | |
170000 | 64 | 0.6% | |
160000 | 52 | 0.5% | |
150000 | 50 | 0.5% | |
130000 | 281 | 2.8% | |
120000 | 174 | 1.7% | |
110000 | 253 | 2.5% | |
100000 | 310 | 3.1% | |
90000 | 432 | 4.3% | |
80000 | 734 | 7.3% | |
70000 | 1240 | 12.4% |
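Because YEARLY_INCOME has only 16 distinct values, the descriptive statistics can be recomputed from the value counts listed above. The sketch below does this with the standard library. It assumes the sample variance (divisor n - 1), which is what pandas uses by default and which matches the reported figure; the variable names are illustrative.

```python
import math

# Value counts for YEARLY_INCOME from the report: value -> count.
income_counts = {
    10000: 690, 20000: 1028, 30000: 1301, 40000: 1492,
    50000: 335, 60000: 1564, 70000: 1240, 80000: 734,
    90000: 432, 100000: 310, 110000: 253, 120000: 174,
    130000: 281, 150000: 50, 160000: 52, 170000: 64,
}

n = sum(income_counts.values())
total = sum(value * count for value, count in income_counts.items())
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(
    count * (value - mean) ** 2 for value, count in income_counts.items()
) / (n - 1)
std = math.sqrt(variance)

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

print(f"n={n} sum={total} mean={mean}")
print(f"variance={variance:.0f} std={std:.5f} cv={cv:.10f}")
```

The range (170000 - 10000 = 160000) and the IQR (Q3 - Q1 = 70000 - 30000 = 40000) follow directly from the quantile table.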
TOTAL_CHILDREN
Real number (ℝ≥0)
Distinct count | 6 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.846 |
---|---|
Minimum | 0 |
Maximum | 5 |
Zeros | 2758 |
Zeros (%) | 27.6% |
Memory size | 9.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 2 |
Q3 | 3 |
95-th percentile | 5 |
Maximum | 5 |
Range | 5 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.613426266 |
---|---|
Coefficient of variation (CV) | 0.8740120615 |
Kurtosis | -0.9306013637 |
Mean | 1.846 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.4862396023 |
Sum | 18460 |
Variance | 2.603144314 |
Histogram with fixed size bins (bins=10)
Common values
Value | Count | Frequency (%) | |
0 | 2758 | 27.6% | |
2 | 2049 | 20.5% | |
1 | 2017 | 20.2% | |
4 | 1227 | 12.3% | |
3 | 1154 | 11.5% | |
5 | 795 | 8.0% |
Extreme values (smallest)
Value | Count | Frequency (%) | |
0 | 2758 | 27.6% | |
1 | 2017 | 20.2% | |
2 | 2049 | 20.5% | |
3 | 1154 | 11.5% | |
4 | 1227 | 12.3% | |
5 | 795 | 8.0% |
Extreme values (largest)
Value | Count | Frequency (%) | |
5 | 795 | 8.0% | |
4 | 1227 | 12.3% | |
3 | 1154 | 11.5% | |
2 | 2049 | 20.5% | |
1 | 2017 | 20.2% | |
0 | 2758 | 27.6% |
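The quantile statistics for TOTAL_CHILDREN (the variable with 2758 zeros) can be recovered from the six value counts. The sketch below expands the counts into a sorted list and applies linear interpolation between neighbouring ranks, which is the pandas default. Whether the report used exactly this rule is an assumption, and the `quantile` helper is written for this example.

```python
# Value counts for TOTAL_CHILDREN from the report: value -> count.
children_counts = {0: 2758, 1: 2017, 2: 2049, 3: 1154, 4: 1227, 5: 795}

# Expand to a sorted list of all 10000 observations.
observations = [
    value for value, count in sorted(children_counts.items()) for _ in range(count)
]


def quantile(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (
        sorted_values[upper] - sorted_values[lower]
    )


q1 = quantile(observations, 0.25)
median = quantile(observations, 0.50)
q3 = quantile(observations, 0.75)
p95 = quantile(observations, 0.95)
iqr = q3 - q1

print(q1, median, q3, p95, iqr)
```

With so few distinct values, every quantile lands inside a run of identical observations, so the interpolation never produces a fractional result here.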
NUMBER_CHILDREN_AT_HOME
Real number (ℝ≥0)
Distinct count | 6 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.9978 |
---|---|
Minimum | 0 |
Maximum | 5 |
Zeros | 6021 |
Zeros (%) | 60.2% |
Memory size | 9.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 5 |
Maximum | 5 |
Range | 5 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.517835809 |
---|---|
Coefficient of variation (CV) | 1.52118241 |
Kurtosis | 0.7074371296 |
Mean | 0.9978 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.401347865 |
Sum | 9978 |
Variance | 2.303825543 |
Histogram with fixed size bins (bins=10)
Common values
Value | Count | Frequency (%) | |
0 | 6021 | 60.2% | |
1 | 1364 | 13.6% | |
2 | 844 | 8.4% | |
3 | 669 | 6.7% | |
4 | 591 | 5.9% | |
5 | 511 | 5.1% |
Extreme values (smallest)
Value | Count | Frequency (%) | |
0 | 6021 | 60.2% | |
1 | 1364 | 13.6% | |
2 | 844 | 8.4% | |
3 | 669 | 6.7% | |
4 | 591 | 5.9% | |
5 | 511 | 5.1% |
Extreme values (largest)
Value | Count | Frequency (%) | |
5 | 511 | 5.1% | |
4 | 591 | 5.9% | |
3 | 669 | 6.7% | |
2 | 844 | 8.4% | |
1 | 1364 | 13.6% | |
0 | 6021 | 60.2% |
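A Median Absolute Deviation of 0 for NUMBER_CHILDREN_AT_HOME (the variable with 6021 zeros) can look like an error next to a standard deviation of about 1.52. It is a direct consequence of more than half the rows being zero: the median is 0, and the median distance from 0 is also 0. The sketch below illustrates this from the value counts, assuming MAD means the median of absolute deviations from the median; the names are illustrative.

```python
from statistics import median

# Value counts for NUMBER_CHILDREN_AT_HOME from the report: value -> count.
at_home_counts = {0: 6021, 1: 1364, 2: 844, 3: 669, 4: 591, 5: 511}

# Expand to one entry per row.
observations = [
    value for value, count in at_home_counts.items() for _ in range(count)
]

center = median(observations)
# Median of the absolute deviations from the median.
mad = median(abs(x - center) for x in observations)
zero_share = at_home_counts[0] / len(observations)

print(f"median={center} MAD={mad} zeros={zero_share:.1%}")
```

For zero-heavy columns like this one, the IQR (2 here) is the more informative robust measure of spread.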
ENGLISH_EDUCATION
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Bachelors | 2842 | 28.4% | |
Partial College | 2786 | 27.9% | |
High School | 1745 | 17.4% | |
Graduate Degree | 1732 | 17.3% | |
Partial High School | 895 | 8.9% |
Length
Max length | 19 |
---|---|
Median length | 15 |
Mean length | 12.9548 |
Min length | 9 |
Most occurring characters
Value | Count | Frequency (%) | |
e | 15342 | 11.8% | |
l | 14735 | 11.4% | |
a | 13668 | 10.6% | |
o | 10908 | 8.4% | |
r | 9987 | 7.7% | |
h | 8122 | 6.3% | |
(space) | 8053 | 6.2% | |
g | 7158 | 5.5% | |
i | 6321 | 4.9% | |
c | 5482 | 4.2% | |
t | 5413 | 4.2% | |
P | 3681 | 2.8% | |
B | 2842 | 2.2% | |
s | 2842 | 2.2% | |
C | 2786 | 2.2% | |
H | 2640 | 2.0% | |
S | 2640 | 2.0% | |
G | 1732 | 1.3% | |
d | 1732 | 1.3% | |
u | 1732 | 1.3% | |
D | 1732 | 1.3% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 103442 | 79.8% | |
Uppercase Letter | 18053 | 13.9% | |
Space Separator | 8053 | 6.2% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
P | 3681 | 20.4% | |
B | 2842 | 15.7% | |
C | 2786 | 15.4% | |
H | 2640 | 14.6% | |
S | 2640 | 14.6% | |
G | 1732 | 9.6% | |
D | 1732 | 9.6% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
e | 15342 | 14.8% | |
l | 14735 | 14.2% | |
a | 13668 | 13.2% | |
o | 10908 | 10.5% | |
r | 9987 | 9.7% | |
h | 8122 | 7.9% | |
g | 7158 | 6.9% | |
i | 6321 | 6.1% | |
c | 5482 | 5.3% | |
t | 5413 | 5.2% | |
s | 2842 | 2.7% | |
d | 1732 | 1.7% | |
u | 1732 | 1.7% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 8053 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 121495 | 93.8% | |
Common | 8053 | 6.2% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
e | 15342 | 12.6% | |
l | 14735 | 12.1% | |
a | 13668 | 11.2% | |
o | 10908 | 9.0% | |
r | 9987 | 8.2% | |
h | 8122 | 6.7% | |
g | 7158 | 5.9% | |
i | 6321 | 5.2% | |
c | 5482 | 4.5% | |
t | 5413 | 4.5% | |
P | 3681 | 3.0% | |
B | 2842 | 2.3% | |
s | 2842 | 2.3% | |
C | 2786 | 2.3% | |
H | 2640 | 2.2% | |
S | 2640 | 2.2% | |
G | 1732 | 1.4% | |
d | 1732 | 1.4% | |
u | 1732 | 1.4% | |
D | 1732 | 1.4% |
Most frequent Common characters
Value | Count | Frequency (%) | |
(space) | 8053 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 129548 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
e | 15342 | 11.8% | |
l | 14735 | 11.4% | |
a | 13668 | 10.6% | |
o | 10908 | 8.4% | |
r | 9987 | 7.7% | |
h | 8122 | 6.3% | |
(space) | 8053 | 6.2% | |
g | 7158 | 5.5% | |
i | 6321 | 4.9% | |
c | 5482 | 4.2% | |
t | 5413 | 4.2% | |
P | 3681 | 2.8% | |
B | 2842 | 2.2% | |
s | 2842 | 2.2% | |
C | 2786 | 2.2% | |
H | 2640 | 2.0% | |
S | 2640 | 2.0% | |
G | 1732 | 1.3% | |
d | 1732 | 1.3% | |
u | 1732 | 1.3% | |
D | 1732 | 1.3% |
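The Length statistics for ENGLISH_EDUCATION depend only on the five category labels and their counts. The sketch below recomputes them with the standard library; the variable names are illustrative.

```python
from statistics import median

# Value counts for ENGLISH_EDUCATION from the report: label -> count.
education_counts = {
    "Bachelors": 2842,
    "Partial College": 2786,
    "High School": 1745,
    "Graduate Degree": 1732,
    "Partial High School": 895,
}

# One string length per row, sorted ascending.
lengths = sorted(
    len(label) for label, count in education_counts.items() for _ in range(count)
)

total_chars = sum(lengths)
mean_length = total_chars / len(lengths)
median_length = median(lengths)

print(f"max={max(lengths)} median={median_length} "
      f"mean={mean_length} min={min(lengths)} total={total_chars}")
```

The total character count computed this way (129548) is the same figure the ASCII block table reports, which is a quick check that spaces are included in the lengths.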
SPANISH_EDUCATION
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Licenciatura | 2842 | 28.4% | |
Estudios universitarios (en curso) | 2786 | 27.9% | |
Educación secundaria | 1745 | 17.4% | |
Estudios de postgrado | 1732 | 17.3% | |
Educación secundaria (en curso) | 895 | 8.9% |
Length
Max length | 34 |
---|---|
Median length | 21 |
Mean length | 22.7845 |
Min length | 12 |
Most occurring characters
Value | Count | Frequency (%) | |
i | 23840 | 10.5% | |
s | 22661 | 9.9% | |
u | 19107 | 8.4% | |
a | 18122 | 8.0% | |
c | 17285 | 7.6% | |
r | 16467 | 7.2% | |
(space) | 16252 | 7.1% | |
n | 14589 | 6.4% | |
o | 14449 | 6.3% | |
e | 13681 | 6.0% | |
d | 13262 | 5.8% | |
t | 11878 | 5.2% | |
E | 7158 | 3.1% | |
( | 3681 | 1.6% | |
) | 3681 | 1.6% | |
L | 2842 | 1.2% | |
v | 2786 | 1.2% | |
ó | 2640 | 1.2% | |
p | 1732 | 0.8% | |
g | 1732 | 0.8% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 194231 | 85.2% | |
Space Separator | 16252 | 7.1% | |
Uppercase Letter | 10000 | 4.4% | |
Open Punctuation | 3681 | 1.6% | |
Close Punctuation | 3681 | 1.6% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
E | 7158 | 71.6% | |
L | 2842 | 28.4% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
i | 23840 | 12.3% | |
s | 22661 | 11.7% | |
u | 19107 | 9.8% | |
a | 18122 | 9.3% | |
c | 17285 | 8.9% | |
r | 16467 | 8.5% | |
n | 14589 | 7.5% | |
o | 14449 | 7.4% | |
e | 13681 | 7.0% | |
d | 13262 | 6.8% | |
t | 11878 | 6.1% | |
v | 2786 | 1.4% | |
ó | 2640 | 1.4% | |
p | 1732 | 0.9% | |
g | 1732 | 0.9% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 16252 | 100.0% |
Most frequent Open Punctuation characters
Value | Count | Frequency (%) | |
( | 3681 | 100.0% |
Most frequent Close Punctuation characters
Value | Count | Frequency (%) | |
) | 3681 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 204231 | 89.6% | |
Common | 23614 | 10.4% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
i | 23840 | 11.7% | |
s | 22661 | 11.1% | |
u | 19107 | 9.4% | |
a | 18122 | 8.9% | |
c | 17285 | 8.5% | |
r | 16467 | 8.1% | |
n | 14589 | 7.1% | |
o | 14449 | 7.1% | |
e | 13681 | 6.7% | |
d | 13262 | 6.5% | |
t | 11878 | 5.8% | |
E | 7158 | 3.5% | |
L | 2842 | 1.4% | |
v | 2786 | 1.4% | |
ó | 2640 | 1.3% | |
p | 1732 | 0.8% | |
g | 1732 | 0.8% |
Most frequent Common characters
Value | Count | Frequency (%) | |
(space) | 16252 | 68.8% | |
( | 3681 | 15.6% | |
) | 3681 | 15.6% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 225205 | 98.8% | |
None | 2640 | 1.2% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
i | 23840 | 10.6% | |
s | 22661 | 10.1% | |
u | 19107 | 8.5% | |
a | 18122 | 8.0% | |
c | 17285 | 7.7% | |
r | 16467 | 7.3% | |
(space) | 16252 | 7.2% | |
n | 14589 | 6.5% | |
o | 14449 | 6.4% | |
e | 13681 | 6.1% | |
d | 13262 | 5.9% | |
t | 11878 | 5.3% | |
E | 7158 | 3.2% | |
( | 3681 | 1.6% | |
) | 3681 | 1.6% | |
L | 2842 | 1.3% | |
v | 2786 | 1.2% | |
p | 1732 | 0.8% | |
g | 1732 | 0.8% |
Most frequent None characters
Value | Count | Frequency (%) | |
ó | 2640 | 100.0% |
FRENCH_EDUCATION
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Bac + 4 | 2842 | 28.4% | |
Baccalauréat | 2786 | 27.9% | |
Bac + 2 | 1745 | 17.4% | |
Bac + 3 | 1732 | 17.3% | |
Niveau bac | 895 | 8.9% |
Length
Max length | 12 |
---|---|
Median length | 7 |
Mean length | 8.6615 |
Min length | 7 |
Most occurring characters
Value | Count | Frequency (%) | |
a | 19253 | 22.2% | |
(space) | 13533 | 15.6% | |
c | 12786 | 14.8% | |
B | 9105 | 10.5% | |
+ | 6319 | 7.3% | |
u | 3681 | 4.2% | |
4 | 2842 | 3.3% | |
l | 2786 | 3.2% | |
r | 2786 | 3.2% | |
é | 2786 | 3.2% | |
t | 2786 | 3.2% | |
2 | 1745 | 2.0% | |
3 | 1732 | 2.0% | |
N | 895 | 1.0% | |
i | 895 | 1.0% | |
v | 895 | 1.0% | |
e | 895 | 1.0% | |
b | 895 | 1.0% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 50444 | 58.2% | |
Space Separator | 13533 | 15.6% | |
Uppercase Letter | 10000 | 11.5% | |
Math Symbol | 6319 | 7.3% | |
Decimal Number | 6319 | 7.3% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
B | 9105 | 91.0% | |
N | 895 | 8.9% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
a | 19253 | 38.2% | |
c | 12786 | 25.3% | |
u | 3681 | 7.3% | |
l | 2786 | 5.5% | |
r | 2786 | 5.5% | |
é | 2786 | 5.5% | |
t | 2786 | 5.5% | |
i | 895 | 1.8% | |
v | 895 | 1.8% | |
e | 895 | 1.8% | |
b | 895 | 1.8% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 13533 | 100.0% |
Most frequent Math Symbol characters
Value | Count | Frequency (%) | |
+ | 6319 | 100.0% |
Most frequent Decimal Number characters
Value | Count | Frequency (%) | |
4 | 2842 | 45.0% | |
2 | 1745 | 27.6% | |
3 | 1732 | 27.4% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 60444 | 69.8% | |
Common | 26171 | 30.2% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
a | 19253 | 31.9% | |
c | 12786 | 21.2% | |
B | 9105 | 15.1% | |
u | 3681 | 6.1% | |
l | 2786 | 4.6% | |
r | 2786 | 4.6% | |
é | 2786 | 4.6% | |
t | 2786 | 4.6% | |
N | 895 | 1.5% | |
i | 895 | 1.5% | |
v | 895 | 1.5% | |
e | 895 | 1.5% | |
b | 895 | 1.5% |
Most frequent Common characters
Value | Count | Frequency (%) | |
(space) | 13533 | 51.7% | |
+ | 6319 | 24.1% | |
4 | 2842 | 10.9% | |
2 | 1745 | 6.7% | |
3 | 1732 | 6.6% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 83829 | 96.8% | |
None | 2786 | 3.2% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
a | 19253 | 23.0% | |
(space) | 13533 | 16.1% | |
c | 12786 | 15.3% | |
B | 9105 | 10.9% | |
+ | 6319 | 7.5% | |
u | 3681 | 4.4% | |
4 | 2842 | 3.4% | |
l | 2786 | 3.3% | |
r | 2786 | 3.3% | |
t | 2786 | 3.3% | |
2 | 1745 | 2.1% | |
3 | 1732 | 2.1% | |
N | 895 | 1.1% | |
i | 895 | 1.1% | |
v | 895 | 1.1% | |
e | 895 | 1.1% | |
b | 895 | 1.1% |
Most frequent None characters
Value | Count | Frequency (%) | |
é | 2786 | 100.0% |
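The report files é under a block labelled None. The character is U+00E9, which Unicode places in the Latin-1 Supplement block, so None most likely means the tool did not resolve a block name for it; that reading is an interpretation, not something the report states. The sketch below separates ASCII from non-ASCII characters in the FRENCH_EDUCATION labels and prints the code point and official name of whatever falls outside ASCII. It assumes the stored label uses the precomposed form of é, which is consistent with the reported length of 12 for "Baccalauréat".

```python
import unicodedata

# Value counts for FRENCH_EDUCATION from the report: label -> count.
french_counts = {
    "Bac + 4": 2842,
    "Baccalaur\u00e9at": 2786,  # precomposed U+00E9 (assumed stored form)
    "Bac + 2": 1745,
    "Bac + 3": 1732,
    "Niveau bac": 895,
}

ascii_total = 0
non_ascii = {}
for label, rows in french_counts.items():
    for ch in label:
        if ch.isascii():
            ascii_total += rows
        else:
            # Weight each non-ASCII character by the rows holding this label.
            non_ascii[ch] = non_ascii.get(ch, 0) + rows

print(f"ASCII | {ascii_total}")
for ch, total in non_ascii.items():
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} | {total}")
```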
ENGLISH_OCCUPATION
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Professional | 2824 | 28.2% | |
Skilled Manual | 2340 | 23.4% | |
Clerical | 1729 | 17.3% | |
Management | 1676 | 16.8% | |
Manual | 1431 | 14.3% |
Length
Max length | 14 |
---|---|
Median length | 12 |
Mean length | 10.5826 |
Min length | 6 |
Most occurring characters
Value | Count | Frequency (%) | |
a | 15447 | 14.6% | |
l | 14733 | 13.9% | |
e | 10245 | 9.7% | |
n | 9947 | 9.4% | |
i | 6893 | 6.5% | |
o | 5648 | 5.3% | |
s | 5648 | 5.3% | |
M | 5447 | 5.1% | |
r | 4553 | 4.3% | |
u | 3771 | 3.6% | |
P | 2824 | 2.7% | |
f | 2824 | 2.7% | |
S | 2340 | 2.2% | |
k | 2340 | 2.2% | |
d | 2340 | 2.2% | |
(space) | 2340 | 2.2% | |
C | 1729 | 1.6% | |
c | 1729 | 1.6% | |
g | 1676 | 1.6% | |
m | 1676 | 1.6% | |
t | 1676 | 1.6% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 91146 | 86.1% | |
Uppercase Letter | 12340 | 11.7% | |
Space Separator | 2340 | 2.2% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
M | 5447 | 44.1% | |
P | 2824 | 22.9% | |
S | 2340 | 19.0% | |
C | 1729 | 14.0% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
a | 15447 | 16.9% | |
l | 14733 | 16.2% | |
e | 10245 | 11.2% | |
n | 9947 | 10.9% | |
i | 6893 | 7.6% | |
o | 5648 | 6.2% | |
s | 5648 | 6.2% | |
r | 4553 | 5.0% | |
u | 3771 | 4.1% | |
f | 2824 | 3.1% | |
k | 2340 | 2.6% | |
d | 2340 | 2.6% | |
c | 1729 | 1.9% | |
g | 1676 | 1.8% | |
m | 1676 | 1.8% | |
t | 1676 | 1.8% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 2340 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 103486 | 97.8% | |
Common | 2340 | 2.2% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
a | 15447 | 14.9% | |
l | 14733 | 14.2% | |
e | 10245 | 9.9% | |
n | 9947 | 9.6% | |
i | 6893 | 6.7% | |
o | 5648 | 5.5% | |
s | 5648 | 5.5% | |
M | 5447 | 5.3% | |
r | 4553 | 4.4% | |
u | 3771 | 3.6% | |
P | 2824 | 2.7% | |
f | 2824 | 2.7% | |
S | 2340 | 2.3% | |
k | 2340 | 2.3% | |
d | 2340 | 2.3% | |
C | 1729 | 1.7% | |
c | 1729 | 1.7% | |
g | 1676 | 1.6% | |
m | 1676 | 1.6% | |
t | 1676 | 1.6% |
Most frequent Common characters
Value | Count | Frequency (%) | |
(space) | 2340 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 105826 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
a | 15447 | 14.6% | |
l | 14733 | 13.9% | |
e | 10245 | 9.7% | |
n | 9947 | 9.4% | |
i | 6893 | 6.5% | |
o | 5648 | 5.3% | |
s | 5648 | 5.3% | |
M | 5447 | 5.1% | |
r | 4553 | 4.3% | |
u | 3771 | 3.6% | |
P | 2824 | 2.7% | |
f | 2824 | 2.7% | |
S | 2340 | 2.2% | |
k | 2340 | 2.2% | |
d | 2340 | 2.2% | |
(space) | 2340 | 2.2% | |
C | 1729 | 1.6% | |
c | 1729 | 1.6% | |
g | 1676 | 1.6% | |
m | 1676 | 1.6% | |
t | 1676 | 1.6% |
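The per-character tables for ENGLISH_OCCUPATION follow mechanically from the five labels and their counts. A minimal sketch with `collections.Counter`, weighting each character by the number of rows that carry the label (the names are illustrative, and this is not the profiler's own implementation):

```python
from collections import Counter

# Value counts for ENGLISH_OCCUPATION from the report: label -> count.
occupation_counts = {
    "Professional": 2824,
    "Skilled Manual": 2340,
    "Clerical": 1729,
    "Management": 1676,
    "Manual": 1431,
}

char_totals = Counter()
for label, rows in occupation_counts.items():
    for ch in label:
        char_totals[ch] += rows

grand_total = sum(char_totals.values())

# Five most frequent characters with their share of all characters.
for ch, total in char_totals.most_common(5):
    print(f"{ch!r} | {total} | {100 * total / grand_total:.1f}%")
```

The uppercase M total (5447) combines Management, Manual, and the second word of Skilled Manual, which is why it exceeds the count of any single label.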
SPANISH_OCCUPATION
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Profesional | 2824 | 28.2% | |
Obrero especializado | 2340 | 23.4% | |
Administrativo | 1729 | 17.3% | |
Gestión | 1676 | 16.8% | |
Obrero | 1431 | 14.3% |
Length
Max length | 20 |
---|---|
Median length | 11 |
Mean length | 12.2388 |
Min length | 6 |
Most occurring characters
Value | Count | Frequency (%) | |
i | 14367 | 11.7% | |
o | 13488 | 11.0% | |
e | 12951 | 10.6% | |
r | 12095 | 9.9% | |
a | 9233 | 7.5% | |
s | 8569 | 7.0% | |
n | 6229 | 5.1% | |
l | 5164 | 4.2% | |
t | 5134 | 4.2% | |
d | 4069 | 3.3% | |
O | 3771 | 3.1% | |
b | 3771 | 3.1% | |
P | 2824 | 2.3% | |
f | 2824 | 2.3% | |
(space) | 2340 | 1.9% | |
p | 2340 | 1.9% | |
c | 2340 | 1.9% | |
z | 2340 | 1.9% | |
A | 1729 | 1.4% | |
m | 1729 | 1.4% | |
v | 1729 | 1.4% | |
G | 1676 | 1.4% | |
ó | 1676 | 1.4% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 110048 | 89.9% | |
Uppercase Letter | 10000 | 8.2% | |
Space Separator | 2340 | 1.9% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
O | 3771 | 37.7% | |
P | 2824 | 28.2% | |
A | 1729 | 17.3% | |
G | 1676 | 16.8% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
i | 14367 | 13.1% | |
o | 13488 | 12.3% | |
e | 12951 | 11.8% | |
r | 12095 | 11.0% | |
a | 9233 | 8.4% | |
s | 8569 | 7.8% | |
n | 6229 | 5.7% | |
l | 5164 | 4.7% | |
t | 5134 | 4.7% | |
d | 4069 | 3.7% | |
b | 3771 | 3.4% | |
f | 2824 | 2.6% | |
p | 2340 | 2.1% | |
c | 2340 | 2.1% | |
z | 2340 | 2.1% | |
m | 1729 | 1.6% | |
v | 1729 | 1.6% | |
ó | 1676 | 1.5% |
Most frequent Space Separator characters
Value | Count | Frequency (%) | |
(space) | 2340 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 120048 | 98.1% | |
Common | 2340 | 1.9% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
i | 14367 | 12.0% | |
o | 13488 | 11.2% | |
e | 12951 | 10.8% | |
r | 12095 | 10.1% | |
a | 9233 | 7.7% | |
s | 8569 | 7.1% | |
n | 6229 | 5.2% | |
l | 5164 | 4.3% | |
t | 5134 | 4.3% | |
d | 4069 | 3.4% | |
O | 3771 | 3.1% | |
b | 3771 | 3.1% | |
P | 2824 | 2.4% | |
f | 2824 | 2.4% | |
p | 2340 | 1.9% | |
c | 2340 | 1.9% | |
z | 2340 | 1.9% | |
A | 1729 | 1.4% | |
m | 1729 | 1.4% | |
v | 1729 | 1.4% | |
G | 1676 | 1.4% | |
ó | 1676 | 1.4% |
Most frequent Common characters
Value | Count | Frequency (%) | |
(space) | 2340 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 120712 | 98.6% | |
None | 1676 | 1.4% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
i | 14367 | 11.9% | |
o | 13488 | 11.2% | |
e | 12951 | 10.7% | |
r | 12095 | 10.0% | |
a | 9233 | 7.6% | |
s | 8569 | 7.1% | |
n | 6229 | 5.2% | |
l | 5164 | 4.3% | |
t | 5134 | 4.3% | |
d | 4069 | 3.4% | |
O | 3771 | 3.1% | |
b | 3771 | 3.1% | |
P | 2824 | 2.3% | |
f | 2824 | 2.3% | |
(space) | 2340 | 1.9% | |
p | 2340 | 1.9% | |
c | 2340 | 1.9% | |
z | 2340 | 1.9% | |
A | 1729 | 1.4% | |
m | 1729 | 1.4% | |
v | 1729 | 1.4% | |
G | 1676 | 1.4% |
Most frequent None characters
Value | Count | Frequency (%) | |
ó | 1676 | 100.0% |
FRENCH_OCCUPATION
Categorical
Distinct count | 5 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Value | Count | Frequency (%) | |
Cadre | 2824 | 28.2% | |
Technicien | 2340 | 23.4% | |
Employé | 1729 | 17.3% | |
Direction | 1676 | 16.8% | |
Ouvrier | 1431 | 14.3% |
Length
Max length | 10 |
---|---|
Median length | 7 |
Mean length | 7.4724 |
Min length | 5 |
Most occurring characters
Value | Count | Frequency (%) | |
e | 10611 | 14.2% | |
i | 9463 | 12.7% | |
r | 7362 | 9.9% | |
c | 6356 | 8.5% | |
n | 6356 | 8.5% | |
o | 3405 | 4.6% | |
C | 2824 | 3.8% | |
a | 2824 | 3.8% | |
d | 2824 | 3.8% | |
T | 2340 | 3.1% | |
h | 2340 | 3.1% | |
E | 1729 | 2.3% | |
m | 1729 | 2.3% | |
p | 1729 | 2.3% | |
l | 1729 | 2.3% | |
y | 1729 | 2.3% | |
é | 1729 | 2.3% | |
D | 1676 | 2.2% | |
t | 1676 | 2.2% | |
O | 1431 | 1.9% | |
u | 1431 | 1.9% | |
v | 1431 | 1.9% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 64724 | 86.6% | |
Uppercase Letter | 10000 | 13.4% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
C | 2824 | 28.2% | |
T | 2340 | 23.4% | |
E | 1729 | 17.3% | |
D | 1676 | 16.8% | |
O | 1431 | 14.3% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
e | 10611 | 16.4% | |
i | 9463 | 14.6% | |
r | 7362 | 11.4% | |
c | 6356 | 9.8% | |
n | 6356 | 9.8% | |
o | 3405 | 5.3% | |
a | 2824 | 4.4% | |
d | 2824 | 4.4% | |
h | 2340 | 3.6% | |
m | 1729 | 2.7% | |
p | 1729 | 2.7% | |
l | 1729 | 2.7% | |
y | 1729 | 2.7% | |
é | 1729 | 2.7% | |
t | 1676 | 2.6% | |
u | 1431 | 2.2% | |
v | 1431 | 2.2% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 74724 | 100.0% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
e | 10611 | 14.2% | |
i | 9463 | 12.7% | |
r | 7362 | 9.9% | |
c | 6356 | 8.5% | |
n | 6356 | 8.5% | |
o | 3405 | 4.6% | |
C | 2824 | 3.8% | |
a | 2824 | 3.8% | |
d | 2824 | 3.8% | |
T | 2340 | 3.1% | |
h | 2340 | 3.1% | |
E | 1729 | 2.3% | |
m | 1729 | 2.3% | |
p | 1729 | 2.3% | |
l | 1729 | 2.3% | |
y | 1729 | 2.3% | |
é | 1729 | 2.3% | |
D | 1676 | 2.2% | |
t | 1676 | 2.2% | |
O | 1431 | 1.9% | |
u | 1431 | 1.9% | |
v | 1431 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 72995 | 97.7% | |
None | 1729 | 2.3% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
e | 10611 | 14.5% | |
i | 9463 | 13.0% | |
r | 7362 | 10.1% | |
c | 6356 | 8.7% | |
n | 6356 | 8.7% | |
o | 3405 | 4.7% | |
C | 2824 | 3.9% | |
a | 2824 | 3.9% | |
d | 2824 | 3.9% | |
T | 2340 | 3.2% | |
h | 2340 | 3.2% | |
E | 1729 | 2.4% | |
m | 1729 | 2.4% | |
p | 1729 | 2.4% | |
l | 1729 | 2.4% | |
y | 1729 | 2.4% | |
D | 1676 | 2.3% | |
t | 1676 | 2.3% | |
O | 1431 | 2.0% | |
u | 1431 | 2.0% | |
v | 1431 | 2.0% |
Most frequent None characters
Value | Count | Frequency (%) | |
é | 1729 | 100.0% |
HOUSE_OWNER_FLAG
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.9 KiB |
Value | Count | Frequency (%) | |
1 | 6820 | 68.2% | |
0 | 3180 | 31.8% |