Overview

Dataset info

Number of variables16
Number of observations156
Missing cells156 (6.2%)
Duplicate rows0 (0.0%)
Total size in memory19.6 KiB
Average record size in memory128.8 B

Variables types

Numeric5
Categorical6
Boolean1
Date0
URL0
Text (Unique)2
Rejected2
Unsupported0

Warnings

Age has 30 (19.2%) missing values Missing
Cabin has 125 (80.1%) missing values Missing
Parch has 121 (77.6%) zeros Zeros
Pclass is a recoding of ClasseRejected
SibSp has 98 (62.8%) zeros Zeros
Ticket has a high cardinality: 145 distinct values Warning
Unnamed_0 is highly correlated with PassengerId (ρ = 1) Rejected

Variables

Age
Numeric

Distinct count57
Unique (%)36.5%
Missing (%)19.2%
Missing (n)30
Infinite (%)0.0%
Infinite (n)0
Mean28.14150794
Minimum0.83
Maximum71
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.83
5-th percentile4.25
Q119
Median26
Q335
95-th percentile55.375
Maximum71
Range70.17
Interquartile range16

Descriptive statistics

Standard deviation14.61387993
Coef of variation0.5192998172
Kurtosis0.6137465312
Mean28.14150794
MAD11.1428584
Skewness0.7003666519
Sum3545.83
Variance213.5654865
Memory size1.3 KiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
21 8 5.1%
 
29 6 3.8%
 
19 6 3.8%
 
22 6 3.8%
 
24 5 3.2%
 
28 5 3.2%
 
20 4 2.6%
 
38 4 2.6%
 
26 4 2.6%
 
17 3 1.9%
 
Other values (46) 75 48.1%
 
(Missing) 30 19.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0.83 1 0.6%
 
2 3 1.9%
 
3 1 0.6%
 
4 2 1.3%
 
5 1 0.6%
 

Maximum 5 values

ValueCountFrequency (%) 
71 1 0.6%
 
70.5 1 0.6%
 
66 1 0.6%
 
65 1 0.6%
 
59 1 0.6%
 

Cabin
Categorical

Distinct count29
Unique (%)18.6%
Missing (%)80.1%
Missing (n)125
C23 C25 C27
 
2
D26
 
2
C123
 
2
Other values (25)
 
25
(Missing)
125
ValueCountFrequency (%) 
C23 C25 C27 2 1.3%
 
D26 2 1.3%
 
C123 2 1.3%
 
B86 1 0.6%
 
D47 1 0.6%
 
C2 1 0.6%
 
E101 1 0.6%
 
A5 1 0.6%
 
G6 1 0.6%
 
F E69 1 0.6%
 
Other values (18) 18 11.5%
 
(Missing) 125 80.1%
 
Max length11
Mean length3.179487179
Min length2
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

CLASS2
Categorical

Distinct count2
Unique (%)1.3%
Missing (%)0.0%
Missing (n)0
Bassa
126
Alta
30
ValueCountFrequency (%) 
Bassa 126 80.8%
 
Alta 30 19.2%
 
Max length5
Mean length4.807692308
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Classe
Categorical

Distinct count3
Unique (%)1.9%
Missing (%)0.0%
Missing (n)0
TERZA
96
SECONDA
30
PRIMA
30
ValueCountFrequency (%) 
TERZA 96 61.5%
 
SECONDA 30 19.2%
 
PRIMA 30 19.2%
 
Max length7
Mean length5.384615385
Min length5
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Embarked
Categorical

Distinct count4
Unique (%)2.6%
Missing (%)0.6%
Missing (n)1
S
110
C
32
Q
 
13
(Missing)
 
1
ValueCountFrequency (%) 
S 110 70.5%
 
C 32 20.5%
 
Q 13 8.3%
 
(Missing) 1 0.6%
 
Max length3
Mean length1.012820513
Min length1
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Fare
Numeric

Distinct count93
Unique (%)59.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean28.10958718
Minimum6.75
Maximum263
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum6.75
5-th percentile7.2292
Q18.00315
Median14.4542
Q330.37185
95-th percentile77.765625
Maximum263
Range256.25
Interquartile range22.3687

Descriptive statistics

Standard deviation39.4010467
Coef of variation1.401694249
Kurtosis21.10998543
Mean28.10958718
MAD22.61971348
Skewness4.17640798
Sum4385.0956
Variance1552.442481
Memory size1.3 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 6.75 7.7 8.10415 15.925 36.125 82.8229 263. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
8.05 12 7.7%
 
7.8958 8 5.1%
 
10.5 5 3.2%
 
13 5 3.2%
 
26 5 3.2%
 
7.75 5 3.2%
 
7.925 4 2.6%
 
7.2292 3 1.9%
 
77.2875 2 1.3%
 
7.225 2 1.3%
 
Other values (83) 105 67.3%
 

Minimum 5 values

ValueCountFrequency (%) 
6.75 1 0.6%
 
6.975 1 0.6%
 
7.05 1 0.6%
 
7.1417 1 0.6%
 
7.225 2 1.3%
 

Maximum 5 values

ValueCountFrequency (%) 
263 2 1.3%
 
247.5208 1 0.6%
 
146.5208 1 0.6%
 
83.475 1 0.6%
 
82.1708 1 0.6%
 

Name
Categorical, Unique

First 5 values
Ahlin, Mrs. Johan (Johanna Persdotter Larsson)
Allen, Mr. William Henry
Andersson, Miss. Ellis Anna Maria
Andersson, Miss. Erna Alexandra
Andersson, Mr. Anders Johan
Last 5 values
Williams, Mr. Charles Duane
Williams, Mr. Charles Eugene
Woolner, Mr. Hugh
Zabour, Miss. Hileni
van Billiard, Mr. Austin Blyler

First 5 values

ValueCountFrequency (%) 
Ahlin, Mrs. Johan (Johanna Persdotter Larsson) 1 0.6%
 
Allen, Mr. William Henry 1 0.6%
 
Andersson, Miss. Ellis Anna Maria 1 0.6%
 
Andersson, Miss. Erna Alexandra 1 0.6%
 
Andersson, Mr. Anders Johan 1 0.6%
 

Last 5 values

ValueCountFrequency (%) 
van Billiard, Mr. Austin Blyler 1 0.6%
 
Zabour, Miss. Hileni 1 0.6%
 
Woolner, Mr. Hugh 1 0.6%
 
Williams, Mr. Charles Eugene 1 0.6%
 
Williams, Mr. Charles Duane 1 0.6%
 

Parch
Numeric

Distinct count5
Unique (%)3.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.3974358974
Minimum0
Maximum5
Zeros (%)77.6%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile2
Maximum5
Range5
Interquartile range0

Descriptive statistics

Standard deviation0.8701463516
Coef of variation2.189400498
Kurtosis9.346661872
Mean0.3974358974
MAD0.6165351742
Skewness2.760421073
Sum62
Variance0.7571546733
Memory size1.3 KiB
Histogram
Histogram with fixed size bins (bins=5)
Histogram
Histogram with variable size bins (bins=[0. 0.5 2.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 121 77.6%
 
2 17 10.9%
 
1 15 9.6%
 
5 2 1.3%
 
3 1 0.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0 121 77.6%
 
1 15 9.6%
 
2 17 10.9%
 
3 1 0.6%
 
5 2 1.3%
 

Maximum 5 values

ValueCountFrequency (%) 
5 2 1.3%
 
3 1 0.6%
 
2 17 10.9%
 
1 15 9.6%
 
0 121 77.6%
 

PassengerId
Numeric

Distinct count156
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean78.5
Minimum1
Maximum156
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile8.75
Q139.75
Median78.5
Q3117.25
95-th percentile148.25
Maximum156
Range155
Interquartile range77.5

Descriptive statistics

Standard deviation45.17742799
Coef of variation0.5755086368
Kurtosis-1.2
Mean78.5
MAD39
Skewness0
Sum12246
Variance2041
Memory size1.3 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 1. 156.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
156 1 0.6%
 
49 1 0.6%
 
56 1 0.6%
 
55 1 0.6%
 
54 1 0.6%
 
53 1 0.6%
 
52 1 0.6%
 
51 1 0.6%
 
50 1 0.6%
 
48 1 0.6%
 
Other values (146) 146 93.6%
 

Minimum 5 values

ValueCountFrequency (%) 
1 1 0.6%
 
2 1 0.6%
 
3 1 0.6%
 
4 1 0.6%
 
5 1 0.6%
 

Maximum 5 values

ValueCountFrequency (%) 
156 1 0.6%
 
155 1 0.6%
 
154 1 0.6%
 
153 1 0.6%
 
152 1 0.6%
 

Pclass
Recoded

This variable is a recoding of Classe and should be ignored for analysis

Sex
Categorical

Distinct count2
Unique (%)1.3%
Missing (%)0.0%
Missing (n)0
male
100
female
56
ValueCountFrequency (%) 
male 100 64.1%
 
female 56 35.9%
 
Max length6
Mean length4.717948718
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

SibSp
Numeric

Distinct count6
Unique (%)3.8%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.6153846154
Minimum0
Maximum5
Zeros (%)62.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q31
95-th percentile3
Maximum5
Range5
Interquartile range1

Descriptive statistics

Standard deviation1.056235179
Coef of variation1.716382167
Kurtosis4.996881448
Mean0.6153846154
MAD0.7731755424
Skewness2.220684855
Sum96
Variance1.115632754
Memory size1.3 KiB
Histogram
Histogram with fixed size bins (bins=6)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 98 62.8%
 
1 40 25.6%
 
3 6 3.8%
 
2 6 3.8%
 
4 4 2.6%
 
5 2 1.3%
 

Minimum 5 values

ValueCountFrequency (%) 
0 98 62.8%
 
1 40 25.6%
 
2 6 3.8%
 
3 6 3.8%
 
4 4 2.6%
 

Maximum 5 values

ValueCountFrequency (%) 
5 2 1.3%
 
4 4 2.6%
 
3 6 3.8%
 
2 6 3.8%
 
1 40 25.6%
 

Survived
Boolean

Distinct count2
Unique (%)1.3%
Missing (%)0.0%
Missing (n)0
0
102
1
54
ValueCountFrequency (%) 
0 102 65.4%
 
1 54 34.6%
 

Ticket
Categorical

Distinct count145
Unique (%)92.9%
Missing (%)0.0%
Missing (n)0
19950
 
2
347082
 
2
35281
 
2
Other values (142)
150
ValueCountFrequency (%) 
19950 2 1.3%
 
347082 2 1.3%
 
35281 2 1.3%
 
S.O.C. 14879 2 1.3%
 
11668 2 1.3%
 
237736 2 1.3%
 
2651 2 1.3%
 
113803 2 1.3%
 
349909 2 1.3%
 
CA 2144 2 1.3%
 
Other values (135) 136 87.2%
 
Max length18
Mean length6.961538462
Min length4
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

try
Categorical, Unique

First 5 values
Ahlin, Mrs. Johan (Johanna Persdotter Larsson)female
Allen, Mr. William Henrymale
Andersson, Miss. Ellis Anna Mariafemale
Andersson, Miss. Erna Alexandrafemale
Andersson, Mr. Anders Johanmale
Last 5 values
Williams, Mr. Charles Duanemale
Williams, Mr. Charles Eugenemale
Woolner, Mr. Hughmale
Zabour, Miss. Hilenifemale
van Billiard, Mr. Austin Blylermale

First 5 values

ValueCountFrequency (%) 
Ahlin, Mrs. Johan (Johanna Persdotter Larsson)female 1 0.6%
 
Allen, Mr. William Henrymale 1 0.6%
 
Andersson, Miss. Ellis Anna Mariafemale 1 0.6%
 
Andersson, Miss. Erna Alexandrafemale 1 0.6%
 
Andersson, Mr. Anders Johanmale 1 0.6%
 

Last 5 values

ValueCountFrequency (%) 
van Billiard, Mr. Austin Blylermale 1 0.6%
 
Zabour, Miss. Hilenifemale 1 0.6%
 
Woolner, Mr. Hughmale 1 0.6%
 
Williams, Mr. Charles Eugenemale 1 0.6%
 
Williams, Mr. Charles Duanemale 1 0.6%
 

Unnamed_0
Highly correlated

This variable is highly correlated with PassengerId and should be ignored for analysis

Correlation1

Correlations

Missing values

Sample

First rows

AgeCabinCLASS2ClasseEmbarkedFareNameParchPassengerIdPclassSexSibSpSurvivedTickettryUnnamed_0
022.0NaNBassaTERZAS7.2500Braund, Mr. Owen Harris013male10A/5 21171Braund, Mr. Owen Harrismale0
138.0C85AltaPRIMAC71.2833Cumings, Mrs. John Bradley (Florence Briggs Thayer)021female11PC 17599Cumings, Mrs. John Bradley (Florence Briggs Thayer)female1
226.0NaNBassaTERZAS7.9250Heikkinen, Miss. Laina033female01STON/O2. 3101282Heikkinen, Miss. Lainafemale2
335.0C123AltaPRIMAS53.1000Futrelle, Mrs. Jacques Heath (Lily May Peel)041female11113803Futrelle, Mrs. Jacques Heath (Lily May Peel)female3
435.0NaNBassaTERZAS8.0500Allen, Mr. William Henry053male00373450Allen, Mr. William Henrymale4
5NaNNaNBassaTERZAQ8.4583Moran, Mr. James063male00330877Moran, Mr. Jamesmale5
654.0E46AltaPRIMAS51.8625McCarthy, Mr. Timothy J071male0017463McCarthy, Mr. Timothy Jmale6
72.0NaNBassaTERZAS21.0750Palsson, Master. Gosta Leonard183male30349909Palsson, Master. Gosta Leonardmale7
827.0NaNBassaTERZAS11.1333Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg)293female01347742Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg)female8
914.0NaNBassaSECONDAC30.0708Nasser, Mrs. Nicholas (Adele Achem)0102female11237736Nasser, Mrs. Nicholas (Adele Achem)female9

Last rows

AgeCabinCLASS2ClasseEmbarkedFareNameParchPassengerIdPclassSexSibSpSurvivedTickettryUnnamed_0
14627.0NaNBassaTERZAS7.7958Andersson, Mr. August Edvard ("Wennerstrom")01473male01350043Andersson, Mr. August Edvard ("Wennerstrom")male146
1479.0NaNBassaTERZAS34.3750Ford, Miss. Robina Maggie "Ruby"21483female20W./C. 6608Ford, Miss. Robina Maggie "Ruby"female147
14836.5F2BassaSECONDAS26.0000Navratil, Mr. Michel ("Louis M Hoffman")21492male00230080Navratil, Mr. Michel ("Louis M Hoffman")male148
14942.0NaNBassaSECONDAS13.0000Byles, Rev. Thomas Roussel Davids01502male00244310Byles, Rev. Thomas Roussel Davidsmale149
15051.0NaNBassaSECONDAS12.5250Bateman, Rev. Robert James01512male00S.O.P. 1166Bateman, Rev. Robert Jamesmale150
15122.0C2AltaPRIMAS66.6000Pears, Mrs. Thomas (Edith Wearne)01521female11113776Pears, Mrs. Thomas (Edith Wearne)female151
15255.5NaNBassaTERZAS8.0500Meo, Mr. Alfonzo01533male00A.5. 11206Meo, Mr. Alfonzomale152
15340.5NaNBassaTERZAS14.5000van Billiard, Mr. Austin Blyler21543male00A/5. 851van Billiard, Mr. Austin Blylermale153
154NaNNaNBassaTERZAS7.3125Olsen, Mr. Ole Martin01553male00Fa 265302Olsen, Mr. Ole Martinmale154
15551.0NaNAltaPRIMAC61.3792Williams, Mr. Charles Duane11561male00PC 17597Williams, Mr. Charles Duanemale155