Overview

Dataset info

Number of variables 16
Number of observations 20058
Total Missing (%) 0.0%
Total size in memory 2.3 MiB
Average record size in memory 121.0 B

Variables types

Numeric 5
Categorical 9
Boolean 1
Date 0
Text (Unique) 0
Rejected 1
Unsupported 0

Warnings

  • black_id has a high cardinality: 9331 distinct values Warning
  • id has a high cardinality: 19113 distinct values Warning
  • increment_code has a high cardinality: 400 distinct values Warning
  • last_move_at is highly correlated with created_at (ρ = 1) Rejected
  • moves has a high cardinality: 18920 distinct values Warning
  • opening_eco has a high cardinality: 365 distinct values Warning
  • opening_name has a high cardinality: 1477 distinct values Warning
  • white_id has a high cardinality: 9438 distinct values Warning
  • Dataset has 429 duplicate rows Warning
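
The duplicate-row warning can be illustrated with a short sketch. It runs on hypothetical toy rows, not the real data, and it assumes a duplicate is any row identical in every column to an earlier row; the report does not state how it counts. For context, id has 19113 distinct values across 20058 rows, so 945 rows repeat an id, while 429 rows are flagged as full duplicates.

```python
from collections import Counter

# Hypothetical toy rows (id, winner, turns); the real dataset has 16 columns.
rows = [
    ("XRuQPSzH", "white", 40),
    ("XRuQPSzH", "white", 40),
    ("1b0kpInt", "black", 61),
    ("XRuQPSzH", "white", 40),
    ("j5KY62yS", "draw", 95),
]

counts = Counter(rows)

# Count every repeat beyond the first occurrence of each identical row.
duplicate_rows = sum(n - 1 for n in counts.values())

# Drop the repeats while keeping the first occurrence and the original order.
deduplicated = list(dict.fromkeys(rows))

print(duplicate_rows, len(deduplicated))
```

Whether to drop such rows before analysis depends on why they repeat, which the report alone does not show.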

Variables

black_id
Categorical

Distinct count 9331
Unique (%) 46.5%
Missing (%) 0.0%
Missing (n) 0
Value                Count  Frequency (%)
taranga                 82  0.4%
vladimir-kramnik-1      60  0.3%
a_p_t_e_m_u_u           47  0.2%
king5891                44  0.2%
docboss                 44  0.2%
ducksandcats            41  0.2%
saviter                 38  0.2%
cape217                 38  0.2%
anakgreget              36  0.2%
artem555                34  0.2%
Other values (9321)  19594  97.7%
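
Unique (%) appears to be the distinct count divided by the number of observations. The sketch below checks that reading against figures reported here; the helper name unique_percent is hypothetical and the definition is an assumption, not taken from the tool's documentation.

```python
def unique_percent(distinct, observations):
    """Share of distinct values among all observations, in percent."""
    return 100.0 * distinct / observations

observations = 20058  # "Number of observations" from the overview

# Distinct counts as reported for three high-cardinality columns.
reported = {"black_id": 9331, "id": 19113, "white_id": 9438}

for name, distinct in reported.items():
    print(name, round(unique_percent(distinct, observations), 1))
```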
 

black_rating
Numeric

Distinct count 1521
Unique (%) 7.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1588.8
Minimum 789
Maximum 2723
Zeros (%) 0.0%

Quantile statistics

Minimum 789
5-th percentile 1135
Q1 1391
Median 1562
Q3 1784
95-th percentile 2105.1
Maximum 2723
Range 1934
Interquartile range 393

Descriptive statistics

Standard deviation 291.04
Coef of variation 0.18318
Kurtosis -0.072277
Mean 1588.8
MAD 232.66
Skewness 0.25851
Sum 31868792
Variance 84702
Memory size 156.8 KiB
Value                Count  Frequency (%)
1500                   797  4.0%
1400                    69  0.3%
1501                    53  0.3%
1810                    49  0.2%
1562                    45  0.2%
1466                    42  0.2%
1484                    41  0.2%
1621                    41  0.2%
1802                    41  0.2%
1480                    40  0.2%
Other values (1511)  18840  93.9%

Minimum 5 values

Value  Count  Frequency (%)
789        1  0.0%
791        1  0.0%
795        2  0.0%
796        1  0.0%
800        1  0.0%

Maximum 5 values

Value  Count  Frequency (%)
2571       1  0.0%
2577       1  0.0%
2588       1  0.0%
2621      15  0.1%
2723       1  0.0%
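
The derived figures for black_rating follow from the reported minimum, maximum, quartiles, mean, standard deviation and sum. The sketch below is a small consistency check, assuming the usual definitions (range = maximum - minimum, IQR = Q3 - Q1, coefficient of variation = standard deviation / mean). The inputs are the rounded values shown in the report, so results agree only to the displayed precision.

```python
# Values copied from the black_rating section of the report.
minimum, maximum = 789, 2723
q1, q3 = 1391, 1784
mean, std = 1588.8, 291.04
total, n = 31868792, 20058

value_range = maximum - minimum   # reported as "Range"
iqr = q3 - q1                     # reported as "Interquartile range"
coef_of_variation = std / mean    # reported as "Coef of variation"
mean_from_sum = total / n         # should match the reported mean

print(value_range, iqr, round(coef_of_variation, 5), round(mean_from_sum, 1))
```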
 

created_at
Numeric

Distinct count 13151
Unique (%) 65.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1483600000000
Minimum 1376800000000
Maximum 1504500000000
Zeros (%) 0.0%

Quantile statistics

Minimum 1376800000000
5-th percentile 1411900000000
Q1 1477500000000
Median 1496000000000
Q3 1503200000000
95-th percentile 1504300000000
Maximum 1504500000000
Range 127720000000
Interquartile range 25622000000

Descriptive statistics

Standard deviation 28502000000
Coef of variation 0.019211
Kurtosis 2.3963
Mean 1483600000000
MAD 21167000000
Skewness -1.7826
Sum 2.9758e+16
Variance 8.1234e+20
Memory size 156.8 KiB
Value                 Count  Frequency (%)
1504210000000.0          45  0.2%
1504140000000.0          39  0.2%
1504200000000.0          38  0.2%
1503860000000.0          37  0.2%
1504050000000.0          32  0.2%
1503170000000.0          30  0.1%
1503870000000.0          30  0.1%
1503970000000.0          29  0.1%
1504040000000.0          29  0.1%
1504130000000.0          28  0.1%
Other values (13141)  19721  98.3%

Minimum 5 values

Value            Count  Frequency (%)
1376771633173.0      1  0.0%
1376771868314.0      1  0.0%
1376930287783.0      1  0.0%
1376933025599.0      1  0.0%
1376945789445.0      1  0.0%

Maximum 5 values

Value            Count  Frequency (%)
1504488461297.0      1  0.0%
1504488533111.0      1  0.0%
1504489280676.0      1  0.0%
1504492259427.0      1  0.0%
1504493143790.0      1  0.0%
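
The created_at values have the magnitude of Unix timestamps in milliseconds, although the report treats the column as plain numeric. Assuming that interpretation, the sketch below converts the smallest and largest reported values to UTC datetimes; the helper name ms_to_utc is hypothetical.

```python
from datetime import datetime, timezone

def ms_to_utc(ms):
    """Convert epoch milliseconds to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(ms / 1000, tz=timezone.utc)

# Smallest and largest created_at values listed in the report.
earliest = ms_to_utc(1376771633173)
latest = ms_to_utc(1504493143790)

print(earliest.isoformat())
print(latest.isoformat())
```

Under this assumption, summary statistics such as the mean and standard deviation of created_at are easier to interpret once the column is parsed as a date.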
 

id
Categorical

Distinct count 19113
Unique (%) 95.3%
Missing (%) 0.0%
Missing (n) 0
Value                 Count  Frequency (%)
XRuQPSzH                  5  0.0%
1b0kpInt                  4  0.0%
j5KY62yS                  4  0.0%
DijZlfMy                  4  0.0%
GstYv2mJ                  4  0.0%
t7vvcwqO                  4  0.0%
facMwkUo                  4  0.0%
QurxyQkA                  4  0.0%
dFQ5D7CS                  4  0.0%
dJEtAQp7                  4  0.0%
Other values (19103)  20017  99.8%

increment_code
Categorical

Distinct count 400
Unique (%) 2.0%
Missing (%) 0.0%
Missing (n) 0
Value               Count  Frequency (%)
10+0                 7721  38.5%
15+0                 1311  6.5%
15+15                 850  4.2%
5+5                   738  3.7%
5+8                   697  3.5%
8+0                   588  2.9%
10+5                  579  2.9%
15+10                 461  2.3%
20+0                  448  2.2%
30+0                  375  1.9%
Other values (390)   6290  31.4%
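
With 400 distinct values, increment_code is awkward to use as a category. The codes look like "base+increment" time controls, so one option is to split them into two numbers. The sketch below assumes every code is two non-negative integers joined by "+", and that they mean base minutes and increment seconds; neither assumption is confirmed by the report, and parse_increment_code is a hypothetical helper.

```python
def parse_increment_code(code):
    """Split a code such as '15+10' into (base, increment) integers.

    Assumes exactly one '+' separating two integer fields.
    """
    base, increment = code.split("+")
    return int(base), int(increment)

# The three most frequent codes in the report.
for code in ["10+0", "15+0", "15+15"]:
    print(code, parse_increment_code(code))
```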
 

last_move_at
Highly correlated

This variable is highly correlated with created_at and should be ignored for analysis

Correlation 1
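
A correlation of 1 is plausible here if both columns are timestamps of the same game: start times span years, while a single game lasts minutes. The sketch below illustrates this with a hand-written Pearson coefficient. The start times are taken from the created_at quantiles; the game durations are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Start times (epoch ms) from the created_at section of the report.
created_at = [1376771633173, 1411900000000, 1477500000000,
              1496000000000, 1504493143790]

# Hypothetical game lengths in ms (5 to 20 minutes), not from the dataset.
durations_ms = [600_000, 1_200_000, 300_000, 900_000, 450_000]
last_move_at = [c + d for c, d in zip(created_at, durations_ms)]

r = pearson(created_at, last_move_at)
print(round(r, 6))
```

The difference last_move_at - created_at may carry information that the raw column does not, so deriving a duration before dropping last_move_at could be worth considering.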

moves
Categorical

Distinct count 18920
Unique (%) 94.3%
Missing (%) 0.0%
Missing (n) 0
Value                 Count  Frequency (%)
e4 e5                    27  0.1%
e4 d5                    21  0.1%
d4 d5                    17  0.1%
e4 e5 Nf3                16  0.1%
f4 e6 g4 Qh4#            14  0.1%
e4                       12  0.1%
e4 c5                    10  0.0%
d3 e6                    10  0.0%
e3 e5                     9  0.0%
e4 e6                     9  0.0%
Other values (18910)  19913  99.3%
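
Each moves value is a space-separated list of moves in algebraic notation, so the number of half-moves (plies) can be counted directly. The sketch below assumes whitespace is the only separator and that no move numbers or comments are embedded in the string; count_plies is a hypothetical helper. Whether the turns column equals this count is an assumption the report does not confirm.

```python
def count_plies(moves):
    """Count half-moves in a whitespace-separated move string."""
    return len(moves.split())

# Frequent move sequences taken from the table above.
for moves in ["e4 e5", "e4 e5 Nf3", "f4 e6 g4 Qh4#", "e4"]:
    print(repr(moves), count_plies(moves))
```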
 

opening_eco
Categorical

Distinct count 365
Unique (%) 1.8%
Missing (%) 0.0%
Missing (n) 0
Value               Count  Frequency (%)
A00                  1007  5.0%
C00                   844  4.2%
D00                   739  3.7%
B01                   716  3.6%
C41                   691  3.4%
C20                   675  3.4%
A40                   618  3.1%
B00                   611  3.0%
B20                   567  2.8%
C50                   538  2.7%
Other values (355)  13052  65.1%

opening_name
Categorical

Distinct count 1477
Unique (%) 7.4%
Missing (%) 0.0%
Missing (n) 0
Value                                          Count  Frequency (%)
Van't Kruijs Opening                             368  1.8%
Sicilian Defense                                 358  1.8%
Sicilian Defense: Bowdler Attack                 296  1.5%
Scotch Game                                      271  1.4%
French Defense: Knight Variation                 271  1.4%
Scandinavian Defense: Mieses-Kotroc Variation    259  1.3%
Queen's Pawn Game: Mason Attack                  232  1.2%
Queen's Pawn Game: Chigorin Variation            229  1.1%
Scandinavian Defense                             223  1.1%
Horwitz Defense                                  209  1.0%
Other values (1467)                            17342  86.5%

opening_ply
Numeric

Distinct count 23
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 4.817
Minimum 1
Maximum 28
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 1
Q1 3
Median 4
Q3 6
95-th percentile 10
Maximum 28
Range 27
Interquartile range 3

Descriptive statistics

Standard deviation 2.7972
Coef of variation 0.58069
Kurtosis 3.0897
Mean 4.817
MAD 2.1437
Skewness 1.3346
Sum 96619
Variance 7.8241
Memory size 156.8 KiB
Value              Count  Frequency (%)
3                   3490  17.4%
4                   3308  16.5%
2                   2935  14.6%
5                   2730  13.6%
6                   2020  10.1%
7                   1344  6.7%
8                   1116  5.6%
1                   1097  5.5%
9                    687  3.4%
10                   432  2.2%
Other values (13)    899  4.5%

Minimum 5 values

Value  Count  Frequency (%)
1       1097  5.5%
2       2935  14.6%
3       3490  17.4%
4       3308  16.5%
5       2730  13.6%

Maximum 5 values

Value  Count  Frequency (%)
19        11  0.1%
20         8  0.0%
22         1  0.0%
24         1  0.0%
28         4  0.0%

rated
Boolean

Distinct count 2
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Mean 0.80541
Value  Count  Frequency (%)
True   16155  80.5%
False   3903  19.5%

turns
Numeric

Distinct count 211
Unique (%) 1.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 60.466
Minimum 1
Maximum 349
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 14
Q1 37
Median 55
Q3 79
95-th percentile 124
Maximum 349
Range 348
Interquartile range 42

Descriptive statistics

Standard deviation 33.571
Coef of variation 0.5552
Kurtosis 1.3852
Mean 60.466
MAD 26.12
Skewness 0.89728
Sum 1212827
Variance 1127
Memory size 156.8 KiB
Value               Count  Frequency (%)
53                    303  1.5%
45                    302  1.5%
51                    299  1.5%
57                    297  1.5%
39                    297  1.5%
41                    295  1.5%
43                    293  1.5%
52                    290  1.4%
47                    283  1.4%
54                    283  1.4%
Other values (201)  17116  85.3%

Minimum 5 values

Value  Count  Frequency (%)
1         18  0.1%
2        185  0.9%
3         87  0.4%
4         52  0.3%
5         40  0.2%

Maximum 5 values

Value  Count  Frequency (%)
222        2  0.0%
226        1  0.0%
255        1  0.0%
259        1  0.0%
349        2  0.0%

victory_status
Categorical

Distinct count 4
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value      Count  Frequency (%)
resign     11147  55.6%
mate        6325  31.5%
outoftime   1680  8.4%
draw         906  4.5%

white_id
Categorical

Distinct count 9438
Unique (%) 47.1%
Missing (%) 0.0%
Missing (n) 0
Value                Count  Frequency (%)
taranga                 72  0.4%
chess-brahs             53  0.3%
a_p_t_e_m_u_u           49  0.2%
ssf7                    48  0.2%
bleda                   48  0.2%
hassan1365416           44  0.2%
khelil                  41  0.2%
saviter                 38  0.2%
ozguragarr              38  0.2%
1240100948              38  0.2%
Other values (9428)  19589  97.7%

white_rating
Numeric

Distinct count 1516
Unique (%) 7.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1596.6
Minimum 784
Maximum 2700
Zeros (%) 0.0%

Quantile statistics

Minimum 784
5-th percentile 1144
Q1 1398
Median 1567
Q3 1793
95-th percentile 2111
Maximum 2700
Range 1916
Interquartile range 395

Descriptive statistics

Standard deviation 291.25
Coef of variation 0.18242
Kurtosis 0.0089036
Mean 1596.6
MAD 232.76
Skewness 0.30077
Sum 32025242
Variance 84829
Memory size 156.8 KiB
Value                Count  Frequency (%)
1500                   812  4.0%
1480                    51  0.3%
1400                    48  0.2%
1536                    46  0.2%
1708                    45  0.2%
1501                    44  0.2%
1562                    43  0.2%
1527                    43  0.2%
1383                    42  0.2%
1621                    42  0.2%
Other values (1506)  18842  93.9%

Minimum 5 values

Value  Count  Frequency (%)
784        2  0.0%
788        1  0.0%
793        1  0.0%
795        1  0.0%
798        2  0.0%

Maximum 5 values

Value  Count  Frequency (%)
2617       1  0.0%
2619       2  0.0%
2621      24  0.1%
2622       1  0.0%
2700       1  0.0%

winner
Categorical

Distinct count 3
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value  Count  Frequency (%)
white  10001  49.9%
black   9107  45.4%
draw     950  4.7%

Correlations

Sample

  | id | rated | created_at | last_move_at | turns | victory_status | winner | increment_code | white_id | white_rating | black_id | black_rating | moves | opening_eco | opening_name | opening_ply
0 | TZJHLljE | False | 1.504210e+12 | 1.504210e+12 | 13 | outoftime | white | 15+2 | bourgris | 1500 | a-00 | 1191 | d4 d5 c4 c6 cxd5 e6 dxe6 fxe6 Nf3 Bb4+ Nc3 Ba5... | D10 | Slav Defense: Exchange Variation | 5
1 | l1NXvwaE | True | 1.504130e+12 | 1.504130e+12 | 16 | resign | black | 5+10 | a-00 | 1322 | skinnerua | 1261 | d4 Nc6 e4 e5 f4 f6 dxe5 fxe5 fxe5 Nxe5 Qd4 Nc6... | B00 | Nimzowitsch Defense: Kennedy Variation | 4
2 | mIICvQHh | True | 1.504130e+12 | 1.504130e+12 | 61 | mate | white | 5+10 | ischia | 1496 | a-00 | 1500 | e4 e5 d3 d6 Be3 c6 Be2 b5 Nd2 a5 a4 c5 axb5 Nc... | C20 | King's Pawn Game: Leonardis Variation | 3
3 | kWKvrqYL | True | 1.504110e+12 | 1.504110e+12 | 61 | mate | white | 20+0 | daniamurashov | 1439 | adivanov2009 | 1454 | d4 d5 Nf3 Bf5 Nc3 Nf6 Bf4 Ng4 e3 Nc6 Be2 Qd7 O... | D02 | Queen's Pawn Game: Zukertort Variation | 3
4 | 9tXo1AUZ | True | 1.504030e+12 | 1.504030e+12 | 95 | mate | white | 30+3 | nik221107 | 1523 | adivanov2009 | 1469 | e4 e5 Nf3 d6 d4 Nc6 d5 Nb4 a3 Na6 Nc3 Be7 b4 N... | C41 | Philidor Defense | 5