statistique d'un dataset avec skim et glimpse
11 déc. 2018#stat d'un datag
diamonds %>% skimr::skim()
Skim summary statistics
n obs: 53940
n variables: 10
-- Variable type:factor --------------------------------------------------------
variable missing complete n n_unique top_counts ordered
clarity 0 53940 53940 8 SI1: 13065, VS2: 12258, SI2: 9194, VS1: 8171 TRUE
color 0 53940 53940 7 G: 11292, E: 9797, F: 9542, H: 8304 TRUE
cut 0 53940 53940 5 Ide: 21551, Pre: 13791, Ver: 12082, Goo: 4906 TRUE
-- Variable type:integer -------------------------------------------------------
variable missing complete n mean sd p0 p25 p50 p75 p100 hist
price 0 53940 53940 3932.8 3989.44 326 950 2401 5324.25 18823 ▇▃▂▁▁▁▁▁
-- Variable type:numeric -------------------------------------------------------
variable missing complete n mean sd p0 p25 p50 p75 p100 hist
carat 0 53940 53940 0.8 0.47 0.2 0.4 0.7 1.04 5.01 ▇▅▁▁▁▁▁▁
depth 0 53940 53940 61.75 1.43 43 61 61.8 62.5 79 ▁▁▁▃▇▁▁▁
table 0 53940 53940 57.46 2.23 43 56 57 59 95 ▁▅▇▁▁▁▁▁
x 0 53940 53940 5.73 1.12 0 4.71 5.7 6.54 10.74 ▁▁▁▇▇▃▁▁
y 0 53940 53940 5.73 1.14 0 4.72 5.71 6.54 58.9 ▇▁▁▁▁▁▁▁
z 0 53940 53940 3.54 0.71 0 2.91 3.53 4.04 31.8 ▇▃▁▁▁▁▁▁
glimpse(diamonds)
Observations: 53,940
Variables: 10
$ carat <dbl> 0.23, 0.21, 0.23, 0.29, 0.31, 0.24, 0.24, 0.26, 0.22, 0.23, 0.30, 0.23, 0.22, 0.3...
$ cut <ord> Ideal, Premium, Good, Premium, Good, Very Good, Very Good, Very Good, Fair, Very ...
$ color <ord> E, E, E, I, J, J, I, H, E, H, J, J, F, J, E, E, I, J, J, J, I, E, H, J, J, G, I, ...
$ clarity <ord> SI2, SI1, VS1, VS2, SI2, VVS2, VVS1, SI1, VS2, VS1, SI1, VS1, SI1, SI2, SI2, I1, ...
$ depth <dbl> 61.5, 59.8, 56.9, 62.4, 63.3, 62.8, 62.3, 61.9, 65.1, 59.4, 64.0, 62.8, 60.4, 62....
$ table <dbl> 55, 61, 65, 58, 58, 57, 57, 55, 61, 61, 55, 56, 61, 54, 62, 58, 54, 54, 56, 59, 5...
$ price <int> 326, 326, 327, 334, 335, 336, 336, 337, 337, 338, 339, 340, 342, 344, 345, 345, 3...
$ x <dbl> 3.95, 3.89, 4.05, 4.20, 4.34, 3.94, 3.95, 4.07, 3.87, 4.00, 4.25, 3.93, 3.88, 4.3...
$ y <dbl> 3.98, 3.84, 4.07, 4.23, 4.35, 3.96, 3.98, 4.11, 3.78, 4.05, 4.28, 3.90, 3.84, 4.3...
$ z <dbl> 2.43, 2.31, 2.31, 2.63, 2.75, 2.48, 2.47, 2.53, 2.49, 2.39, 2.73, 2.46, 2.33, 2.7...