Overview
Dataset statistics
| Number of variables | 5 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 39.2 KiB |
| Average record size in memory | 40.1 B |
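The average record size can be checked from the two memory figures above. The following is a minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes and that the missing-cell percentage is taken over all cells (rows times columns); the variable names are illustrative and not part of the report.

```python
# Values copied from the "Dataset statistics" table.
total_size_kib = 39.2     # Total size in memory
n_observations = 1000     # Number of observations
n_variables = 5           # Number of variables
missing_cells = 0         # Missing cells

# Assumption: KiB means 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

# Assumption: the percentage is relative to all cells (rows x columns).
missing_pct = 100 * missing_cells / (n_observations * n_variables)

print(f"{avg_record_bytes:.1f} B")  # 40.1 B
print(f"{missing_pct:.1f}%")        # 0.0%
```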
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 1 |
Dataset
| Description | This profiling report was generated for the DataCamp learning resources. |
|---|---|
| Author | JR |
| URL | https://data.gov/ |
| Copyright | (c) JR_DataCamp, Inc. 2024 |
Reproduction
| Analysis started | 2024-10-29 11:48:19.635833 |
|---|---|
| Analysis finished | 2024-10-29 11:48:21.128788 |
| Duration | 1.49 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
Name
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Larry Ellison | 67 |
|---|---|
| Amancio Ortega | 65 |
| Rob Walton | 62 |
| Elon Musk | 58 |
| Michael Bloomberg | 57 |
| Other values (15) | 691 |
Length
| Max length | 28 |
|---|---|
| Median length | 15 |
| Mean length | 12.776 |
| Min length | 9 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
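The column reports 20 distinct values but 0 unique ones. The sketch below illustrates the difference, assuming "Distinct" counts the different values present and "Unique" counts the values that occur exactly once. The sample list is hypothetical and not taken from the profiled dataset.

```python
from collections import Counter

# Hypothetical mini-column, for illustration only.
names = ["Rob Walton", "Sergey Brin", "Rob Walton", "Jim Walton", "Sergey Brin"]

counts = Counter(names)

# Distinct: how many different values appear at all.
n_distinct = len(counts)

# Unique (assumed definition): values that appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)  # 3 1
```

Under that definition, a column of 1000 rows drawn from 20 repeating names has no value that appears only once, which matches the 0 reported here.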
Sample
| 1st row | Rob Walton |
|---|---|
| 2nd row | Sergey Brin |
| 3rd row | Steve Ballmer |
| 4th row | Mukesh Ambani |
| 5th row | Jim Walton |
Common Values
| Value | Count | Frequency (%) |
| Larry Ellison | 67 | 6.7% |
| Amancio Ortega | 65 | 6.5% |
| Rob Walton | 62 | 6.2% |
| Elon Musk | 58 | 5.8% |
| Michael Bloomberg | 57 | 5.7% |
| Alice Walton | 54 | 5.4% |
| Bill Gates | 54 | 5.4% |
| Mukesh Ambani | 52 | 5.2% |
| Charles Koch | 51 | 5.1% |
| Sergey Brin | 51 | 5.1% |
| Other values (10) | 429 | 42.9% |
Length
Histogram of lengths of the category
Words
| Value | Count | Frequency (%) |
| walton | 163 | 8.0% |
| larry | 109 | 5.3% |
| koch | 93 | 4.5% |
| ellison | 67 | 3.3% |
| ortega | 65 | 3.2% |
| amancio | 65 | 3.2% |
| rob | 62 | 3.0% |
| elon | 58 | 2.8% |
| musk | 58 | 2.8% |
| bloomberg | 57 | 2.8% |
| Other values (27) | 1248 | 61.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1164 | 9.1% |
| r | 1047 | 8.2% |
| (space) | 1045 | 8.2% |
| a | 997 | 7.8% |
| l | 894 | 7.0% |
| o | 801 | 6.3% |
| n | 671 | 5.3% |
| t | 590 | 4.6% |
| i | 584 | 4.6% |
| s | 461 | 3.6% |
| Other values (29) | 4522 | 35.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12776 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12776 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12776 |
Country
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| USA | 756 |
|---|---|
| France | 92 |
| Mexico | 52 |
| Spain | 51 |
| India | 49 |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.632 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mexico |
|---|---|
| 2nd row | USA |
| 3rd row | USA |
| 4th row | USA |
| 5th row | USA |
Common Values
| Value | Count | Frequency (%) |
| USA | 756 | 75.6% |
| France | 92 | 9.2% |
| Mexico | 52 | 5.2% |
| Spain | 51 | 5.1% |
| India | 49 | 4.9% |
Length
Histogram of lengths of the category
Common Values (Plot)
Words
| Value | Count | Frequency (%) |
| usa | 756 | 75.6% |
| france | 92 | 9.2% |
| mexico | 52 | 5.2% |
| spain | 51 | 5.1% |
| india | 49 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 807 | 22.2% |
| U | 756 | 20.8% |
| A | 756 | 20.8% |
| a | 192 | 5.3% |
| n | 192 | 5.3% |
| i | 152 | 4.2% |
| c | 144 | 4.0% |
| e | 144 | 4.0% |
| F | 92 | 2.5% |
| r | 92 | 2.5% |
| Other values (6) | 305 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3632 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3632 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3632 |
Industry
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Technology | 350 |
|---|---|
| Retail | 217 |
| Manufacturing | 94 |
| Media | 57 |
| Cosmetics | 51 |
| Other values (5) | 231 |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 9.376 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Finance |
|---|---|
| 2nd row | Automotive |
| 3rd row | Manufacturing |
| 4th row | Technology |
| 5th row | Fashion |
Common Values
| Value | Count | Frequency (%) |
| Technology | 350 | 35.0% |
| Retail | 217 | 21.7% |
| Manufacturing | 94 | 9.4% |
| Media | 57 | 5.7% |
| Cosmetics | 51 | 5.1% |
| Telecommunications | 51 | 5.1% |
| Finance | 50 | 5.0% |
| Fashion | 44 | 4.4% |
| Automotive | 43 | 4.3% |
| Petrochemicals | 43 | 4.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
Words
| Value | Count | Frequency (%) |
| technology | 350 | 35.0% |
| retail | 217 | 21.7% |
| manufacturing | 94 | 9.4% |
| media | 57 | 5.7% |
| cosmetics | 51 | 5.1% |
| telecommunications | 51 | 5.1% |
| finance | 50 | 5.0% |
| fashion | 44 | 4.4% |
| automotive | 43 | 4.3% |
| petrochemicals | 43 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1026 | 10.9% |
| e | 956 | 10.2% |
| n | 784 | 8.4% |
| c | 733 | 7.8% |
| i | 701 | 7.5% |
| l | 661 | 7.0% |
| a | 650 | 6.9% |
| t | 542 | 5.8% |
| g | 444 | 4.7% |
| h | 437 | 4.7% |
| Other values (15) | 2442 | 26.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9376 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9376 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9376 |
Net Worth (in billions)
Real number (ℝ)
| Distinct | 982 |
|---|---|
| Distinct (%) | 98.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.61627 |
| Minimum | 1.57 |
| Maximum | 199.24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1.57 |
|---|---|
| 5-th percentile | 12.3585 |
| Q1 | 54.96 |
| median | 103.365 |
| Q3 | 151.9125 |
| 95-th percentile | 189.5755 |
| Maximum | 199.24 |
| Range | 197.67 |
| Interquartile range (IQR) | 96.9525 |
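The last two rows of this table follow from the others: the range is the maximum minus the minimum, and the IQR is Q3 minus Q1. A minimal sketch of the check, using the values printed above (variable names are illustrative):

```python
# Values copied from the "Quantile statistics" table.
minimum, maximum = 1.57, 199.24
q1, q3 = 54.96, 151.9125

value_range = maximum - minimum   # spread of the full data
iqr = q3 - q1                     # spread of the middle 50%

print(f"{value_range:.2f}")  # 197.67
print(f"{iqr:.4f}")          # 96.9525
```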
Descriptive statistics
| Standard deviation | 56.796062 |
|---|---|
| Coefficient of variation (CV) | 0.55348008 |
| Kurtosis | -1.1691633 |
| Mean | 102.61627 |
| Median Absolute Deviation (MAD) | 48.43 |
| Skewness | -0.0083658018 |
| Sum | 102616.27 |
| Variance | 3225.7926 |
| Monotonicity | Not monotonic |
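Several of these statistics are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the square of the standard deviation, and the sum is the mean times the number of observations. The sketch below checks those relations against the rounded values shown in the report, so small differences in the last digits are expected.

```python
# Values copied from the report (already rounded for display).
std = 56.796062
mean = 102.61627
n_observations = 1000

cv = std / mean                # coefficient of variation
variance = std ** 2            # variance from the standard deviation
total = mean * n_observations  # sum from the mean

print(f"{cv:.4f}")        # 0.5535
print(f"{variance:.2f}")  # 3225.79
print(f"{total:.2f}")     # 102616.27
```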
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
| 116.42 | 3 | 0.3% |
| 98.27 | 2 | 0.2% |
| 192.11 | 2 | 0.2% |
| 185.43 | 2 | 0.2% |
| 65.74 | 2 | 0.2% |
| 40.2 | 2 | 0.2% |
| 105.84 | 2 | 0.2% |
| 78.03 | 2 | 0.2% |
| 179.62 | 2 | 0.2% |
| 167.65 | 2 | 0.2% |
| Other values (972) | 979 | 97.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1.57 | 1 | 0.1% |
| 1.86 | 1 | 0.1% |
| 2.07 | 1 | 0.1% |
| 2.41 | 1 | 0.1% |
| 2.49 | 1 | 0.1% |
| 2.77 | 1 | 0.1% |
| 2.92 | 1 | 0.1% |
| 2.99 | 1 | 0.1% |
| 3.18 | 1 | 0.1% |
| 3.21 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 199.24 | 1 | 0.1% |
| 199.21 | 1 | 0.1% |
| 199.2 | 1 | 0.1% |
| 199.1 | 1 | 0.1% |
| 199 | 1 | 0.1% |
| 198.77 | 1 | 0.1% |
| 198.34 | 1 | 0.1% |
| 198.05 | 2 | 0.2% |
| 197.6 | 1 | 0.1% |
| 197.41 | 1 | 0.1% |
Company
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Walmart | 160 |
|---|---|
|  | 101 |
| Microsoft | 101 |
| Koch Industries | 99 |
| LVMH | 57 |
| Other values (10) | 482 |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 9.021 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Walmart |
|---|---|
| 2nd row | |
| 3rd row | Koch Industries |
| 4th row | |
| 5th row | Walmart |
Common Values
| Value | Count | Frequency (%) |
| Walmart | 160 | 16.0% |
|  | 101 | 10.1% |
| Microsoft | 101 | 10.1% |
| Koch Industries | 99 | 9.9% |
| LVMH | 57 | 5.7% |
| Reliance Industries | 56 | 5.6% |
| L'Oreal | 54 | 5.4% |
| Zara | 51 | 5.1% |
| Grupo Carso | 49 | 4.9% |
|  | 49 | 4.9% |
| Other values (5) | 223 | 22.3% |
Length
Histogram of lengths of the category
Words
| Value | Count | Frequency (%) |
| walmart | 160 | 12.4% |
| industries | 155 | 12.0% |
| microsoft | 101 | 7.8% |
|  | 101 | 7.8% |
| koch | 99 | 7.7% |
| lvmh | 57 | 4.4% |
| reliance | 56 | 4.3% |
| l'oreal | 54 | 4.2% |
| zara | 51 | 4.0% |
|  | 49 | 3.8% |
| Other values (9) | 408 | 31.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 907 | 10.1% |
| o | 822 | 9.1% |
| r | 800 | 8.9% |
| e | 698 | 7.7% |
| s | 553 | 6.1% |
| l | 504 | 5.6% |
| t | 463 | 5.1% |
| i | 359 | 4.0% |
| c | 352 | 3.9% |
| (space) | 291 | 3.2% |
| Other values (31) | 3272 | 36.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9021 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9021 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9021 |
Interactions
Correlations
|  | Company | Country | Industry | Name | Net Worth (in billions) |
|---|---|---|---|---|---|
| Company | 1.000 | 0.044 | 0.000 | 0.000 | 0.039 |
| Country | 0.044 | 1.000 | 0.024 | 0.000 | 0.000 |
| Industry | 0.000 | 0.024 | 1.000 | 0.000 | 0.027 |
| Name | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
| Net Worth (in billions) | 0.039 | 0.000 | 0.027 | 0.000 | 1.000 |
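The report does not state which association measure fills this matrix. One common choice for pairs of categorical columns is Cramér's V, which ranges from 0 (no association) to 1 (one column determines the other). The sketch below computes the uncorrected version on hypothetical toy columns; it illustrates the idea and is not a reproduction of the values above.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Cramér's V (no bias correction) for two equal-length categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    observed = Counter(zip(xs, ys))  # missing pairs count as 0

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            chi2 += (observed[(r, c)] - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Hypothetical toy columns, not rows from the profiled dataset.
country = ["USA", "USA", "Spain", "Spain"]
company = ["Walmart", "Walmart", "Zara", "Zara"]
industry = ["Retail", "Media", "Retail", "Media"]

print(cramers_v(country, company))   # 1.0 (company is fully determined by country)
print(cramers_v(country, industry))  # 0.0 (no association)
```

Read this way, off-diagonal values close to 0, as in the matrix above, indicate that the columns carry almost no information about one another.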
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
Sample
First rows
|  | Name | Country | Industry | Net Worth (in billions) | Company |
|---|---|---|---|---|---|
| 0 | Rob Walton | Mexico | Finance | 8.50 | Walmart |
| 1 | Sergey Brin | USA | Automotive | 44.76 | |
| 2 | Steve Ballmer | USA | Manufacturing | 13.43 | Koch Industries |
| 3 | Mukesh Ambani | USA | Technology | 120.44 | |
| 4 | Jim Walton | USA | Fashion | 122.39 | Walmart |
| 5 | Sergey Brin | USA | Technology | 93.19 | Walmart |
| 6 | Michael Bloomberg | USA | Cosmetics | 117.96 | Reliance Industries |
| 7 | Warren Buffett | France | Retail | 36.62 | Microsoft |
| 8 | Carlos Slim | USA | Technology | 97.35 | Reliance Industries |
| 9 | Larry Page | USA | Technology | 88.05 | Walmart |
Last rows
|  | Name | Country | Industry | Net Worth (in billions) | Company |
|---|---|---|---|---|---|
| 990 | Charles Koch | USA | Retail | 93.70 | Walmart |
| 991 | Jim Walton | USA | Fashion | 9.18 | L'Oreal |
| 992 | Charles Koch | India | Retail | 19.53 | L'Oreal |
| 993 | Larry Ellison | Mexico | Automotive | 75.21 | Tesla |
| 994 | Mark Zuckerberg | Mexico | Finance | 87.07 | Reliance Industries |
| 995 | Warren Buffett | USA | Retail | 142.66 | |
| 996 | Amancio Ortega | USA | Media | 166.87 | Walmart |
| 997 | Alice Walton | USA | Retail | 30.44 | Walmart |
| 998 | Amancio Ortega | Spain | Retail | 163.18 | Reliance Industries |
| 999 | Jim Walton | USA | Retail | 186.94 | Berkshire Hathaway |