From the course: Complete Guide to Generative AI for Data Analysis and Data Science
Visualizing data
From the course: Complete Guide to Generative AI for Data Analysis and Data Science
Visualizing data
- [Instructor] Data visualizations are an important part of data analysis and data science, and they're tools that really fundamentally help us understand data and help us understand data quickly. They're typically used for things like showing the relationship between variables, getting an idea of what a distribution is for a particular variable, and spotting trends and other patterns in data. Now, data visualizations are important because they enable a rapid assessment of properties of the datasets and variables within those datasets, and they're also really useful for helping us communicate insights as well. Now, there are many types of visualizations. Some of the most common are used for trend analysis, and those visualizations are things like line and area charts. When we're dealing with a question of composition, pie charts and stacked bar charts are good options. If we're trying to understand the distribution of the data of a particular variable, then histograms are great, box plots, Q-Q plots, Normality, of course, is a big consideration in many statistics tests. They work well when the data's normally distributed. Sometimes we want to take a quick look at those. Histograms and Q-Q plots are good for those. Correlation, when we're trying to understand if there is a relationship between variables, such as either a positive relationship, like both variables increase together, or a negative relationship, where one increases and another decreases. So scatterplots and bubble plots can help with that. And of course, when you're looking at, like, geographic data, just mapping data on maps helps. And another type of visualization that's often used is heat maps as well. So these are different types of visualizations, and going forward, we're going to take a look at some of these different types of visualizations.
Download courses and learn on the go
Watch courses on your mobile device without an internet connection. Download courses using your iOS or Android LinkedIn Learning app.
Contents
-
-
(Locked)
Distributions of data7m 27s
-
(Locked)
Visualizing a normal distribution in a spreadsheet3m 29s
-
(Locked)
Jupyter Notebook and Colab3m 51s
-
Generating a normal distribution6m 23s
-
(Locked)
Visualizing a normal distribution in Python4m 56s
-
(Locked)
Visualizing a uniform distribution in Python3m
-
(Locked)
Visualizing a bimodal distribution in Python5m 54s
-
(Locked)
Challenge: Distributions of data40s
-
(Locked)
Solution: Distribution of data4m 7s
-
(Locked)
-
-
(Locked)
Inferential statistics4m 25s
-
(Locked)
Hypothesis testing methodology4m 17s
-
(Locked)
Analyzing customer preferences11m 20s
-
(Locked)
Type I and type II errors1m 30s
-
(Locked)
ANOVA tests for comparing means1m 55s
-
(Locked)
Generating Python scripts for ANOVA3m 45s
-
(Locked)
Testing independence of categorical variables1m 53s
-
(Locked)
Generating Python Scripts for Chi-squared tests3m 33s
-
Correlation analysis7m 12s
-
(Locked)
Testing for normality2m 25s
-
(Locked)
Generating Python for testing normality3m 46s
-
(Locked)
Generating Python for correlation analysis2m 12s
-
(Locked)
Challenge: Making inferences from data24s
-
(Locked)
Solution: Making inferences from data3m 17s
-
(Locked)
-
-
Linear regression7m 44s
-
(Locked)
Evaluating linear regression models2m 37s
-
(Locked)
Visualizing sales data1m 56s
-
(Locked)
Building a linear regression model4m 16s
-
(Locked)
Evaluating a sales linear regression model2m 46s
-
(Locked)
Challenge: Building a regression model48s
-
(Locked)
Solution: Building a regression model4m 32s
-
-
-
(Locked)
Data files4m 9s
-
Using spreadsheets with CSV files2m 43s
-
(Locked)
Reviewing an example JSON file4m 29s
-
(Locked)
Using jq with JSON files6m 23s
-
(Locked)
Generating jq commands using AI6m 1s
-
(Locked)
Dataframes in Python8m 20s
-
(Locked)
Loading CSV data into dataframes3m 44s
-
(Locked)
Loading JSON into dataframes6m 17s
-
(Locked)
Inspecting dataframes4m 12s
-
(Locked)
Data quality and data cleansing6m 28s
-
(Locked)
Using AI for data quality and data cleansing5m 6s
-
(Locked)
Challenge: Missing data35s
-
(Locked)
Solution: Missing data4m
-
(Locked)
-
-
Relational databases15m 15s
-
(Locked)
NoSQL databases10m 21s
-
(Locked)
Extraction, transformation, and loading data into databases5m 46s
-
(Locked)
Introduction to SQL5m 45s
-
(Locked)
Creating tables and inserting data8m 2s
-
(Locked)
Querying data with SQL10m 28s
-
(Locked)
Joining data with SQL6m 57s
-
(Locked)
Descriptiive statistics in SQL4m 55s
-
(Locked)
Generating synthetic data sets for a relational database7m 12s
-
(Locked)
Generating a star schema, synthetic data, and queries3m 41s
-
(Locked)
Challenge: Generate a relational data model1m 12s
-
(Locked)
Solution: Generate a relational data model4m 32s
-
-
-
Simple classification model8m 34s
-
(Locked)
Handling missing data5m
-
(Locked)
Comparing multiple algorithms6m 43s
-
(Locked)
Classification with neural networks14m 22s
-
(Locked)
Hyperparameter tuning6m 32s
-
(Locked)
Evaluating feature importance2m 24s
-
(Locked)
Challenge: Predicting consumer intent41s
-
(Locked)
Solution: Predicting consumer intent7m 26s
-
-
-
Introduction to graph theory5m 54s
-
(Locked)
NetworkX4m 27s
-
(Locked)
Analyzing a social network7m 15s
-
(Locked)
Supply chains and network analysis3m 20s
-
(Locked)
Generating a synthetic supply chain4m 5s
-
(Locked)
Visualizing a complex supply chain3m 37s
-
(Locked)
Finding highest betweenness scores4m 36s
-
(Locked)
Advanced topics in supply chain analysis6m 26s
-
(Locked)
Challenge: Analyzing a social network19s
-
(Locked)
Solution: Analyzing a social network2m 35s
-
-
-
Introduction to simulations2m 42s
-
(Locked)
Types of simulations10m 3s
-
(Locked)
Modeling inventory management7m 13s
-
(Locked)
Agent-based modeling9m 43s
-
(Locked)
Modeling the spread of infectious diseases4m 29s
-
(Locked)
Agent-base infectious diseases modeling5m 21s
-
(Locked)
Challenge: Simulating forest fires55s
-
(Locked)
Solution: Simulating forest fires5m 49s
-