Complete Guide to Generative AI for Data Analysis and Data Science
With Dan Sullivan
Liked by 116 users
Duration: 10h 21m
Skill level: Intermediate
Released: 9/27/2024
Course details
GenAI has the potential to enable many more people to work with and analyze data, but to succeed, you need a solid foundation in data management, statistics, and machine learning. This course provides that foundation. Instructor Dan Sullivan teaches how to break down business questions and data science questions into components that can be addressed programmatically and then how to use genAI to create programs and scripts to implement a solution. This course focuses on the three pillars needed to be a successful data analyst or data scientist: problem solving skills, an understanding of statistics and machine learning, and practical experience with data management procedures.
Skills you’ll gain
Earn a sharable certificate
Share what you’ve learned, and be a standout professional in your desired industry with a certificate showcasing your knowledge gained from the course.
Learning
LinkedIn Learning
Certificate of Completion
-
Showcase on your LinkedIn profile under “Licenses and Certificate” section
-
Download or print out as PDF to share with others
-
Share as image online to demonstrate your skill
Meet the instructor
Learner reviews
4.6 out of 5
The overall rating is calculated using the average of submitted ratings. Ratings and reviews can only be submitted when non-anonymous learners complete at least 40% of the course. This helps us avoid fake reviews and spam.
-
Julián González
Julián González
Blockchain Developer at Fujitsu Luxembourg
Contents
-
-
(Locked)
Distributions of data7m 27s
-
(Locked)
Visualizing a normal distribution in a spreadsheet3m 29s
-
(Locked)
Jupyter Notebook and Colab3m 51s
-
Generating a normal distribution6m 23s
-
(Locked)
Visualizing a normal distribution in Python4m 56s
-
(Locked)
Visualizing a uniform distribution in Python3m
-
(Locked)
Visualizing a bimodal distribution in Python5m 54s
-
(Locked)
Challenge: Distributions of data40s
-
(Locked)
Solution: Distribution of data4m 7s
-
(Locked)
-
-
(Locked)
Inferential statistics4m 25s
-
(Locked)
Hypothesis testing methodology4m 17s
-
(Locked)
Analyzing customer preferences11m 20s
-
(Locked)
Type I and type II errors1m 30s
-
(Locked)
ANOVA tests for comparing means1m 55s
-
(Locked)
Generating Python scripts for ANOVA3m 45s
-
(Locked)
Testing independence of categorical variables1m 53s
-
(Locked)
Generating Python Scripts for Chi-squared tests3m 33s
-
Correlation analysis7m 12s
-
(Locked)
Testing for normality2m 25s
-
(Locked)
Generating Python for testing normality3m 46s
-
(Locked)
Generating Python for correlation analysis2m 12s
-
(Locked)
Challenge: Making inferences from data24s
-
(Locked)
Solution: Making inferences from data3m 17s
-
(Locked)
-
-
Linear regression7m 44s
-
(Locked)
Evaluating linear regression models2m 37s
-
(Locked)
Visualizing sales data1m 56s
-
(Locked)
Building a linear regression model4m 16s
-
(Locked)
Evaluating a sales linear regression model2m 46s
-
(Locked)
Challenge: Building a regression model48s
-
(Locked)
Solution: Building a regression model4m 32s
-
-
-
(Locked)
Data files4m 9s
-
Using spreadsheets with CSV files2m 43s
-
(Locked)
Reviewing an example JSON file4m 29s
-
(Locked)
Using jq with JSON files6m 23s
-
(Locked)
Generating jq commands using AI6m 1s
-
(Locked)
Dataframes in Python8m 20s
-
(Locked)
Loading CSV data into dataframes3m 44s
-
(Locked)
Loading JSON into dataframes6m 17s
-
(Locked)
Inspecting dataframes4m 12s
-
(Locked)
Data quality and data cleansing6m 28s
-
(Locked)
Using AI for data quality and data cleansing5m 6s
-
(Locked)
Challenge: Missing data35s
-
(Locked)
Solution: Missing data4m
-
(Locked)
-
-
Relational databases15m 15s
-
(Locked)
NoSQL databases10m 21s
-
(Locked)
Extraction, transformation, and loading data into databases5m 46s
-
(Locked)
Introduction to SQL5m 45s
-
(Locked)
Creating tables and inserting data8m 2s
-
(Locked)
Querying data with SQL10m 28s
-
(Locked)
Joining data with SQL6m 57s
-
(Locked)
Descriptiive statistics in SQL4m 55s
-
(Locked)
Generating synthetic data sets for a relational database7m 12s
-
(Locked)
Generating a star schema, synthetic data, and queries3m 41s
-
(Locked)
Challenge: Generate a relational data model1m 12s
-
(Locked)
Solution: Generate a relational data model4m 32s
-
-
-
Simple classification model8m 34s
-
(Locked)
Handling missing data5m
-
(Locked)
Comparing multiple algorithms6m 43s
-
(Locked)
Classification with neural networks14m 22s
-
(Locked)
Hyperparameter tuning6m 32s
-
(Locked)
Evaluating feature importance2m 24s
-
(Locked)
Challenge: Predicting consumer intent41s
-
(Locked)
Solution: Predicting consumer intent7m 26s
-
-
-
Introduction to graph theory5m 54s
-
(Locked)
NetworkX4m 27s
-
(Locked)
Analyzing a social network7m 15s
-
(Locked)
Supply chains and network analysis3m 20s
-
(Locked)
Generating a synthetic supply chain4m 5s
-
(Locked)
Visualizing a complex supply chain3m 37s
-
(Locked)
Finding highest betweenness scores4m 36s
-
(Locked)
Advanced topics in supply chain analysis6m 26s
-
(Locked)
Challenge: Analyzing a social network19s
-
(Locked)
Solution: Analyzing a social network2m 35s
-
-
-
Introduction to simulations2m 42s
-
(Locked)
Types of simulations10m 3s
-
(Locked)
Modeling inventory management7m 13s
-
(Locked)
Agent-based modeling9m 43s
-
(Locked)
Modeling the spread of infectious diseases4m 29s
-
(Locked)
Agent-base infectious diseases modeling5m 21s
-
(Locked)
Challenge: Simulating forest fires55s
-
(Locked)
Solution: Simulating forest fires5m 49s
-
What’s included
- Learn on the go Access on tablet and phone