From the course: AWS Certified Machine Learning - Specialty (MLS-C01) Cert Prep: 1 Data Engineering
Unlock this course with a free trial
Join today to access over 24,100 courses taught by industry experts.
Data storage - Amazon Web Services (AWS) Tutorial
From the course: AWS Certified Machine Learning - Specialty (MLS-C01) Cert Prep: 1 Data Engineering
Data storage
- [Instructor] Here we have data sources. When an organization is implementing a data engineering strategy, an important first step is to catalog all of the different data sources in the company. You may start with a mobile application and look at the user data and analyze where it's stored. For example, it could be commonly stored in a key-value store like DynamoDB, and that user record with the information about the particular profile would be one of the sources that could be cataloged so that later the data engineers could do something with it. Likewise, if you wanted to catalog your log data, you could identify what it is, potentially in a spreadsheet that has all the other data sources in it, and you could catalog the content type, for example, HTTP requests, so how many 200s, how many 400s, what was the URL that was referring that request, those kinds of log messages. And then you could identify the location as well.…