Data: Scrubbing, Scraping & Crawling

In the first video, “What is Artificial Intelligence,” we identified machine learning as a more appropriate title for the scope of this course. We also pointed out that machine learning operates on data to make its predictions. In this lecture we look at data: what it is and where to find it. We point out APIs on the web. We also lay the groundwork for why ML is dramatically different from rule-based programming. After this module you should be able to:

  • Identify data in the world
  • Know where to find data to generate your own data set
  • Define and locate APIs
  • Differentiate between rule-based programming and machine learning
  • Describe preprocessing as it applies to data sets
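
To make the contrast between rule-based programming and machine learning concrete, here is a minimal sketch in Python. Everything in it is an illustrative assumption rather than course material: the toy data set, the spam scenario, and the function names are all hypothetical. In the rule-based version the programmer picks the cutoff by hand; in the "learned" version the cutoff is chosen from labeled data.

```python
# Hypothetical toy data set: (message length in characters, is_spam label).
labeled = [(12, False), (18, False), (25, False), (60, True), (75, True), (90, True)]


def rule_based_is_spam(length):
    """Rule-based: the programmer hard-codes the cutoff (50 is an arbitrary choice)."""
    return length > 50


def learn_threshold(examples):
    """Very simple 'learning': try each observed length as a cutoff and
    keep the first one that makes the fewest mistakes on the labeled data."""
    candidates = sorted(length for length, _ in examples)
    best_cutoff, best_errors = None, len(examples) + 1
    for cutoff in candidates:
        # Count examples where the prediction (length > cutoff) disagrees with the label.
        errors = sum((length > cutoff) != label for length, label in examples)
        if errors < best_errors:
            best_cutoff, best_errors = cutoff, errors
    return best_cutoff


# The cutoff now comes from the data, not from the programmer.
learned_cutoff = learn_threshold(labeled)


def learned_is_spam(length):
    """Predict using the cutoff derived from the labeled examples."""
    return length > learned_cutoff
```

The two approaches can disagree on the same input: for a message of length 40, the hand-written rule says "not spam" while the learned cutoff says "spam". If the data changes, the learned cutoff changes with it, whereas the hand-written rule stays fixed until someone edits the code.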