dataset-fcc

Key Learnings

  1. Log everything!

FILE MANAGEMENT

  1. Avoid Dropbox at all costs!

R1D5: more #datawrangling.
PROTIP: NEVER USE DROPBOX to store/share large volumes of files to edit.
rename = delete + add; so renaming a parent folder = 25k files to be synced, a frozen macbook + notification spam for everyone.
Don’t even try loops. 😭 #100daysofCode

— Cynthia Huang (@cynthiahqy) April 19, 2018
  1. Add extra leading zeroes to file names!
    str.zfill(3)
    

DATABASE SCRAPING

  1. check pagination!
  2. debug debug test debug before running loop
  3. cache innerHTML from webdrive, then scrape data out of html later.