Crawling
4/34



  • Domain specific: file system, CMS, Google
  • Should indicate different fields of a document: title, description, meta-tags, body