Crawling
4/35



  • Domain specific: file system, CMS, Google
  • Should indicate different fields of a document: title, description, meta-tags, body