Crawling
4/25
Domain specific: file system, CMS, Google
Should indicate different fields of a document: title, description, meta-tags, body