- information retrieval
- deals with the representation, storage, and organization of information items, and with access to them
- is finding material of an unstructured nature that satisfies an information need from within large collections
- differences between databases and information retrieval
- data are strongly structured, whereas documents are not
- documents are meant for human interaction, data are meant for the computer
- central elements of IR: documents, information need, satisfaction of the user
- information definition … what a user gets from documents using their own knowledge
- document definition … a trace that one human sends to another
- we have a very large set of documents and a very imprecise information need, and we need to satisfy that need very quickly
- the system needs to know the format of the document to make sense of it
- main dimensions of documents
- physical (form) – to read a digital document, you need suitable tools
- meaning (content) – context is part of the meaning construction
- social (medium) – there's an intention behind every document
- document forms
- media
- text (types: book, article, letter)
- image (types: x-ray, photograph, graphic)
- video
- structured documents
- documents can consist of different documents
- user query
- user's information need expressed using a predefined language
- satisfaction
- system should be simple to use
- must give the best possible answers, i.e. relevant ones
- must return a reasonable number of answers
- must be fast
- other factors to take into account
- user's expertise
- temporal and geographical need
- relevance
- on the document side: document, surrogate, information
- on the user side: information need, request, query
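
The central elements above (documents, a query expressing the information need, relevant answers in a reasonable quantity) can be sketched roughly as follows. This is only a minimal illustration, not a method taken from these notes: the names `tokenize`, `score`, and `search`, the sample documents, and the use of raw term overlap as a stand-in for relevance are all assumptions made for the example.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def score(query, document):
    """Assumed relevance proxy: total occurrences of the query terms in the document."""
    doc_counts = Counter(tokenize(document))
    return sum(doc_counts[term] for term in set(tokenize(query)))


def search(query, documents, k=2):
    """Return at most k (doc_id, score) pairs with a positive score, best first."""
    scored = [(doc_id, score(query, text)) for doc_id, text in documents.items()]
    matches = [pair for pair in scored if pair[1] > 0]
    # Sort by descending score, then by id so that ties are ordered deterministically.
    matches.sort(key=lambda pair: (-pair[1], pair[0]))
    return matches[:k]


# Hypothetical collection; here each "document" is just a short text surrogate.
documents = {
    "d1": "x-ray image of a broken arm",
    "d2": "letter about a broken window",
    "d3": "photograph of a sunset",
    "d4": "article on broken arm treatment and arm exercises",
}

results = search("broken arm", documents, k=2)
print(results)
```

Unlike an exact-match database lookup, the answer here is a ranked list cut off at `k`, which reflects the requirements that answers be relevant and that their number stay reasonable. A real system would replace the overlap count with a proper relevance model.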
- IR context