this dir | view | cards | source | edit | dark top

Lecture

Lecture

credit

Probability, information theory

event AA as a set of basic outcomes

AΩA\subseteq\Omega

Probability, information theory

we can estimate the probability of event AA by experiment

Probability, information theory

axioms

Probability, information theory

entropy

nothing can be more uncertain than the uniform distribution

Probability, information theory

perplexity

G(p)=2H(p)G(p)=2^{H(p)}

Probability, information theory

chain rule

H(X,Y)=H(YX)+H(X)H(X,Y)=H(Y\mid X)+H(X)

Probability, information theory

coding interpretation

entropy … the least average number of bits needed to encode a message

Probability, information theory

mutual information

Noisy channel model

we try to recover the original input from a noised output

Noisy channel model

to get input

Noisy channel model

language modelling

Noisy channel model

smoothing

Noisy channel model

homework

Morphological analysis

morphological annotation

POS tags

Morphological analysis

tagsets

Morphological analysis

Czech positional tags of PDT

Morphological analysis

Penn Treebank tagset

Morphological analysis

universal POS tags (from Universal Dependencies)

noun, proper noun, verb, adjective, adverb, interjection, pronoun, determiner, auxiliary, numeral, adposition, subordinating conjunction, coordinating conjunction, particle, punctuation, symbol, unknown

Morphological analysis

ancient Greek word classes

Morphological analysis

traditional parts of speech

Morphological analysis

openness vs. closeness, content vs. function words

Morphological analysis

morphological analysis

Morphological analysis

morphological analysis vs. tagging

Morphological analysis

finite-state morphology

Morphological analysis

lexicon is implemented as a FSA (trie)

Morphological analysis

problem with phonology: baby+s → babies (not babys)

Morphological analysis

finite-state transducer (převodník)

Morphological analysis

another way of rule notation: two-level grammar

Syntactic analysis

syntactic annotation

Syntactic analysis

surface syntax

Syntactic analysis

syntactic parsers

Information Retrieval

boolean retrieval

Information Retrieval

text processing

Information Retrieval

boolean retrieval: good for experts, good for applications, not good for the majority of users

Information Retrieval

tf-idf weighting

wt,d=(1+logtft,d)logNdftw_{t,d}=(1+\log\text{tf}_{t,d})\cdot\log\frac N{\text{df}_t}

Information Retrieval

we want to somehow measure similarity between the query and the documents

Information Retrieval

evaluation

Neural Networks

representing words

Neural Networks

representing sequences

Neural Networks

Transformers

Neural Networks

LM as sequence labeling

Strojový překlad

BLEU score combines precision and recall

Strojový překlad

phrase-based machine translation

Hurá, máš hotovo! 🎉
Pokud ti moje kartičky pomohly, můžeš mi koupit pivo.