Final

Small Language Model

Why:

Textual Entailment Recognition: given two text fragments, determine whether the meaning of one text is entailed (can be inferred) from the other text. (no neutral case in our problem)

Failure Mode: no retrival, The scoring system currently cannot select options like "None of the above". Due to evaluation.

Machine Translation

MT Problems: (1) Lexical divergences: no one-to-one mapping in word meaning (2) Structural divergences: Syntax, word order; Syntax-semantics relationship

Solution: linking words (If a word in the target frequently co-occurs with a word in the source, these will be, over several iterations, aligned with relatively greater frequency)

BLEU scores are based on token ngram overlap

chrF: A good machine translation will tend to contain characters and words that occur in a human translation of the same sentence. Correlates with human judgments quite well while being robust to tokenization difference

RNN

RNN

RNN

RNNs:

LSTM

LSTM: vanishing gradient backprop too far

Encoder-Decoder LSTM with attention

Encoder-Decoder LSTM with attention

Sequence Labeling

Closed class words: function words - short, frequent words with grammatical function

Open class words: content words: Nouns, Verbs, Adjectives, Adverbs

Sequence Labeling

Sequence Labeling

Supervised ML for Part of Speech Tagging:

Named Entity Recognition (NER): PER (person), LOC (location), ORG (organization), GPE (geo-political entity)

BIO tagging: begin, inside, outside, need 2n+1 tags for n entity types

Algorithm:

Table of Content