Labs
- at least 50% of the homework points are needed to get the credit
- we will get feedback approximately a week after the deadline (or later)
- we should create the merge request before the deadline
- (it is okay to make a tiny change after the deadline if we find a bug)
- we should run the automated tests before we create the merge request
- the tests must not crash, but they do not have to pass
- first assignment
- we can switch the domain later
- but we should imagine two domains
- at least one of them should be our preferred one
- we want a task-oriented system
- anything that operates on top of a database or API
- we don't need to focus on parsing errors (or misunderstandings) in the flowchart
- we can use anything as the backend – even a local file
- second assignment
- look at the data in `data/hw2` (only the train data?)
- write a script that separates the user and the system turns (see the sketch after this list)
- some search calls (after silence) will be parts of the text
- we ignore only the lines without tabs
- don't forget to comment on the results
- we should run the tests locally
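- a minimal sketch of such a turn-splitting script (assuming each line looks like `speaker<TAB>text` and the speaker label contains "user" or "system" – the actual format in `data/hw2` may differ):

```python
def split_turns(path):
    """Sketch only: split a transcript into user and system turns."""
    user_turns, system_turns = [], []
    with open(path, encoding='utf-8') as f:
        for line in f:
            if '\t' not in line:  # ignore only the lines without tabs
                continue
            speaker, text = line.rstrip('\n').split('\t', 1)
            # assumed speaker labels; the real data may use different ones
            if 'user' in speaker.lower():
                user_turns.append(text)
            else:
                system_turns.append(text)
    return user_turns, system_turns
```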
- third assignment
- NLU, just understanding, no reply
- we will install dialmonkey using pip (see readme)
- there is a config file
- there are some components
- jupyter notebook with demo snippets
- pick one domain
- implement rule-based NLU (a sketch follows this list)
- there are restaurant examples we can look at
- we need to update our repo from the upstream
- we should create a config file – we can start with the sample config and replace the dummy NLU with our own implementation
- we should test it (write 15 test utterances)
- input on the left, dialogue act on the right
- we should separate them with a real tab character
- there can be multiple intents on the right (with `&` separating them)
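- a rough sketch of the rule-matching idea (the real component has to follow dialmonkey's dummy NLU – base class, call signature, dialogue act representation – and the patterns below are made-up restaurant examples):

```python
import re

# Made-up example rules; real ones should cover our chosen domain.
RULES = [
    (re.compile(r'\b(hi|hello|hey)\b'), 'greet()'),
    (re.compile(r'\b(cheap|inexpensive)\b'), 'inform(price=cheap)'),
    (re.compile(r'\bphone( number)?\b'), 'request(phone)'),
]

def rule_based_nlu(utterance):
    acts = [da for pattern, da in RULES if pattern.search(utterance.lower())]
    # multiple intents are joined with '&', matching the test file format
    return ' & '.join(acts) if acts else 'other()'
```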
- fourth assignment
- we won't work with our own domain; we will use the DSTC2 restaurant data
- for each sentence, there is a DA annotation (sentence-level, not token-level)
- our goal is to do sentence-level classification (no need for sequence analysis)
- idea: for each intent-slot pair, we should train a classifier (~50 classifiers total; a sketch follows this list)
- the goal is to make it work, we don't have to follow that strictly
- we should put the classification results back together into DAs
- there's an evaluation script
- then we should set it up so that we can use it to chat
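- a sketch of the one-classifier-per-pair idea in sklearn (the `decompose` helper that extracts intent-slot pairs from a DA string is hypothetical; the real label set comes from the DSTC2 annotation):

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

def train_pair_classifiers(sentences, das, pairs, decompose):
    """One binary classifier per (intent, slot) pair.
    decompose(da) -> set of (intent, slot) pairs; hypothetical helper."""
    classifiers = {}
    for pair in pairs:
        targets = [pair in decompose(da) for da in das]
        clf = make_pipeline(CountVectorizer(), LogisticRegression())
        clf.fit(sentences, targets)
        classifiers[pair] = clf
    return classifiers
```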
- fifth assignment
- belief tracker
- so far we've been filling `dial.nlu`
- now we need to fill the `dial.state` dictionary … key = slot, value = dict (value → probability):
`dial.state = {'price': {'cheap': 0.2, 'moderate': 0.5, 'expensive': 0.1, None: 0.2}, 'area': {'north': 0.5, 'east': 0.1, None: 0.4}}`
- after each turn we update the values with the probabilities from `dial.nlu` (one possible update rule is sketched after this list)
- (initially: `dial.state = {'price': {None: 1.0}, 'area': {None: 1.0}}`)
- the probabilities should sum up to one! (it's the job of the NLU to ensure that)
- in hw4 we don't need to set the probabilities (but we can do that)
- in hw5 the NLU needs to set the probabilities (in sklearn, `predict_proba`)
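- one possible update rule, sketched below (an assumption – the assignment may prescribe a different one): the observed NLU mass replaces a proportional part of the old belief

```python
def update_slot(old, observed):
    """old, observed: dicts mapping value -> probability.
    Scale the old belief down by the observed probability mass,
    then add the observed probabilities on top."""
    remaining = 1.0 - sum(observed.values())
    new = {value: prob * remaining for value, prob in old.items()}
    for value, prob in observed.items():
        new[value] = new.get(value, 0.0) + prob
    return new

state = {'price': {None: 1.0}}
state['price'] = update_slot(state['price'], {'cheap': 0.7})
# -> {None: 0.3, 'cheap': 0.7}; the probabilities still sum to one
```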
- assignments 6 & 7
- implement the rule-based policy
- we should take the instructions about the policy with a grain of salt
- we can merge hw3, but we should not delete the branch
- the database can be a CSV table, an SQLite database, … (a minimal CSV lookup is sketched below)
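- an illustration of the CSV option (the file name and columns are made up): take the most probable value of each slot from `dial.state` and filter the rows

```python
import csv

def query_db(state, path='restaurants.csv'):  # made-up file name
    # most probable value per slot; drop slots whose best value is None
    constraints = {slot: max(values, key=values.get)
                   for slot, values in state.items()}
    constraints = {s: v for s, v in constraints.items() if v is not None}
    with open(path, newline='', encoding='utf-8') as f:
        return [row for row in csv.DictReader(f)
                if all(row.get(s) == v for s, v in constraints.items())]
```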
- hw2
- how to count the bigrams? (one way is sketched below)
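- one simple way (whether to add sentence-boundary tokens is up to the assignment spec):

```python
from collections import Counter

def count_bigrams(turns):
    """turns: list of strings; returns a Counter of (word, word) pairs."""
    counts = Counter()
    for turn in turns:
        tokens = turn.split()
        counts.update(zip(tokens, tokens[1:]))
    return counts
```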
- assignment 8
- we don't need to optimize the number of used templates (in the third requirement) – if we do not find a match, we can go one by one (see the sketch below)
- load the upstream changes into the repository; there is a reference implementation we can use
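- a sketch of the go-one-by-one fallback (the templates are made-up examples): try the templates in order and use the first one whose slots we can fill

```python
TEMPLATES = [  # made-up examples, ordered from most to least specific
    (('name', 'price'), '{name} is a {price} restaurant.'),
    (('name',), 'How about {name}?'),
]

def generate(slots):
    for required, template in TEMPLATES:
        if all(slot in slots for slot in required):
            return template.format(**slots)
    return "I'm sorry, I can't help with that."

print(generate({'name': 'Golden Dragon', 'price': 'cheap'}))
# -> Golden Dragon is a cheap restaurant.
```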